1️⃣ Question 1

Why run an unsupervised model when no diagnoses are available?

Explanation:
Unsupervised learning finds structure in data without labels, so it fits cases where no diagnoses exist. It can uncover natural patient groups that no one has defined in advance.

2️⃣ Question 2

How does hierarchical clustering (dendrogram) help determine the number of clusters?

Explanation:
A dendrogram shows the distance at which clusters merge. Cutting the tree where the merge distance jumps sharply gives a natural number of clusters.
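
As a rough sketch of the cutoff idea, the snippet below builds the merge tree with SciPy and picks the cut at the largest jump in merge height. The three-blob data, the Ward linkage, and the "largest gap" rule are assumptions chosen for illustration, not the only way to read a dendrogram.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# Synthetic data (an assumption for illustration): three well-separated blobs.
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]])
X = np.vstack([c + rng.normal(scale=0.5, size=(20, 2)) for c in centers])

# Each row of Z records one merge; column 2 is the height of that merge,
# which is the vertical position of the join in a dendrogram.
Z = linkage(X, method="ward")
heights = Z[:, 2]

# The largest jump between consecutive merge heights suggests where to cut.
gaps = np.diff(heights)
i = int(np.argmax(gaps))
n_clusters = len(X) - (i + 1)  # clusters left after merges 0..i
cut_height = (heights[i] + heights[i + 1]) / 2

# Cut the tree at that height to get flat cluster labels.
labels = fcluster(Z, t=cut_height, criterion="distance")
print("suggested clusters:", n_clusters)
print("clusters after cut:", len(np.unique(labels)))
```

To see the tree itself, `scipy.cluster.hierarchy.dendrogram(Z)` draws it when matplotlib is installed.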

3️⃣ Question 3

What does a K-means centroid represent?

Explanation:
A centroid is the mean of all points in a cluster.
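
A minimal sketch of that definition, assuming four made-up 2D points and a fixed cluster assignment (both are illustrative, not data from the quiz):

```python
import numpy as np

# Toy data (assumed for illustration): four 2-D points and their cluster labels.
X = np.array([[1.0, 2.0], [3.0, 4.0], [10.0, 10.0], [12.0, 14.0]])
labels = np.array([0, 0, 1, 1])

# Each centroid is the coordinate-wise mean of the points in that cluster.
centroids = np.array([X[labels == k].mean(axis=0) for k in range(2)])
print(centroids)
```

K-means repeats this averaging step after every reassignment of points to their nearest centroid, until the assignments stop changing.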

4️⃣ Question 4

Why is DBSCAN good for detecting unusual activity?

Explanation:
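DBSCAN groups points in dense regions into clusters of arbitrary shape and labels points in sparse regions as noise, so unusual activity shows up as outliers.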
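
The sketch below shows how noise points surface with scikit-learn's `DBSCAN`, which labels them `-1`. The synthetic "activity" data and the `eps` and `min_samples` values are assumptions for illustration and would need tuning on real data.

```python
import numpy as np
from sklearn.cluster import DBSCAN

# Synthetic data (an assumption for illustration): one dense group of
# "normal" activity plus two isolated points far from it.
rng = np.random.default_rng(0)
normal = rng.normal(loc=0.0, scale=0.3, size=(50, 2))
unusual = np.array([[5.0, 5.0], [-6.0, 4.0]])
X = np.vstack([normal, unusual])

# eps is the neighborhood radius; min_samples is the density threshold.
labels = DBSCAN(eps=0.5, min_samples=5).fit_predict(X)

# Points that belong to no dense region get the label -1 (noise).
outlier_idx = np.where(labels == -1)[0]
print("indices flagged as noise:", outlier_idx)
```

Points on the sparse edge of the dense group may also be flagged, depending on `eps`.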

5️⃣ Question 5

Why is t-SNE used for 2D scatter plots?

Explanation:
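t-SNE preserves local neighborhood structure: points that are similar in the high-dimensional space stay close together in the 2D plot, which reveals natural groupings.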
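
A small sketch of producing such a 2D embedding with scikit-learn's `TSNE`. The 10-dimensional three-group data and the `perplexity` value are assumptions for illustration.

```python
import numpy as np
from sklearn.manifold import TSNE

# Synthetic data (an assumption for illustration): three groups in 10-D.
rng = np.random.default_rng(0)
groups = [rng.normal(loc=c, scale=0.5, size=(20, 10)) for c in (0.0, 5.0, 10.0)]
X = np.vstack(groups)

# perplexity roughly sets the neighborhood size; it must be below n_samples.
tsne = TSNE(n_components=2, perplexity=15, random_state=0)
embedding = tsne.fit_transform(X)

# Each row is now an (x, y) position that can go straight into a scatter plot.
print(embedding.shape)
```

Plotting `embedding[:, 0]` against `embedding[:, 1]` gives the 2D scatter plot.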

6️⃣ Question 6

Why is PCA suitable for environmental factor research?

Explanation:
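PCA projects correlated variables onto a few principal components that retain most of the variance, so many environmental measurements can be summarized by a handful of key factors.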
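
To make "retaining most of the variance" concrete, here is a minimal PCA sketch using NumPy's SVD. The five synthetic measurements driven by two hidden factors are an assumption for illustration.

```python
import numpy as np

# Synthetic data (an assumption for illustration): 5 measured variables
# that are really driven by 2 hidden factors plus a little noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 2))
mixing = rng.normal(size=(2, 5))
X = latent @ mixing + 0.05 * rng.normal(size=(200, 5))

# PCA: center the data, then take the SVD of the centered matrix.
Xc = X - X.mean(axis=0)
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)

# Squared singular values are proportional to the variance per component.
explained_ratio = S**2 / np.sum(S**2)

# Project onto the first two principal components.
scores = Xc @ Vt[:2].T
print("variance share per component:", explained_ratio.round(3))
print("reduced shape:", scores.shape)
```

Here the first two components should carry nearly all of the variance, since only two factors generate the data. `sklearn.decomposition.PCA` offers the same decomposition through `fit_transform` and `explained_variance_ratio_`.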

7️⃣ Question 7

What advantage does t-SNE offer for visualizing user interactions?

Explanation:
t-SNE produces a low-dimensional (2D or 3D) embedding in which users with similar interaction patterns appear close together.

Q	Correct Answer
1	Uncover natural patient groups
2	Visualize similarity levels with dendrogram
3	Centroid = average of cluster
4	DBSCAN detects clusters + outliers
5	t-SNE preserves neighborhood similarities
6	PCA reduces data to key components
7	t-SNE forms meaningful 2D/3D clusters