When you navigate to the
Embeddings view of the Lightly Platform, you can view the embeddings of all the samples in your dataset in a 2d representation.
On the top of the view, you can choose between three well-known dimensionality reduction methods.
All the samples of your active tag are shown in the scatterplot. This allows you to
pan, by holding your mouse down and moving around
zoom, by using your mouse wheel
Lightly uses different dimensionality reduction methods to get a 2d representation of your data.
- PCA: Principal Component Analysis (PCA) is a dimensionality reduction method that is often used to reduce the dimensionality of large datasets so they can be visualized in 2d space. The idea of PCA is simple — reduce the number of variables of a dataset while preserving as much information as possible. For a more thorough explanation, please refer to the PCA article on Wikipedia
- TSNE: t-distributed stochastic neighbor embedding (t-SNE or TSNE) is a dimensionality reduction method that is often used to reduce the dimensionality of large datasets so it can be visualized in 2d space. The idea of TSNE is that nearby points model similar objects, and dissimilar objects are modeled by distant points with high probability. For a more thorough explanation, please refer to the TSNE article on Wikipedia
- UMAP: Uniform Manifold Approximation and Projection (UMAP) is a novel manifold learning technique for dimension reduction. The result is a practical, scalable algorithm that applies to real-world data. The UMAP algorithm is competitive with t-SNE for visualization quality and arguably preserves more of the global structure with superior run-time performance. For a more thorough explanation, please refer to the UMAP article on Wikipedia
It's also possible to use the values of a third metadata to either colorize the samples by projecting the range of values on them or categorizing them to show/hide them as legends.
Once interesting clusters or outliers are detected, selecting them for further inspection is possible.
You can select a cluster by pressing the
shift key and holding the mouse down simultaneously. This will activate the lasso modus, with which you can select the samples of interest by drawing a circle.
If you have multiple clusters you want to inspect simultaneously, you can press the
alt keys while holding the mouse down to select multiple regions.
Updated over 1 year ago