This project processes and clusters text data using BERT embeddings, K-means, and dimensionality reduction. Visualizations include t-SNE plots and word clouds. Dataset and embeddings links are provided.
nlp pytorch transformer pca-analysis text-clustering kmeans-analysis bert-embeddings wordcloud-visualization textprocessing t-snes sihouette-score
-
Updated
Sep 2, 2024 - Jupyter Notebook