Original texts and analysis code are not provided(contact me if needed).
- Embedding Model: pre-trained SBERTs, OpenAI embedding models (models can be changed depending on the setting)
- Clustering: HDBSCAN
- Topic Representation: Title-based LLM summarization