document vectors for clustering of large document sets
Recently I have finished the work on understanding what exactly is the content of the GEOSS global data infrastructure. Many partial approaches were shown in previous posts and this one concludes the outcome. For all the 1.8M metadata records from GEOSS we calculated document vectors and tried to run cluster detection with cosine similarity as … Read moredocument vectors for clustering of large document sets