Thematic Clustering and Classification of Research in Digital Library Perspectives (2000–2024): A Machine Learning Approach

dc.contributor.authorMazumder, Sourav
dc.contributor.authorBarui, Tapan
dc.date.accessioned2026-01-24T15:08:00Z
dc.date.available2026-01-24T15:08:00Z
dc.date.issued2025
dc.description.abstractThe purpose of this study is to identify and classify research themes from the journal “Digital Library Perspectives” (2000–2024) using k-means clustering and machine learning–based classification models. Bibliographic data were retrieved from Dimensions (n = 715). Especially, abstracts were considered for analysis. The results show the trends of research publication in the journal with an annual average of 28.6. Cluster analysis reveals five clusters, and “Digitization and Metadata” emerged as the top cluster in the dataset. The cluster remained dominant throughout the years. SVM is recognized as the most effective model in terms of classifying clusters. Additionally, the confusion matrix has been included to explore correct classifications and misclassifications made by the classifiers. The study’s results are unique and offer implications for librarians, researchers, and policymakers.
dc.identifier.isbn978-81-990642-0-1
dc.identifier.urihttp://gbm.ndl.gov.in/handle/123456789/212
dc.publisherIndian Institute of Technology Kharagpur
dc.subjectSOCIAL SCIENCES::Statistics, computer and systems science::Informatics, computer and systems science
dc.subjectSOCIAL SCIENCES::Other social sciences::Library and information science
dc.titleThematic Clustering and Classification of Research in Digital Library Perspectives (2000–2024): A Machine Learning Approach
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
kedld2025_paper.pdf
Size:
1.03 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:
Collections