Thematic Clustering and Classification of Research in Digital Library Perspectives (2000–2024): A Machine Learning Approach
| dc.contributor.author | Mazumder, Sourav | |
| dc.contributor.author | Barui, Tapan | |
| dc.date.accessioned | 2026-01-24T15:08:00Z | |
| dc.date.available | 2026-01-24T15:08:00Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | The purpose of this study is to identify and classify research themes from the journal “Digital Library Perspectives” (2000–2024) using k-means clustering and machine learning–based classification models. Bibliographic data were retrieved from Dimensions (n = 715). Especially, abstracts were considered for analysis. The results show the trends of research publication in the journal with an annual average of 28.6. Cluster analysis reveals five clusters, and “Digitization and Metadata” emerged as the top cluster in the dataset. The cluster remained dominant throughout the years. SVM is recognized as the most effective model in terms of classifying clusters. Additionally, the confusion matrix has been included to explore correct classifications and misclassifications made by the classifiers. The study’s results are unique and offer implications for librarians, researchers, and policymakers. | |
| dc.identifier.isbn | 978-81-990642-0-1 | |
| dc.identifier.uri | http://gbm.ndl.gov.in/handle/123456789/212 | |
| dc.publisher | Indian Institute of Technology Kharagpur | |
| dc.subject | SOCIAL SCIENCES::Statistics, computer and systems science::Informatics, computer and systems science | |
| dc.subject | SOCIAL SCIENCES::Other social sciences::Library and information science | |
| dc.title | Thematic Clustering and Classification of Research in Digital Library Perspectives (2000–2024): A Machine Learning Approach | |
| dc.type | Article |