Trend analysis and forecasting of publication activities by Indian computer science researchers during the period of 2010–23

Huge collections of published research documents are available in various repositories in Indian universities and research organizations. The efficient retrieval, estimation of research trends, and identification of research gaps with the global trend in different research areas, can be a great guid...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems 2022-12, Vol.39 (10), p.n/a
Hauptverfasser: Kathiria, Preeti, Arolkar, Harshal
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Huge collections of published research documents are available in various repositories in Indian universities and research organizations. The efficient retrieval, estimation of research trends, and identification of research gaps with the global trend in different research areas, can be a great guiding tool for regulating research in the appropriate direction. This research attempts to analyse the trend of research activities carried out by Indian researchers in the discipline of Computer Science from 2010 to 19 and forecast the upcoming research trend for the years 2020–23. A repository of the s based on domains given in the Computer Science Ontology (CSO) published by the Indian researchers is developed from the Scopus database. Document Index Graph document representation model is used to store the repository, shared phrases across the documents are extracted and phrase‐based similarity is computed. Combining the Single‐term and Phrase‐based similarity, the hybrid similarity is generated and similar documents are clustered using the DBSCAN clustering technique. Topics are identified for each cluster using the Latent Dirichlet Allocation algorithm and are automatically labelled using CSO. For each topic, the trend analysis and forecasting have been done using the Auto‐Regressive Integrated Moving Average. For the assessment of the forecasting performance, the dataset from 2010 to 17 is used as a training dataset and 2018–19 as a testing dataset. The average forecasting for the year 2018 for all CSO domains belongs to the Good forecasting category with Mean Absolute Percentage Error (MAPE) 18.34, and 2019 shows reasonable forecasting with MAPE 30.20 as per the MAPE interpretation given by Lewis. For each topic the average forecasting for years 2018–19 shows either Highly accurate, Good or Reasonable forecasting. As a result, the top four domains for the years 2020–23 are also identified which can help initial researchers in the identification of a relevant topic for research and exploration.
ISSN:0266-4720
1468-0394
DOI:10.1111/exsy.13070