Highly Cited Publications Output by India in Computer Science 1996-15: A Scientometric Assessment

Gupta and Dhawan: Highly Cited Publications Output by India in Computer Science 1996-2015: A Scientometric Assessment

Authors

INTRODUCTION

India is making a strong commitment for ICT based digital economy and emerging as a global player in providing world class technology solutions and business services. As one of global top ranking IT hubs, India is aggressively focusing on IT exports, foreign direct investment in IT industry. As of 2015 India’s IT exports accounted for 4.26 per cent of the global market. India’s core competencies and strengths in IT software and hardware have attracted significant Foreign Direct Investment (FDI) worth US$ 21.02 billion between April 2000 and March 2016.[1] The Digital India initiative has given IT a key position inside and outside the country. The adoption of key technologies across sectors spurred by the ‘Digital India Initiative’ could help boost India’s Gross Domestic Product (GDP) by US$ 550 billion to US$ 1 trillion by 2025.[1] The Digital India program seeks to transform India into a digitally empowered society with emphasis on e-governance, e-retail, e-utility, e-education, telemedicine and mobile healthcare services and making the governance more participative.

India’s internet economy is expected to touch Rs 10 trillion (US$ 146.72 billion) by 2018, accounting for 5 percent of the country’s GDP. India’s internet user base has reached over 400 million by May 2016, the third largest in the world, while the number of social media users grew to 143 million by April 2015 and smartphones grew to 160 million. Both large and small and medium scale enterprises in IT industry are finding lucrative opportunities for investment. Given these developments, the IT industry has created significant demand and growth in the Indian academic and government sector, especially for engineering and computer science.[1] In a time of change towards digital economy, the country needs to restructure and reshape its R&D research base in ICT sector covering critical areas such as cyber security, computing systems and architectures, network infrastructures, software engineering and data management, digital content technologies, and human-technology interfaces.Keeping this in context, it is important to examine Indian research in computer sciences and its global impact. The paper attempts to address this issue by examining India’s highly cited papers in computer science.

Literature Review

Singh, Pramanik and Chakraborty examined comparative research trends in computer science and noted that Indian researchers tend to collaborate with researchers outside of India whereas Chinese tend to work among themselves. They also studied temporal evolution of the collaborative pattern in computer science and shift in research topics defining computer science (CS) research domain.[2] Das and Karanjaianalyzed 1408 research papers contributed by the Indian scientists during 1991-2000on computer science. They reported that a few institutions, like IITs (located at Kharagpur, Kanpur, Delhi, Chennai and Mumbai), Indian Statistical Institute at Kolkata and Indian Institute of Science (IISc) at Bangalore dominate in computer science research field and accounted for the largest publication share in the country. They concluded that India has potential of carrying out computer science research of international standard.[3] Gupta, Kshitij and Vermaana-lyzedcomputer science research published by India during 1999-2008 to understand the comparative status of the country in computer science vis-à-vis countries like China, South Korea, Taiwan and Brazil.[4] Gupta, Kshitij, Singh analyzed computer science research published by India during 1999-2008 to discover most productive institutions, authors, and high-cited papers in the country in computer science.[5] Singh, Uddin and Pinto in a scientometric and text based analysis of computer science research output data indexed in Scopus during the last 25 years period (1989-2013) sought to identify characteristic similarities and differences in CS research landscape of Indian institutions vis-à-vis world institutions. Uddin and Singh conducted a scientometric and keyword-based analysis of computer science research published by SAARC countries during the last 25 years. The study mapped publications trends to demographic and economic indicators of the SAARC countries, and presented inferences useful for determining guidelines for funding patterns and policy formulation for scientific research in CS domain. Singhal, Banshal, Uddin, and Singhconducted a scientometrics and text-based analysis of computer science research published by India during the last 25 years. The study presented the status of computer science research in India and identified thematic trends in CS domain. Gupta, Bala and Sharma analyzed publications output of India in computer science published during 1999-2008. The study ranked the most productive Indian institutions covering institutes of national importance, universities/ deemed universities, industrial enterprises, research institutes, Indian Institute(s) of Information Technology (IIIT(s)), select engineering colleges, and regional engineering colleges(RECs)/ National Institutes of Technology (NITs) in computer science research by using a series of publications and citation indicators.

It is evident from above that studies that have so far been undertaken so far have sought to focus their attention mainly on quantitative and qualitative analysis of computer science research in India, and not as much on high cited qualitative research. This study instead would seek to address the qualitative dimension of computer science research in the country, analyze highly cited papers in computer science published by India during 1996-2015 on a series of bibliometric indicators with the aim to understand the shift in the quality of research output in computer science over time.

METHODOLOGY

The study derived data on highly-cited papers from the Scopus, an international multidisciplinary bibliographical and citation database as on August 2016 and covered the period from 1996-2015. A highly-cited article (TC2015 ≥ 100) was defined as an article registering at least 100 citations since its publication up to August 2016. In all a total of 406 India’s highly cited articles in computer science received at least 100 citations since publication. The journal impact factor (IF) data used in this study based on the JCR 2013.

The study organized publication and citation data into various groups such as (i) first author publications (FP), (ii) corresponding author publications (RP), (iii) the number of citations since publication to 2015 referred to as TC2015, (iv) citations received in the year of publications (C0), (v) citations in the first year after publication (C1), (viii) the number of citations received in year 2015 is referred as C2015, (vi) national and international collaborative publications, and (vii) most productive journals etc.The data was tabulated to determine the quantum of research by publication year, global share of research, research quality, life cycle of research publications, and contribution of different types of Indian authors and organizations in computer science. Indian organizations have been classified into groups such as; (i) institutes of national importance, (ii) research institutes, (iii) universities, (iv)colleges, (v) engineering colleges, (vi) medical and allied colleges, (vii) industrial enterprises and (viii) non-profitable institutions.

The collaboration type was determined from institutional address data of the authors. An article could be either a single-country article, in which all authors’ addresses are from the same country, or bilateral or multilateral collaborative article, co-authored by researchers from 2 or more countries. In a single author article where authorship is unspecified, the single author is presumed both as first author and corresponding author. Similarly, in an article by authors from a single institution, the institution is classified both as the first author institution and the corresponding author institution. In addition, only the first affiliation of corresponding author was considered when the authorship had multiple affiliations.

At the individual level, a non-alphabetical name order sends a clear signal to the market that the author who is listed first actually contributed more. The first author is the person who contributed most to the work and writing of the article.[6] The corresponding author is perceived as the author contributing significantly to the article independent of the author position.[7] The corresponding author supervised the planning and execution of the study and the writing of the paper.[8] It is generally assumed that the first author and the corresponding author play significant roles, and they are acknowledged as the major contributors to a research paper. Thus, in this research, a newly developed indicator as suggested by Chuang and Ho,[10] the MCI, was used to assess the extent to which a researcher or an institution contributed to publishing an article. The MCI is calculated as the sum of first-author articles and corresponding articles divided by 2-times the total number of articles. It implies the percentage of instances one takes on the leadership role (first author or corresponding author) out of the total possible available opportunities. The equation is:

MCI=(FP=RP)/2TP,

Where FP is the number of first-author articles, RP is the number of corresponding-author articles, and TP is the number of total articles. When the MCI = 0, there is no first- or corresponding-author article. When the MCI = 1, all articles are either first- or corresponding author articles. MCI has two implications. First, it probably indicates a higher capability or productivity in conducting independent research. Second, it could, as well, indicate a more prominent role in collaborations. On the contrary, a low MCI is probably a sign of heavy reliance on collaboration, as well as relying on others to provide a leadership role in conducting research.[9]

OBJECTIVES

The main objective of this scientometric study is to understand the current status of computer science research in India and to know how has the country changed/improved in its quality and impact of research in IT duringthe past 20 years covering the period between 1996 and 2015. The study will identify highly cited papers in India’s computer science research and to analyze them for their publication and citation distributions in order to understand what role contributing authors, research organizations as well as international collaborative countrieshave played in Indian computer science research. In particular, the study will focus on (i)lndia’s world share of highly cited papers, (ii) distribution of highly cited papers by publication year, publication mode, and contributing authors and organizations, (iii) comparative share of highly cited papers from stand-alone single institutions and collaborating institutions; (iv) comparative share of highly cited papers from bilateral or multilateral collaboration, and (v) characteristics of top 10 publications on select bibliometric indicators.

ANALYSIS, RESULTS AND DISCUSSION

Publications Analysis

India contributed a total of 406 highly cited papers in computer science in a period 20 years during 1996-2015, accounting for0.32% world share (world output =126129 papers).As per data sourced from Scopus database, India’s count of highly cited papers in computer science changed on year-to-year basis down from 21 in 1996, the first year of this study period to 3 papers in 2015, the last year of this study period. Papers cited at least 100 times since their publication were considered as highly cited papers (HCPs) in this study.

Highly cited papers by India in computer science were published in bulk as research articles (325, 80.05% share) followed by conference papers 41 (10.10%) paper each as conference papers, 34 (8.37%) as reviews, 3 (0.74%) as books, 2 (0.49%) as letters and 1 (0.25%) as editorial during 1996-2015. As expected, review papers comparatively registered the highest citation impact per paper (363.59), followed by conference papers (323.88), books (294.67%), articles (239.46), letters (124.50) and editorial (117.0) during 1996-2015(Table 1).

Citations Analysis

India’s highly cited papers (406) in computer science cumulated 14059 citations in 20 years during 1996-Aug 2016, averaging to 257.92 citations per paper (CPP) in 20 years period, with annual CPP ranging between 108.33 (lowest in 2015) and 884.89 (highest in 2002). In this study, citations to papers have been counted from their publication year till August 2016; hence citation window in this study is not constant but variable varying from 1–20 years since publication. Citations to papers are treated as a proxy for describing the quality of research, to judge how highly cited papers inter-compare on quality and impact. CPP as such is not a valid indicator for comparing highly cited paperssince their citation window periodsare variable, varying from 1-20 years since publication. For comparative citation performance this study used another indicator – ‘citation density’. It computes citation density as a ratio: ‘citations per paper in a given year’ divided by ‘corresponding citation window period as # of years’. For example, ‘citation density’of 19 highly cited papers – published in the year 2002–comes as 63.21 citations per paper per citation year (16813/19/14) = 63.21. The citation density of 406 highly cited papers averaged to 12.90 citations per paper per citation year.

Figure 1:

Citation Density of Highly Cited Papers in Computer Science: 1996-2015.

https://s3-us-west-2.amazonaws.com/jourdata/jscires/JScientometRes-6-2-74-g001.jpg
Figure 2:

Citation Life Cycle of Top 10 Highly Cited Papers in Computer Science by Publication Year: 1996-2015

https://s3-us-west-2.amazonaws.com/jourdata/jscires/JScientometRes-6-2-74-g002.jpg

Citation density of highly cited papers in computer science research in India registered high rise during 2003-15 up from 17.66 in 2003 to 108.33 CPP/CPYin 2015. Comparatively, citation density during the first six years of this study, 1996-2002 registered moderate rise up from 20.79 in 1996 to 62.21 in 2002. Citation density was the highest with 108.33 CPP/PCY in the year 2015, the smallest with 11.59 CPP/PCY in the year 1998. High citation density data during 2010-15 implies that computer science research in India has indeed gained significant jump in quality and impact. Keeping in view the variability of citation window, we need to interpret the results with caution.(Figure 1)

The citation spectrum of highly cited papers is wide spreading across from 100 to 884.89 citations per paper on one end and 3148-12244 citations per paper on the other. The bulk of highly cited papers (71.43% share) correspond to papers in citation range 100-198 per paper. Only less than 3% highly cited papers correspond to top end citation range 841-12244 citations per paper. This shows that distribution of highly cited papers is highly skewed (Table2).

Distribution of Highly Cited Papers Output by Contributing Authors

Authorship to 406 highly cited papers varied widely from 1 to 150 authors per paper with an average of 3.86 authors per paper. Sole authorship was limited to 7.14% outputs, joint authorship to 36.45% output, and multiple authorship to 56.40% output. Multiple-authorship (3 -150 authors per paper) in computer science research is increasingly becoming the mode. Of the total of 1567 authors to 406 highly cited papers, 29 contributed one paper each in 20 years, 296 contributed two papers each,330 contributed three papers each, 240 contributed 4 papers each, 22contributed5 papers each, 72 contributed 6 papers each, 49 contributed 7 papers each, and 32 contributed 8 papers each. It shows that frequency distribution of authors of highly cited papers in computer science is not significant. (Table 3).

Top 25 Contributing Authors in Computer Science Research

The major contribution index MCI index varied from 0.0 to 1.0 for the top 25 highly cited authors, authors having citation of at least 3 citations per paper. MCI greater than 0.5 indicates that the author has high potential to conduct research independently, contribute to research productivity significantly, or play more prominent role in research collaboration. On the contrary low MCI is a sign of heavy reliance on others to play leadership role in conducting research or in research collaboration. In this study no correlation was found between their rank order and MCI index. In other words MCI is independent of size of contributions by authors. Table 4 is further subset of top 12 authors with at least 5 citations per paper.

Table 1:

Distribution of Highly Cited Papers by India across Publication Types: 1996-2015

Type of PublicationTPTCCPP
Articles32577824239.46
Reviews3412362363.59
Conference Paper4113279323.88
Books3884294.67
Letter2249124.50
Editorial1117117.00
Total406104715257.92

[i] TP=Total papers; TC=Total citations; CPP = Average citation per paper

Table 2:

India’s Highly Cited Papers in Computer Science by Citation Frequency Range: 1996-2015

Citation RangeNo of Highly Cited PapersTotal CitationsPublications ShareCitations Share
100-1982903809871.4336.38
200-297481173411.8211.21
307-3692685736.408.19
402-58923108585.6710.37
607-7741066582.466.36
841-944217850.491.70
1079-1596455300.995.28
3148-122443214790.7420.51
Total406104715

Of the top 25 authors, only three authors, namely P.Sarasu, A.Kumar and B.Singh registered the highest MCI value of 1 despite ranking low in publication count rank at 9, 19 and 20. The other significant authors with high MCI values were S. Vaidyanathan (MCI=0.952, Publication Rank=1), S.Mitra (MCI=0.875, Publication Rank=7), K.Deb (MCI=0.727, Publication Rank=2), N. Garg (MCI=0.667, Publication Rank=2), H. Singh (MCI=0.667, Publication Rank=22), A. Jain (MCI=0.667, Publication Rank=23), etc. The contributing authors high impact included K. Deb (CPP = of 1528 Publication Rank=2), followed by C. Bhattacharyya and K R K Murthy (CPP=431.33 each and Publication Ranks= 24 and 25).

Top 10 Organizations in Computer Science Research

In all 898 organizations (148 Indian and 750 foreign) had contributed to 406 highly cited papers in computer science research in India during 1996-2015. Of the 148 Indian research organizations, only 44 were comparatively more productive, with each contributing 2 to 46 highly cited papers in computer science during 1996-2015. The other 104 were not so good productivity organizations, contributing just one publication each during the same period.

Institutes of National Importance dominated the publications output of highly cited papers in computer science with largest share (60.10%, 244 papers). Also in terms of citations per paper, institute of national importance registered the highest citation impact of 302.65, followed by non-profit and others (301.45) and engineering colleges (161.09) (Table 5).

In terms of citation per paper, Indian Institute of Technology, Kanpur registered the highest citation impact of 734.15 (with Publication rank of 6), followed by Indian Institute of Science, Bangalore (CPP=383.48 and Publication Rank=3). In terms of MCI, Mepco Schlent Engineering College, Sivasaki and Dr S.R.University, Chennai registered the highest value of 1.0 each with publication rank 19 and 5, followed by Jadavpur University and Kolkata (MCI=0.96 and Publication Rank=9).

Table 3:

Author Productivity in Computer Science in India: 1996-2015

Authors per publicationNo. of PublicationTotal Authorship
12929
2148296
3110330
460240
522110
61272
7749
8432
9327
11111
12224
13113
15230
18118
24124
32132
80180
1501150
Total4061567

Of the 148 Indian organizations contributing to computer science research, 39 were industrial enterprises, 33 universities & colleges, 32 engineering colleges, 27 research institutes, 12 institutes of national importance and 5 nonprofit and other organizations. Among the 406 highly cited papers, 67 resulted from contribution with single organization each, 129 papers with 2 organizations each, 58 papers with 3 organizations each, 30 papers with 4 organizations each, 5, 6, 7 and 8 papers each with 11, 7, 4, 6 and 5 organizations, and more than 9 papers by 1, 2 and 3 organizations respectively (Table 6).

Collaboration in Highly Cited Papers

Of all the 406 highly cited publications in computer science, 130 resulted from co-authors from the same single parent organization (no-collaboration), 67 from national collaboration with co-authors/multiple authors from Indian organizations (national collaborative publications), and 209 from international collaboration with co-authors/ multiple authors from Indian and foreign organizations (international collaborative publications). Contrary to expectations, single institution publications had scored higher citation impact with 281.08 citations per publication compared to International collaborative publications with 257.92citations per publication and national collaborative publications with 185.69citations per publication (Table 7).

Table 4:

Highly Cited Authors in Computer Science Research in India: 1996-2015

S.NoName of the AuthorAffiliationTPTCCPPFP-RPFPRPMCI
1S. VaidyanathanDr S.R.University, Chennai212591123.381920.952
2K. DebIndian Institute of Technology, Kanpur11168081528.080.727
3S. BandyopadhyayIndian Statistical Institute, Kolkata1192183.734210.500
4V. SundarapandianDr S.R.University, Chennai91259139.8940.444
5S. DasJadavpur University, Kolkata82745343.13210.313
6N.R. PalIndian Statistical Institute, Kolkata81995249.38210.313
7S.K. PalIndian Statistical Institute, Kolkata71792256.0020.286
8N. GargIndian Institute of Technology, New Delhi61002200.4040.667
9P. SarasuDr S.R.University, Chennai5648129.6051.000
10S.R. MurthyIndian Institute of Technology, Madras5641128.2010.200
11B.B. ChaudhuriIndian Statistical Institute, Kolkata5721144.20220.600
12G.P.S. RaghavaInstitute of Microbial Technology, Chandigarh5949189.8030.300

[i] FP=Number of papers with first authors; RP=Number of papers with corresponding authors; TP=Total Papers; MCI=Major Contribution Index

Table 5:

Distribution of Highly Cited Papers across Types of S&T Research Organizations: 1996-2015

Type of OrganizationTPTCCPP%TPFP-RPFPRPMCI
Institutes of National Importance24473847302.6560.10113130.47
Universities8114648180.8419.9555100.69
Industrial Enterprises566560117.1413.7923000.41
Engineering Colleges4311277261.0910.5921000.49
Research Institutes386749177.619.3621000.55
Non-Profit & Others113316301.452.711000.09
23413

[i] FP=Number of papers with organizations affiliating to first authors; RP=Number of papers with organizations affiliating to corresponding authors; TP=Total Papers; MCI=Major Contribution Index

Collaborative Profile of Organizations: Authors from Single Institution

In all, 284 authors from 35 organizations collaborated in groups of various sizes (mainly from same parent organization) to publish130highly cited papers (32.02% share) in computer science. Of the 130 single-institution highly cited papers, 69were from 9 institutes of national importance, 34from 8 universities, 13 from 9 research institutes, 6 from 5 engineering colleges, and 6 from 4 industrial enterprises.

The detail of these organizations are as follows:(i) Institutes of National Importance – ISI-Kolkata (18 papers), IIT-Kharagpur and IIT-Delhi (11 papers each), IISc-Bangalore (8 papers), IIT-Madras (7 papers), IIT-Kanpur and IIT-Bombay (6 papers each), IIT-Roorkee (3 papers) and IIT-Guwahati (1 paper); Universities - Dr. S R University & Vel Tech University, Chennai (11 papers), University of Hyderabad (9 papers), Jadavpur University, Kolkata (7 papers).

The distribution of 130 papers by authorship per publications was as follows: 29papers resulted from contribution by 1 author each, 63by 2 authors each, 29by 3 authors each, 5by4 authors each, 2 by 5 authors each, and2by 6 authors in all. The authorship to130 papers averaged to 2.18authors per publication.

Table 6:

Top 10 Organizations in Computer Science Research: 1996-2015

S.NoName of the OrganizationTPTCCPPTP(FP, RP)FPRPMCI
1Indian Statistical Institute, Kolkata4610218222.134632130.74
2Indian Institute of Technology, New Delhi468123176.59462010.45
3Indian Institute of Science, Bangalore4216106383.484216220.43
4Indian Institute of Technology, Kharagpur336836207.15331210.38
5Dr S.R.University, Chennai283140112.1428281.00
6Indian Institute of Technology, Madras265934228.232680.31
7Indian Institute of Technology, Kanpur2619088734.15261310.52
8Indian Institute of Technology, Bombay235299230.39231010.46
9Jadavpur University, Kolkata235952258.782320220.96
10Microsoft Research, Bangalore132113162.541370.54

[i] TP=Total Papers; FP=Number of first-author top cited articles; RP=Number of corresponding top-cited articles

Table 7:

Citation Performance of Highly Cited Publications by Collaboration Type: 1996-2015

Type of CollaborationTotal PublicationsTotal CitationsAverage Citations per Publication
Co-authors with their affiliation to same single Institution13036541281.08
National collaborative Publications6712441185.69
International collaborative Publications20955733266.67
406104715257.92

Of the 130 single institution publications, 100 papers resulted from single authors (serving both as first author and corresponding author), 40from authors serving only as first author, and 20 from authors serving only as corresponding author.

Collaborative Profile of Organizations: National Collaborative institutions

In all, 178 authors from 141 Indian organizations collaborated in groups of various sizes (coming from 2 or more organizations) to publish 67 national collaborative highly cited papers (16.50% share) in computer science (Table 8). Of the 67 national collaborative highly cited papers, 42 were from 9 institutes of national importance, 26 from 23 universities, 9 from 9 research institutes, 20 from 17 engineering colleges, and 10 from 10 industrial enterprises; and 6 from 4 other organizations.

The organizations which collaborated in these 67 highly cited papers include: : (i) Institutes of National Importance- ISI-Kolkata (16 papers), IIT-Delhi (8 papers), IIT-Kanpur and IISc-Bangalore (6 papers each), and IIT-Bombay (3 papers)–The distribution of 67 papers by participating institution was as follows:61 papers each with 2 participating organizations, 5 papers each with 3 participating organizations and 1 paper with 4 participating organizations. The institutional authorship to 67 national collaborative papers averaged to 2.10 collaborating organizations per paper.

The distribution of 67 papers by authorship per publications was as follows: 33 papers were contributed by 2 authors each, 25 publications each by 3 authors each, 8 publications contributed by 4 authors each and1 paper contributed by 5 authors. The authorship to 67 papers averaged to 2.66 collaborating authors per publication.

Of the 67 national collaborative publications, 50 papers resulted from authors serving both as first author and corresponding author, 24 from authors serving only as first author, and 10 from authors serving only as corresponding author.

Collaborative Profile of Organization: International Collaborative Institutions

In all, 1105 authors from 1107 Indian and foreign organizations representing 45 countries collaborated in groups of various sizes (from Indian and foreign organizations) to publish 209 international collaborative highly cited papers (51.48% share) in computer science. Of the 209 international collaborative highly cited papers, 116 were from 11 institutes of national importance, 18 from13 engineering colleges, 23 from 11 universities, 13 from 13 research institutes, and 42 from 30 industrial enterprises.

Indian Institute of Science, Bangalore and Indian Institute of Technology, New Delhi contributed the largest number of papers (29 each), followed by Indian Institute of Technology, Bombay and Indian Institute of Technology, Madras (19 papers each).

The distribution of 209 international collaborative highly cited papersby participating organizations was as follows: 68 papers resulted from 2 participating organizations in each, 53 from 3 participating organizations in each, 29 from 4 participating organizations in each, 11 with 5 participating organizations in each, 7with 6 participating organizations in each, 4 with 7 participating organizations in each, 6 with 8 participating organizations in each, 5 with 9 participating organizations in each, and 3with 10 participating organizations in each. The average number of participating organizations per paper was 5.30.

Medium of Communication

Journals play an important role in the communication structure of science. Of the 406 highly cited papers in computer science research from India, 370were published in 150 peer-reviewed journals (Impact Factor information was available for 143 journals only). No significant correlation was found between citations to highly cited papers and impact factor of their reporting journals (Table 9-10).

Of 150 journals, 85 reported one highly cited publication each, 23 (13.33%) reported two publications each, 11 (11.11%) reported three publications each, 9, 6 and 2 journals reported four, five and six publication respectively and 2, 1 and 6 journals reported seven, eight and nine publications respectively and 3, 1 and 1 journals reported 10, 11 and 14 publications respectively. Table 11 (given at the end) lists the top 32 journals which published 4 or more highly cited publications. IEEE Transactions on Evolutionary Computationpublished largest number of the highly cited publications (14 papers), followed by Pattern Recognition (11 publications), IEEE Transactions on Systems, Man & Cybernetics. Part B, IEEE Journal on Selected Areas in Communication and IEEE Transactions on Information Theory (10 papers each), IEEE Transactions on Image Processing, IEEE Transactions on Wireless Communication, Applied Soft Computing Journal, Fuzzy Sets & Systems, Pattern Recognition Letters and Computer (9 papers each), IEEE Transaction on Industrial Electronics (8 papers), Bioinformatics and IEEE Communication Magazine (7 papers each),Expert Systems with applications and International Journal of Control Theory & Applications (6 papers each), etc.

Top 10 Highly Cited Papers

Of the top 10 highly cited papers, 8were published during 1996-2002 and other 2 during 2010-2011. Two papers were published in IEEE Transaction on Evolutionary Computation [IF=5.905], and 1 each in ACM Computer Surveys [IF=5.243], Computer[IF=1.115], Computer Methods in Applied Mechanics & Engineering [IF=2.203], IEEE Communication Magazine[IF=5.125], IEEE Transaction on Industrial Electronics [IF=6.383], IEEE Transaction on Image Processing [IF=3.73], and Neural Computation [IF =1.626]. One paper was published in conference proceedings. Both citation numbers and ranking for the TC2015 are displayed. The top ranking paper - “A fast and elitist multi-objective genetic algorithm: NSGA-II”was published by Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.in IEEE Transactions on Evolutionary Computation in 2002 and had TC2015 of 12279.

The study organized publication and citation data into seven groups such as (i) first author publications (FP), (ii) corresponding author publications (RP), (iii) the number of citations since publication to 2014 is referred as TC2014, (iv) citations received in the year of publications (C0), (v) citations in the first year after publication (C1), (viii) the number of citations received in year 2014 is referred as C2014, (vi) national and international collaborative publications, and (vii) most productive journals etc (Table 8).

Effect of Time Period on Citations Output

Citation life cycle of highly cited papers published in the time period 1996-2015 exhibit two trends i) papers that exhibit typical early peak, reaching their citation peak in 5 years since publication (Thakkar, K.N. et al. Nanomedicine 2010, 276 citations) (Rahman. I et al. Biochemical Pharmacology 2006, 488 citations), ii) papers that exhibit delayed recognition, delayed citation peak, reaching their citation peak in 13-14 years since publication. In overall, life cycle of highly cited papers (TC2015< 100) lasted from 6 to 14 years and that they all enter decline in their citation after reaching their peak. As can be seen, highly cited papers effectively have dated life cycle but they differ significantly in their cumulative citations output (TC2015) varying from 774 to 12279 citations. It is significant to note that effective from 2010 papers a more-rapid rise in citation numbers, and needed relatively fewer years to reach their citation peak. If such a trend continues to stay, high percentile articles will certainly reach their citation peaks even faster and would need relatively fewer years since their publication (Figure 2).

DISCUSSION

India published a total of 406 highly cited articles in computer science, constituting 0.32% world share during 1996-15. Only such papers that received at least 100 citations since their publication till August 2015 were covered in this study. The publications and citations data for the study was sourced from Scopus database. Though citation in research evaluation is viewed as an acknowledgement of intellectual debt and scientific progress but highly cited papers illustrate high quality performance in science and a useful tool for quality assessment of key (most influential) contributors to a given research field.

Table 8:

Collaborating Countries in Highly Cited Papers in Computer Science: 1996-2015

CountryTPTCCPPNumber of publications with
TotalBoth FP and RPFPRP
USA13838568279.481388811
U.K.223889176.772210
Canada183323184.61187
Singapore154956330.401572
France142569183.50143
Japan123947328.921231
Switzerland102512251.201021

[i] TP=Total Papers; FP=Number of first-author top cited articles; RP=Number of corresponding top-cited articles

Table 9:

Distribution of 370 Highly Cited Papers in Indian Computer Science by Citation and Impact Factor

IF Range 2015Range of Citations
100-199200-299300-399400-499500-599600 & MoreTotal
6.0 & More155221328
5.0 – 5.99204121533
4.0 – 4.99126200020
3.0 – 3.99365722355
2.0 – 2.9983124213105
1.0 – 1.996612850495
0.00 – 0.99312010034
Total262462414618370

A total of 406 highly cited papers in computer science were published across 150 Indian and foreign journals with IF 6.0 and above. IEEE Transactions on Evolutionary Computationpublished largest number of the highly cited publications (14 papers, 3.49%), followed by Pattern Recognition (11 publications, 2.70%), IEEE Transactions on Systems(10 papers, 2.46%), Man & Cybernetics Part B(10 papers, 2.46%), IEEE Journal on Selected Areas in Communication(10 papers, 2.46%)and IEEE Transactions on Information Theory (10 papers, 2.46%), etc.

These 406 highly articles have received 104715 citationstheir citation impact averaged to 257.92 citations per paper, with annual CPP ranging between 108.33 (lowest in 2015) and 884.89 (highest in 2002). Quality and impact of highly cited papers was compared on a citation density metric. It was highest with 108.33 CPP/PCY in the year 2015, the smallest with 11.59 CPP/PCY in the year 1998. High surge in citation density data of highly cited papers over time in particular during 2010-15 up from 37.52 to 108.33 CPP/PCY implies that computer science research in India has significantly improved in its quality and impact.

Citation life cycle of highly cited papers published in the time period 1996-2015 exhibit two trends i) papers that exhibit typical early peak, reaching their citation peak in 5 years since publication (Thakkar, K.N. et al. Nanomedicine 2010, 276 citations) (Rahman. I et al. Biochemical Pharmacology 2006, 488 citations), ii) papers that exhibit delayed recognition, delayed citation peak, reaching their citation peak in 13-14 years since publication. It is significant to note that effective from 2010 papers a more-rapid rise in citation numbers, and needed relatively fewer years to reach their citation peak. If such a trend continues to stay, high percentile articles will certainly reach their citation peaks even faster and would need relatively fewer years since their publication.

Of the 148 Indian organizations which contributed to computer science research during 1996-2015, 39 were industrial enterprises, 33 universities & colleges, 32 engineering colleges, 27 research institutes, 12 institutes of national importance and 5 non-profit and other organizations. Institutes of National Importance dominated the publications output of highly cited papers in computer science with largest share (60.10%, 244 papers), followed by universities (19.95% share, 81 papers), industrial enterprises (13.79% share, 56 papers each), engineering colleges (10.59% share, 43 papers), research institutes (9.36% share, 38 papers), non-profit & others (2.71% share, 11 papers) during 1996-2015.

Table 10:

Citations to Top 10 Most Highly Cited Papers in Computer Science from India, 1996-2015

TC1996199719981999200020012002200320042005200620072008
1Deb, K. et al. IEEE Trans on Evolutionary Computation,6(2), 182-971227926982172295476620
2Jain, A.K. et al. ACM Computing Surveys 1999;31(3):264-3236048223868122182288362476539
3Sandhu, R.S. et al. Computer1996;29(2):38-47304036718275672116148224250324283
4Deb, K. Computer Methods in Applied Mechanics & Engineering 2000, 186(2-4):311-381597382044445480105
5Hara, S et al. IEEE Communication Magazine1997;35(12):126-331480316246288121208175151163123
6Das, S. et al. IEEE Trans on Evolutionary Computation feB 2011;15(1):4-311273
7Kouro, S. et al. IEEE Trans on IndustriXal Electronics Aug
2010;57(8):2555-801389
8Srinivasan Reddy, B. et al. IEEE Trans on Image Processing 1996;5(8):1266-7190464691722233952655364
9Yang, X. S. et al. 2009 World Congress on Nature and Biologically Inspired Computing Controlled Release846
10Keerthi, S.S. et al. Neural Computation March 20011;3(3):637-4977429132429465651
20092010201120122013201420152016
1Deb, K. et al. IEEE Trans on Evolutionary Computation 2002; 6(2):182-978171064117112621563176018071119
2Jain, A.K. et al. ACM Computing Surveys 1999;31(3):264-
323554520521525578549489275
3Sandhu, R.S. et al. Computer 1996; 29(2):38-4727325423722420319216080
4Deb, K. Computer Methods in Applied Mechanics & Engineering 2000; 186(2-4):311-3813513016218617520115796
5Hara, S et al. IEEE Communication Magazine Dec 1997;35(12):126-3311776535057384119
6Das, S. et al. IEEE Trans on Evolutionary Computation feB 2011;15(1):4-3173151262263327197
7Kouro, S. et al. IEEE Trans on Industrial Electronics
2010;57(8):2555-80311105127211239217179
8Srinivasan Reddy, B. et al. IEEE Trans on Image Processing 1996;5(8):1266-718688837694557133
9Yang, X. S. et al. 2009 World Congress on Nature and Biologically Inspired Computing Controlled Release42659117188262190
10Keerthi, S.S. et al. Neural Computation 2001;13(3):637-495362667896698240

The top 10 most productive institutes in computer science research include Indian Statistical Institute, Kolkata (Output = 46, CPP = 222.13), Indian Institute of Technology, New Delhi (Output = 46, CPP = 176.59), Indian Institute of Science, Bangalore (Output = 42, CPP = 383.48), Indian Institute of Technology, Kharagpur (Output = 33, CPP = 207.15), Dr S.R.University, Chennai (Output = 28, CPP = 112.14), Indian Institute of Technology, Madras (Output = 26, CPP = 228.23), Indian Institute of Technology, Kanpur (Output = 26, CPP = 734.15), Indian Institute of Technology, Bombay (Output = 23, CPP = 230.39), Jadavpur University, Kolkata (Output = 23, CPP = 258.78), and Microsoft Research, Bangalore (Output = 13 CPP = 162.54).

In terms of MCI, Mepco Schlent Engineering College, Sivasaki and Dr S.R.University, Chennai registered the highest value of 1.0 each with publication rank 19 and 5, followed by Jadavpur University, Kolkata (MCI=0.96 and Publication Rank=9), Indian Statistical Institute, Kolkata(MCI=0.74 and Publication Rank=1), Institute of Microbial Technology, Chandigarh, Indian Institute of Technology, Roorkee and Defense R&D Organization(MCI=0.67 each and Publication Rank=12, 13 and 18 respectively), etc. MCI varies from 0.0 to 1.0. MCI greater than 0.500 indicates that the author has high potential to conduct research independently, contribute to research productivity significantly, or play more prominent role in research collaboration. On the contrary low MCI is a sign of heavy reliance on others to play leadership role in conducting research or in research collaboration.

Authorship to 406 highly cited papers varied widely from 1 to 150 authors per paper with an average of 3.86 authors per paper. Sole authorship was limited to 7.14% share in output, joint authorship to 36.45% output, and multiple authorship to 56.40% output. Multiple-authorship (3 -150 authors per paper) in computer science research is increasingly becoming the mode. It signals the onset a trend towards team based/ multi-institutional collaborative research to produce high quality research in computer science.

This study observed that internationally collaborated papers averaged higher citation rate per paper (204.1) relative to nationally collaborated papers (140.1). International collaboration is an indispensable to quality of computer science research. Of 406 highly cited papers, 229 resulted from international collaboration across 45 countries. United States participated in the largest number of publications (138), followed by U.K. (22), Canada (18), Singapore (15), Japan (12), Switzerland (10), China (8), Norway (8), Taiwan (7), South Korea (7), Australia (7), Germany (7), Italy (6), Israel (5), Denmark, Hong Kong, Poland, Spain, And Egypt (4 each), and Netherlands, Chile, and Greece (3 each).

CONCLUSION

India’s productivity of highly cited papers in computer scienceby authors is still not as significant as expected given the fact that as many as 208 authors were able to contribute justone paper each once in a long span of 15 years, and more so because India’s world share of highly cited papers in computer science has been abysmally low just at 0.32%. Besides, the country didn’t show a promising rising trend in its rate of growth in its output of highly cited papers over time. It remained range bound between 2 to 11 papers per year. The slow growth rate of high quality papers in computer scienceis indicative of dearth of high profile/high productivity scientists and of high-productivity scientific institutions in computer science in the country; it is a matter of great concern. Notably, bulk of the output of high quality and high impact research in computer science in India has resulted from select top academic and research organizations/institutions working in isolation and not in collaboration. Team based/multi-institutional research in computer science was limited to select few highly cited papers. The challenge before the top leadership in science in the country is how to encourage team-based/multi-institutional collaborative research in order to produce and publish high quality and high impact research work in computer science.

REFERENCES

1. 

India Brand Equity Foundation (IBEF) IT & ITeS Industry in India Report.www.ibef.org/industry/information-technology-india.aspx.

2. 

Singh Mayank, Pramanik Soumajit, Chakraborty Tanmoy , authors. PubIndia: A Framework for Analyzing Indian Research Publications in Computer Science. D-Lib Magazine. November/ December;2015;21(11/12)

3. 

Das Anup K, Karanjai , authors. Aruna Institutional distribution in computer science research in India : A study. Annals of Library and Information Studies. 2002;49(1):23–7

4. 

Gupta BM, Kshitij Avinash, Verma Charu , authors. Mapping of Indian computer science research output. 1999-2008. Scientometrics. 2011. 86(2):p. 261–83

5. 

Gupta BM, Kshitij Avinash, Singh Yoginder , authors. Indian computer science research output during 1999-2008: Qualitative Analysis. DESIDOC Journal of Library & Information Technology November. 2010;30(6):39–54

6. 

Singh Vivek K, Uddin Ashraf, Pinto David , authors. Computer science research: The top 100 institutions in India and in the world. Scientometrics. 2014. 104(2):p. 529–53

7. 

Uddin Ashrafand S, Vivek K , authors. Mapping the computer science research in SAARC Countries. IETE Technical Review. 2014. 31(4):p. 287–96

8. 

Singhal Khushboo, Banshal Sumit Kumar, Uddin Ashraf, Singh Vivek K , authors. A scientometric analysis of computer science research in IndiaContemporary Computing (IC3), 2015 Eighth International Conference on, NOIDA, 20-22 Aug; 2015. p. 177–182. 10.1109/IC3.2015.7346675. Publisher:. IEEE

9. 

Gupta BM, Bala Adarsh, Sharma Nandini , authors. Ranking of Indian institutions contributing to computer science research, 19992008. DESIDOC Journal of Library & Information Technology. 2011;31(6):460–8

10. 

Chuang Kun-Yang, Ho Yuh-Shan , authors. A bibliometric analysis of top-cited articles in pain Research. Pain Medicine. 2014;15:732–44

APPENDIX

Table 11:

List of journals publishing 4 or more high cited papers

S.NoName of the JournalIFNPPapers Citations
1IEEE Transactions on Evolutionary Computation5.9051412279, 1273, 663, 509, 313, 278, 223, 188, 182, 159, 142, 129, 124, 109
2Pattern Recognition3.39911612,554, 354, 307, 231, 196, 169, 134, 119, 116, 102
3IEEE Transactions on Systems, Man & Cybernetics. Part B6.2210619, 447, 201, 183, 168, 149, 142, 127, 121, 106
4IEEE Journal on Selected Areas in Communication3.67210500, 442, 359, 319, 317, 293, 114, 105, 101, 101
5IEEE Transactions on Information Theory1.73710315, 309, 201, 197, 184, 177, 144, 131, 124, 105
6IEEE Transactions on Image Processing3.739946, 322, 152, 142, 125, 124, 110, 101, 100
7IEEE Transactions on Wireless Communication2.9259466, 347, 298, 234, 186, 151, 133, 132, 120
8Applied Soft Computing Journal2.8579247, 167, 167, 157, 138, 125, 122, 119, 107
9Fuzzy Sets & Systems2.0989328, 194, 153, 130, 120, 115, 107, 101, 101
10Pattern Recognition Letters1.5869308, 248,195, 153, 143, 135, 121, 113, 110
11Computer1.11593160, 419, 407, 370, 324, 313, 265, 248, 102
12IEEE Transaction on Industrial Electronics6.38381081, 697, 293, 179, 157, 126, 125, 107
13Bioinformatics5.7667414, 217, 178, 153, 139, 117, 110
14IEEE Communication Magazine5.12571585, 437, 165, 153, 140, 135, 132
15Expert Systems with applications2.9816264, 192, 114, 111, 103, 103
16International Journal of Control Theory & Applications0.956142, 135, 129, 123, 119, 107
17IEEE Transactions on Fuzzy Systems6.7015447, 321, 210, 119, 103
18Information Sciences4.955280, 272, 208, 143, 123
19IEEE Transactions on Neural Networks4.8545368, 325, 208, 191, 110
20Journal Of Chemical Information & Modeling2.885266, 167, 135, 114, 108
21IEEE/ACM Transactions on Networking2.1865182, 181, 161, 158, 135
22IEEE Micro1.0915342, 146, 141, 123, 107
23Wireless Networks1.0065663, 284, 249, 119, 106
24Evolutionary Computation3.64610, 338, 237, 139
25Mechanical Systems & Signal Processing2.7714174, 149, 108, 105
26Information & Management2.1634256, 152, 130, 110
27IEEE Signal Processing Letters1.6614272, 226, 146, 101
28International Journal of Modeling, Identification & Control1.574139, 124, 110, 104
29Mobile Networks & Applications1.5384231, 120, 118, 108
30Journal of Chemical Information & Computer Science1.334266, 198, 167, 102
31IEEE Software0.824194, 159, 138, 117
32International Journal of Soft Computing0.264146, 143, 137, 131