Tim Brody <tdb01r@ecs.soton.ac.uk> [Project members: Tim Brody, Heinrich Stamerjohanns, François Vallières, Stevan Harnad, Yves Gingras, Charles Oppenheim, Steve Hitchcock]
Warning: The data presented here are preliminary unrefereed results that are still being analyzed and corrected (we welcome any suggestions or questions). This is not yet the "definitive" version of our findings. Please do not cite them without consultation with the authors.
See also The effect of open access and downloads ('hits') on citation impact: a bibliography of studies.
The following graphs show the size of the "OA advantage" in different fields of physics and mathematics across the years from 1992-2003.
The data come from the Institute for Scientific Information (ISI) CD-ROM citation database, which covers the main journals in all fields, and from the OA Physics/Mathematics Archive http://arxiv.org/ where authors can self-archive their articles to make them Open Access (OA).
The method was to take all the physics and mathematics articles indexed by ISI from 1992-2003 and first calculate how much each article is cited. Then all these articles are divided into those that are and are not in Arxiv, hence are or are not OA.
The OA advantage is then calculated from a comparison of the citation counts for the OA versus the non-OA articles: 100(OA/non) - 100% (OA divided by non-OA citation counts, minus 100%). This gives the percentage by which citation impact is altered, positively or negatively, by making the article OA (by self-archiving it in Arxiv).
As will be seen, virtually all of the OA impact effect (red) is positive: OA enhances citation impact substantially, sometimes by several hundred percent. This is to be expected, because increasing accessibility increases rather than decreases potential usage.
Another measure of interest is the percentage of the yearly articles in a field that are self-archived and thereby made OA (green) relative to the total number of articles in the field (gray). This percentage is (slowly) increasing across the years and is already especially high in nuclear and high-energy physics, the fields in which Arxiv began. (Institutional self-archiving, especially when mandated by the institution and research funder, will help accelerate this.)
The data are analysed in four different ways. The first column is the raw averages, all OA articles in the field compared to all non-OA articles, by year. The second column is a more sensitive, controlled comparison, comparing OA vs. non-OA articles only within the same journal, by year. The third column removes all self-citations. The fourth column is based on a random sample of non-OA articles within the same journal, rather than all of them (to make the numbers more equal, because there are far more non-OA articles in most journals than OA ones). The patterns for the four analyses are much the same, but some interesting differences do emerge.
The first and last entry in each graph is (first) the average across all 12 years and (last) the average for the most recent three years (2001-2003). There are also the correlations between the OA proportion and time (the OA proportion is growing across the years), between the OA advantage and time (?) and between the OA proportion and the OA advantage (?). The OA advantage seems to be the strongest within the first 3 years of publication (including a year before publication for the preprint). For the correlation between usage impact (downloads) and later citation impact, see: http://citebase.eprints.org/analysis/correlation.php
We thank ISI for providing us with access to the ISI's bibliographic database to perform this study.
As with all bibliographic databases the ISI data we use has limitations. The ISI data available covers articles published between 1980-2004. The journals included varies each year (with new journals being added). The Science database - which these initial results are from - has articles from 6443 distinct journals. The ISI WoS (the Web-based citation index tool) covers around 2000 additional journals.
To provide a summary of how the ISI data is used, I will answer the questions posed by Henk F. Moed (who estimates an error rate of 7% for referenced citations) in "The impact-factors debate: the ISI's uses and limits," Nature, 2002:
| Data Used | The four columns of graphs below represent different samples and analyses from two database: (i) data from the ISI CD-ROM citation database on 1041579 Physics, and 141990 Mathematics articles published in 311 Physics, and 180 Mathematics journals from 1992-2003 and (ii) data on 99741 Physics, and 6634 Mathematics articles in the same journals, self-archived by their authors in http://arxiv.org/ to make them Open Access (OA). The purpose is to compare the number of citations for articles that are and are not OA. Each row is a different field of Physics or Mathematics. Analyses become more focussed and controlled across columns: Column 1 compares overall OA vs. non-OA across all articles in a given field, summing across all journals in that field for each year. Column 2 ("Same-Journal Control") compares within the same journal and sums the averages across journals. Column 3 ("Self-Citations Removed") removes all author self-citations; and Column 4 ("Sample-Size Equalized") equalizes the comparison by comparing OA only with a random sample of the much larger number of non-OA articles. |
|---|---|
| Same-Journal Control (Col. 2) | To compare like with like, the OA/non-OA comparisons in each field are first calculated within each journal for each year, and then averaged across all journals in that field for each year. The purpose is to avoid comparing OA effects in one journal with non-OA effects in another. This reduces the number of articles included as those journals that have either no OA articles or no non-OA articles are excluded from the overall averages. |
| Self-Citations Removed (Col. 3) | Although it is unlikely that OA/non-OA citation differences should arise because of differences in self-citing patterns between self-archiving and non-self-archiving authors, self-citations are removed in Column 3. An author self-citation is assumed whenever the citing and cited article share at least one author's name. As it is possible that two different authors may share the same name (by LASTNAME-INITIALS) this may include false-positives. |
| Sample-Size Equalized (Col. 4) | An imbalance in the number of OA and non-OA articles could possibly bias the size of the OA/non-OA citation advantage. In Column 4, wherever there are more non-OA than OA articles in a given journal, only an equal-sized random sample of the OA articles is used for comparison, rather than the total number (and vice-versa). |
| Missing Years | All graphs have been padded with blank data where there are no data available for a particular year. This does not affect the overall Mean or 2001-3 mean. |
| Minus 100% OA Advantage | Some years may show a minus 100% OA/Non-OA (dis)advantage. This occurs whenever there are both OA and Non-OA articles in that year, but no citations to the OA articles. This is currently not being removed from the Mean. |
| OA Advantage | The OA/non-OA citation ratio is represented as an OA advantage, that is the percentage ratio minus 100%. If there is an OA disadvantage (non-OA articles average more citations than OA articles) this is coded as a black downward deviation from 100%, instead of a red upward deviation. The scale is fixed at 400% absolute, so any advantages greater than 300% are indicated by a broken bar. |
| Specialite | % Articles OA (OA/Non-OA) | % OA Advantage | |||
|---|---|---|---|---|---|
| Biology: | <1% | 4117/640100 | 49% | 8.11/5.13 | |
| Agricult & Food Science | <1% | 121/64462 | 12% | 5.58/4.76 | |
| Botany | <1% | 367/76783 | 34% | 9.58/7.06 | |
| Dairy & Animal Science | 11.8% | 2745/19992 | 22% | 5.95/4.25 | |
| Ecology | <1% | 2/4670 | 152% | 18/7.15 | |
| Entomology | 1% | 206/20172 | 38% | 8.26/5.72 | |
| General Biology | <1% | 148/72570 | 456% | 10.02/1.55 | |
| General Zoology | 1% | 266/23900 | 89% | 9.5/4.95 | |
| Marine Biology & Hydrobiology | <1% | 81/35732 | -25% | 4.55/6.38 | |
| Miscellaneous Biology | <1% | 46/12923 | 54% | 6.38/4.11 | |
| Miscellaneous Zoology | 1.1% | 135/11665 | 1% | 4.5/3.93 | |
| Biomedical Research: | <1% | 8106/1345207 | 218% | 34.07/13.47 | |
| Anatomy & Morphology | <1% | 34/4387 | 104% | 9.56/4.69 | |
| Biochemistry & Molecular Biology | <1% | 464/405341 | 357% | 72.32/14.17 | |
| Biomedical Engineering | 1% | 292/30530 | 21% | 7.34/5.59 | |
| Biophysics | <1% | 79/37437 | 108% | 6.98/4.95 | |
| Cellular Biology Cytology & Histology | <1% | 322/64434 | 241% | 42.84/11.69 | |
| Embryology | n/a | n/a | n/a | n/a | |
| General Biomedical Research | 1.9% | 4368/225782 | 194% | 41.85/17.85 | |
| Genetics & Heredity | <1% | 644/128696 | 4% | 13.04/12.61 | |
| Microbiology | n/a | n/a | n/a | n/a | |
| Microscopy | <1% | 3/1776 | 59% | 3.67/3.41 | |
| Miscellaneous Biomedical Research | <1% | 75/27119 | 9% | 4.59/3.96 | |
| Nutrition & Dietetic | <1% | 174/19154 | 71% | 13.54/7.72 | |
| Parasitology | <1% | 33/9393 | 18% | 8.19/7.13 | |
| Physiology | <1% | 3/13078 | -62% | 5.25/12.92 | |
| Virology | 7.8% | 1615/19115 | 56% | 25.11/15.63 | |
| Chemistry: | <1% | 2506/1039817 | 136% | 16.16/6.44 | |
| Analytical Chemistry | <1% | 289/64613 | 133% | 18.54/7.52 | |
| Applied Chemistry | <1% | 27/8297 | 4% | 3.75/3.63 | |
| General Chemistry | <1% | 996/334083 | 244% | 19.66/5.07 | |
| Inorganic & Nuclear Chemistry | <1% | 12/35583 | -7% | 3.44/5 | |
| Organic Chemistry | <1% | 2/27382 | -39% | 2.5/6.34 | |
| Physical Chemistry | <1% | 608/244983 | 41% | 8.77/6.48 | |
| Polymers | <1% | 572/74420 | 130% | 12.14/6.11 | |
| Clinical Medicine: | <1% | 2914/3413447 | 193% | 25.69/7.19 | |
| Addictive Diseases | n/a | n/a | n/a | n/a | |
| Allergy | n/a | n/a | n/a | n/a | |
| Anesthesiology | <1% | 197/38055 | 193% | 9.98/3.01 | |
| Arthritis & Rheumatology | <1% | 147/29265 | 113% | 11.41/5.94 | |
| Cancer | <1% | 225/96489 | 132% | 26/11.63 | |
| Cardiovascular System | <1% | 243/111411 | 178% | 20.56/7.55 | |
| Dentistry | n/a | n/a | n/a | n/a | |
| Dermatology & Venerial Disease | <1% | 159/38547 | 124% | 8.86/4.05 | |
| Endocrinology | <1% | 520/63072 | 157% | 25.5/10.24 | |
| Environmental & Occupational Health | n/a | n/a | n/a | n/a | |
| Fertility | n/a | n/a | n/a | n/a | |
| Gastroenterology | <1% | 1/16532 | 505% | 5/0.83 | |
| General & Internal Medicine | <1% | 512/319748 | 398% | 41.9/7.57 | |
| Geriatrics | <1% | 29/10551 | 207% | 14.62/4.9 | |
| Hematology | <1% | 180/77238 | 172% | 14.94/5.69 | |
| Immunology | <1% | 4/53822 | -100% | 0/2.88 | |
| Miscellaneous Clinical Medicine | n/a | n/a | n/a | n/a | |
| Nephrology | <1% | 44/32339 | 70% | 9.12/6.1 | |
| Neurology & Neurosurgery | <1% | 121/312991 | -6% | 7.73/9.06 | |
| Obstetrics & Gynecology | n/a | n/a | n/a | n/a | |
| Ophthalmology | <1% | 1/8145 | -100% | 0/4.51 | |
| Orthopedics | n/a | n/a | n/a | n/a | |
| Otorhinolaryngology | <1% | 1/3005 | -100% | 0/4.27 | |
| Pathology | n/a | n/a | n/a | n/a | |
| Pediatrics | n/a | n/a | n/a | n/a | |
| Pharmacology | <1% | 2/42256 | -10% | 5.5/3.17 | |
| Pharmacy | n/a | n/a | n/a | n/a | |
| Psychiatry | <1% | 7/23829 | 11% | 9/6.5 | |
| Radiology & Nuclear Medicine | <1% | 8/47977 | -29% | 1.5/1.56 | |
| Respiratory System | n/a | n/a | n/a | n/a | |
| Surgery | <1% | 6/11373 | -64% | 1.17/3.21 | |
| Tropical Medicine | 6.6% | 506/7177 | 75% | 7/4.86 | |
| Urology | n/a | n/a | n/a | n/a | |
| Veterinary Medicine | <1% | 1/6166 | -25% | 1/1.33 | |
| Earth and Space: | 5.8% | 24668/372413 | 218% | 22.3/7.77 | |
| Astronomy & Astrophysics | 24.3% | 23832/68598 | 114% | 22.79/10.87 | |
| Earth & planetary Science | <1% | 170/79051 | 53% | 8.18/5.9 | |
| Environmental Science | <1% | 629/72191 | 13% | 5.86/4.85 | |
| Geography | n/a | n/a | n/a | n/a | |
| Geology | <1% | 6/16623 | 237% | 15.5/4.63 | |
| Meteorology & Atmospheric Science | <1% | 26/31716 | -18% | 7.62/8.22 | |
| Oceanography & Limnology | <1% | 5/9003 | -85% | 1/5.48 | |
| Engineering and Technology: | <1% | 2649/643314 | 47% | 4.06/2.95 | |
| Aerospace Technology | 3.3% | 595/17216 | 51% | 2.22/1.58 | |
| Chemical Engineering | <1% | 13/32976 | 221% | 7.29/2.18 | |
| Civil Engineering | <1% | 3/7543 | -75% | 0.33/0.91 | |
| Computers | 1.5% | 1070/68265 | 92% | 3.7/2.39 | |
| Electrical Engineering & Electronics | <1% | 209/146597 | 72% | 4.51/2.78 | |
| General Engineering | <1% | 39/4584 | 143% | 2.53/0.94 | |
| Industrial Engineering | <1% | 2/807 | -100% | 0/0.13 | |
| Library & Information Science | n/a | n/a | n/a | n/a | |
| Materials Science | <1% | 381/125355 | 61% | 5.22/4.03 | |
| Mechanical Engineering | <1% | 205/50849 | 187% | 8.14/2.51 | |
| Metals & Metallurgy | <1% | 77/72055 | 24% | 3.61/3.22 | |
| Miscellaneous Engineering & Technology | <1% | 4/4641 | -46% | 0.33/0.97 | |
| Nuclear Technology | <1% | 49/26284 | 86% | 3.74/1.94 | |
| Operations Research | <1% | 2/1376 | -32% | 2/1.58 | |
| Mathematics: | 4.3% | 6656/135012 | 67% | 4.7/2.76 | |
| Applied Mathematics | 1.8% | 794/36484 | 85% | 6.78/3.72 | |
| General Mathematics | 6.5% | 5338/71739 | 106% | 4.53/2.13 | |
| Miscellaneous Mathematics | 3.8% | 378/7983 | 27% | 2.15/1.95 | |
| Probability & Statistics | <1% | 146/18806 | 64% | 5.95/3.79 | |
| Physics: | 10.1% | 106040/930059 | 135% | 13.95/6.16 | |
| Acoustics | <1% | 15/18797 | 109% | 3.97/2.27 | |
| Applied Physics | <1% | 1970/245265 | 60% | 7.85/5.73 | |
| Chemical Physics | 1.1% | 1142/104175 | 49% | 12.47/9.25 | |
| Fluids & Plasmas | 3.6% | 845/22305 | 95% | 13.39/6.01 | |
| General Physics | 13.8% | 43886/267141 | 153% | 15.16/6.14 | |
| Miscellaneous Physics | 16.5% | 1021/4737 | 20% | 6.42/5.76 | |
| Nuclear & Particle Physics | 38.6% | 44798/68470 | 120% | 14.07/6.53 | |
| Optics | 1.1% | 970/82196 | 47% | 5.29/4.75 | |
| Solid State Physics | 9.8% | 11393/104209 | 90% | 10.05/5.5 | |
| Psychology: | 2.1% | 1120/49865 | 83% | 9.24/5.8 | |
| Behavioral Science & Complementary Psychology | 2.4% | 715/27674 | 31% | 9.8/7.13 | |
| Clinical Psychology | 2.6% | 1/37 | -65% | 1/2.89 | |
| Developmental & Child Psychology | 5.1% | 98/1860 | 5% | 2.93/3.09 | |
| Experimental Psychology | <1% | 3/472 | 99% | 14/7.55 | |
| General Psychology | n/a | n/a | n/a | n/a | |
| Human Factors | 4.1% | 84/2289 | 48% | 3.9/2.21 | |
| Miscellaneous Psychology | 1.7% | 166/10590 | 304% | 10.3/3.14 | |
| Psychoanalysis | n/a | n/a | n/a | n/a | |
| Social Psychology | 5.9% | 53/867 | 122% | 13.96/7.46 | |
| Unknown: | <1% | 22/24526 | -30% | 1.63/2.16 | |
| Unknown | <1% | 22/24526 | -30% | 1.63/2.16 | |
| Specialite | % Articles OA (OA/Non-OA) | % OA Advantage | |||
|---|---|---|---|---|---|
| Administration & Management: | <1% | 286/68070 | 243% | 4.54/1.04 | |
| Administration & Management, General | <1% | 43/4482 | 81% | 4.75/1.72 | |
| Business | <1% | 135/19109 | 221% | 3.54/0.74 | |
| Business, Finance | n/a | n/a | n/a | n/a | |
| Industrial Relations & Labor | 1.3% | 55/4445 | 244% | 2.64/0.81 | |
| Management | <1% | 21/13272 | 868% | 13.61/1.87 | |
| Public Administration | <1% | 32/6349 | 373% | 3.59/0.61 | |
| AHCI: | n/a | n/a | n/a | n/a | |
| AHCI | n/a | n/a | n/a | n/a | |
| Anthropology & Sociology: | <1% | 238/65496 | 841% | 5.32/0.55 | |
| Anthropology | n/a | n/a | n/a | n/a | |
| Social Issues | n/a | n/a | n/a | n/a | |
| Sociology | <1% | 238/32543 | 653% | 5.32/0.72 | |
| Communication: | <1% | 39/14334 | 137% | 2.78/1.24 | |
| Communication | <1% | 39/14334 | 137% | 2.78/1.24 | |
| Economics: | <1% | 365/49027 | 386% | 6.4/1.41 | |
| Economics | <1% | 365/49027 | 386% | 6.4/1.41 | |
| Education & Family: | <1% | 101/42250 | 292% | 3.66/0.81 | |
| Education | n/a | n/a | n/a | n/a | |
| Education, Research | n/a | n/a | n/a | n/a | |
| Education, Special | n/a | n/a | n/a | n/a | |
| Family Studies | 1.6% | 101/6216 | 140% | 3.66/1.33 | |
| Ethnology: | n/a | n/a | n/a | n/a | |
| Ethnology | n/a | n/a | n/a | n/a | |
| Fine Arts: | n/a | n/a | n/a | n/a | |
| Architecture | n/a | n/a | n/a | n/a | |
| Art | n/a | n/a | n/a | n/a | |
| Dance | n/a | n/a | n/a | n/a | |
| Music | n/a | n/a | n/a | n/a | |
| Theatre | n/a | n/a | n/a | n/a | |
| Geography, Urban and Developme: | <1% | 179/57287 | 180% | 1.8/0.54 | |
| Area Studies | <1% | 40/21762 | 271% | 0.61/0.19 | |
| Demography | <1% | 37/4425 | 121% | 2.36/0.83 | |
| Geography | <1% | 29/9198 | 325% | 4.48/0.95 | |
| Planification and Development | n/a | n/a | n/a | n/a | |
| Transport Studies | n/a | n/a | n/a | n/a | |
| Urban Studies | 1.1% | 73/6750 | 78% | 1.51/0.95 | |
| Health: | n/a | n/a | n/a | n/a | |
| Ergonomics | n/a | n/a | n/a | n/a | |
| Geriatrics & Gerontology | n/a | n/a | n/a | n/a | |
| Health Policy & Services | n/a | n/a | n/a | n/a | |
| Medicine, Legal | n/a | n/a | n/a | n/a | |
| Nursing | n/a | n/a | n/a | n/a | |
| Public Health | n/a | n/a | n/a | n/a | |
| Rehabilitation | n/a | n/a | n/a | n/a | |
| Social Sciences, Biomedical | n/a | n/a | n/a | n/a | |
| Social Work | n/a | n/a | n/a | n/a | |
| Substance Abuse | n/a | n/a | n/a | n/a | |
| History: | <1% | 108/191679 | 1032% | 1.5/0.12 | |
| History | n/a | n/a | n/a | n/a | |
| History & Philosophy of Science | <1% | 75/19692 | 336% | 2.35/0.46 | |
| History of Social Sciences | <1% | 33/15119 | -16% | 0.24/0.24 | |
| Humanities: | n/a | n/a | n/a | n/a | |
| Archeology | n/a | n/a | n/a | n/a | |
| Arts & Humanities, General | n/a | n/a | n/a | n/a | |
| Film, Radio, TV | n/a | n/a | n/a | n/a | |
| Folklore | n/a | n/a | n/a | n/a | |
| History | n/a | n/a | n/a | n/a | |
| History & Philosophy of Science | <1% | 75/19692 | 336% | 2.35/0.46 | |
| Language & Linguistic | <1% | 80/31424 | 1236% | 7.87/0.53 | |
| Philosophy | <1% | 1/5980 | 1067% | 5/0.43 | |
| Theology & Religious Studies | n/a | n/a | n/a | n/a | |
| Inconnu: | n/a | n/a | n/a | n/a | |
| Inconnu | n/a | n/a | n/a | n/a | |
| Law: | n/a | n/a | n/a | n/a | |
| Criminology & Penology | n/a | n/a | n/a | n/a | |
| Law | n/a | n/a | n/a | n/a | |
| Literature: | n/a | n/a | n/a | n/a | |
| Classics | n/a | n/a | n/a | n/a | |
| Language & Linguistic | <1% | 80/31424 | 1236% | 7.87/0.53 | |
| Literacy Review | n/a | n/a | n/a | n/a | |
| Literature | n/a | n/a | n/a | n/a | |
| Literature, African, Canadian & Australian | n/a | n/a | n/a | n/a | |
| Literature, American | n/a | n/a | n/a | n/a | |
| Literature, British Isles | n/a | n/a | n/a | n/a | |
| Literature, German, Netherlandic, Scnadinavian | n/a | n/a | n/a | n/a | |
| Literature, Romance | n/a | n/a | n/a | n/a | |
| Literature, Slavic | n/a | n/a | n/a | n/a | |
| Poetry | n/a | n/a | n/a | n/a | |
| N.A.: | n/a | n/a | n/a | n/a | |
| N.A. | n/a | n/a | n/a | n/a | |
| Other: | n/a | n/a | n/a | n/a | |
| Environmental Studies | n/a | n/a | n/a | n/a | |
| Information Science & Library Science | n/a | n/a | n/a | n/a | |
| Language & Linguistics | n/a | n/a | n/a | n/a | |
| Social Sciences, Interdisciplinary | n/a | n/a | n/a | n/a | |
| Philosophy: | n/a | n/a | n/a | n/a | |
| Philosophy | <1% | 1/5980 | 1067% | 5/0.43 | |
| Political Sciences: | n/a | n/a | n/a | n/a | |
| International Relations | n/a | n/a | n/a | n/a | |
| Political Sciences | n/a | n/a | n/a | n/a | |
| Psychology & Psychiatry: | <1% | 881/176586 | 320% | 8.36/1.73 | |
| Psychiatry | <1% | 140/30514 | 344% | 7.85/1.64 | |
| Psychology | <1% | 147/47389 | 956% | 16.86/1.31 | |
| Psychology & Psychiatry, General | <1% | 94/12305 | -83% | 0.28/1.23 | |
| Psychology, Applied | 1.7% | 182/10260 | 95% | 3.89/1.59 | |
| Psychology, Biological | n/a | n/a | n/a | n/a | |
| Psychology, Clinical | <1% | 162/19634 | 357% | 10.64/2.17 | |
| Psychology, Developmental | <1% | 17/11205 | 95% | 4.26/2.88 | |
| Psychology, Educational | <1% | 28/6111 | 280% | 5.49/1.2 | |
| Psychology, Experimental | <1% | 79/18600 | 375% | 8.02/2.28 | |
| Psychology, Mathematical | 1.1% | 14/1449 | -80% | 0.9/2.29 | |
| Psychology, Psychoanalysis | <1% | 1/177 | -100% | 0/0.01 | |
| Psychology, Social | <1% | 17/11072 | -42% | 1.9/2.76 | |
| SCI: | n/a | n/a | n/a | n/a | |
| Expanded | n/a | n/a | n/a | n/a | |
| SCI | n/a | n/a | n/a | n/a | |
| Women's Studies: | n/a | n/a | n/a | n/a | |
| Women's Studies | n/a | n/a | n/a | n/a | |