Citation-Downloads Correlation Tool

The correlation generator tool lets you test how well an article's later citation impact can be predicted from its earlier downloads or its earlier citations.

This tool has only been tested with Mozilla Firefox. It uses the Chart Widget developed by Emil A Eklund.

Correlation Source Data

The source data come from Citebase Search's database. You can restrict the data used either by dragging a selection box over the graphs below or by entering values into the form under Correlation Settings.

Download data are downloads of articles (based only on the UK arXiv mirror).

Citation data are citations from other arXiv articles (based on Citebase).

The Impact Histogram graphs show the distribution of articles by either the number of downloads or citations to each article.
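As an illustration of what an impact histogram holds, here is a minimal Python sketch. It assumes the per-article counts are already available as a dictionary; the function name `impact_histogram` and the sample data are hypothetical and are not taken from Citebase.

```python
from collections import Counter


def impact_histogram(counts_per_article):
    """Map each impact value to the number of articles that have it.

    counts_per_article: dict of article id -> number of downloads
    (or citations) for that article. Hypothetical input shape.
    """
    histogram = Counter(counts_per_article.values())
    # Sort by impact value so the result reads like the x-axis of the graph.
    return dict(sorted(histogram.items()))


# Made-up example: article id -> number of citations
citations = {"a1": 0, "a2": 3, "a3": 0, "a4": 1, "a5": 3}
print(impact_histogram(citations))  # {0: 2, 1: 1, 3: 2}
```

The keys are impact values and the values are article counts, so two articles here have no citations and two have three.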

The Latency Histogram graphs show the distribution of downloads or citations by the delay between the article being deposited and later being downloaded or cited.
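A latency histogram can be sketched in the same way. The example below assumes each download or citation is recorded as an (article id, date) pair and groups the delays into 30-day bins. The bin width, the function name `latency_histogram`, the choice to skip events dated before deposit, and the sample data are all assumptions made for illustration.

```python
from collections import Counter
from datetime import date


def latency_histogram(deposited, events, bin_days=30):
    """Count events by the delay between deposit and the event.

    deposited: dict of article id -> deposit date
    events: iterable of (article id, event date) pairs, where an
            event is one download or one citation
    Returns a dict of bin index -> number of events, where bin k
    covers delays from k * bin_days up to (k + 1) * bin_days days.
    """
    histogram = Counter()
    for article_id, event_date in events:
        delay = (event_date - deposited[article_id]).days
        if delay < 0:
            # Assumption: events dated before deposit are skipped.
            continue
        histogram[delay // bin_days] += 1
    return dict(sorted(histogram.items()))


# Made-up example data
deposited = {"a1": date(2004, 1, 1), "a2": date(2004, 3, 1)}
downloads = [
    ("a1", date(2004, 1, 5)),   # 4 days after deposit
    ("a1", date(2004, 2, 15)),  # 45 days after deposit
    ("a2", date(2004, 3, 2)),   # 1 day after deposit
]
print(latency_histogram(deposited, downloads))  # {0: 2, 1: 1}
```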

Articles Deposited per Month

Download Data

Impact Histogram

Latency Histogram

Citation Data

Impact Histogram

Latency Histogram

Correlation Settings

You can restrict the data used in the correlation by entering values into this table. As you enter values, the range of data used is automatically highlighted on the graphs above.
Because generating the correlation processes a lot of data, it may take upwards of 5 minutes to calculate.
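
To make the idea of a restricted correlation concrete, here is a rough Python sketch. It counts each article's downloads in an early window and its citations in a later window, then correlates the two series. The page does not say which correlation coefficient the tool computes, so Pearson's r is an assumption here, as are the window boundaries, the function names and the sample data.

```python
from math import sqrt


def count_in_window(delays, start, end):
    """Number of events whose delay in days lies in [start, end)."""
    return sum(1 for d in delays if start <= d < end)


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length series."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two series of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)


# Made-up data: per article, delays in days since deposit
articles = {
    "a1": {"downloads": [1, 5, 40], "citations": [200, 400]},
    "a2": {"downloads": [2], "citations": []},
    "a3": {"downloads": [3, 10, 20, 100, 150], "citations": [250, 300, 500]},
}

# Hypothetical restriction: downloads in the first 180 days
# against citations between 180 and 730 days after deposit.
early_downloads = [
    count_in_window(a["downloads"], 0, 180) for a in articles.values()
]
later_citations = [
    count_in_window(a["citations"], 180, 730) for a in articles.values()
]

r = pearson(early_downloads, later_citations)
print(f"r = {r:.3f}")
```

With only three made-up articles the result has no statistical meaning; the sketch is meant only to show how narrowing the windows changes which events enter the calculation.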