How universities submit data to rankings and why it matters
The data submission process introduces inconsistencies, strategic reporting, and verification gaps that affect ranking reliability.
The institutional data pipeline
Much of the data that feeds into university rankings is provided by the universities themselves. Rankings send data collection templates to institutions, asking for counts of students, faculty, research income, publications, and other metrics. University administrators compile this data from internal systems and submit it to the ranking publisher. This pipeline, from internal record-keeping to published ranking, is the least visible but potentially the most consequential part of the ranking process.
The data submission process is vulnerable to several types of error and inconsistency. Different universities may interpret the same reporting guidelines differently. An international student who holds dual citizenship may be counted as international by one institution and domestic by another. A faculty member who splits their time between teaching and research may be counted as full-time equivalent in different ways. A research grant awarded in one currency and spent in another may be converted at different exchange rates. These small differences accumulate and can shift an institution's position.
Strategic reporting and gaming
Beyond inadvertent inconsistency, there is the more troubling possibility of strategic reporting. Universities have incentives to present their data in the most favorable light, especially as rankings increasingly influence student recruitment, faculty hiring, and funding decisions. An institution might classify marginal staff as research-active to reduce its student-to-staff ratio, or include non-academic income in its research income total, or report its most favorable year of data rather than the most recent.
The scope for gaming is limited by the specificity of ranking publishers' data definitions, but definitions can never be perfectly precise. There is always judgment involved in classifying people, activities, and resources into the categories that rankings use. A university that employs people specifically to manage its ranking data submissions is not necessarily cheating, but it is optimizing its presentation in ways that less well-resourced institutions cannot match. The playing field is not level.
Verification and audit processes
Some ranking publishers have introduced verification and audit processes to improve data quality. Data may be cross-checked against independent sources, such as government statistical agencies or bibliometric databases. Statistical checks may flag implausible values for manual review. Third-party auditors may be engaged to verify a sample of submitted data. These measures improve reliability but are not foolproof. Auditors cannot verify every data point from every institution, and some types of data are inherently difficult to verify independently.
The transparency of verification processes varies. Some publishers describe their verification procedures in detail; others provide only general assurances. A user cannot assess the reliability of the underlying data without knowing what verification has been performed. Look for rankings that are transparent about their data quality processes, including the proportion of data that is independently verified and the types of checks that are applied. If this information is not provided, you are being asked to trust the publisher and the submitting institutions without evidence.
What users can do to assess data reliability
Given the opacity of the data submission process, users should approach ranking data with a degree of skepticism, particularly for indicators that rely heavily on institutional self-reporting. Financial data, student counts, and faculty counts are more susceptible to inconsistency and strategic reporting than bibliometric data, which comes from independent databases, though even bibliometric data has limitations discussed elsewhere in this series.
When using rankings, cross-check key data points against official sources when possible. Government education departments, higher education statistics agencies, and university annual reports often publish data that can be compared against ranking submissions. Discrepancies do not necessarily indicate wrongdoing—they may reflect different definitions or time periods—but they should prompt further investigation. A university whose self-reported data consistently diverges from independent sources deserves additional scrutiny before you base a decision on its ranking position.
For the user, the practical implication of the submission pipeline is that ranking data should be treated as estimates rather than exact measurements. Small differences between institutions—a slightly higher student-to-faculty ratio, a modestly lower research income per capita—may reflect differences in how the data was compiled rather than differences in institutional reality. Focus on large, consistent patterns rather than precise comparisons, and always verify critical figures against official institutional sources when they matter for your decision.
A healthy ranking ecosystem depends on institutions that report honestly and publishers that verify rigorously. When either side of this relationship weakens, the entire product suffers. As a user, your awareness of the data submission process helps you calibrate your trust appropriately, neither dismissing rankings as fabrications nor accepting them as infallible.