The current influence statistics are computed using the
OpenAlex dataset, which contains comprehensive scientific publication records, citation relationships, as well as metadata about authors, institutions, journals, conferences, and fields of study. The current InfluenceFlower is based on the OpenAlex 2025-05-30 release, replacing the previously used
Microsoft Academic Graph (MAG), which was retired in December 2021.
In a nutshell, the whole dataset contains:
268 million scientific publications from as early as the 1800s.103 million authors and 115 thousand institutions identified by OpenAlex.65 thousand concepts (fields of study), such as "labour economics" and "algorithms".2.64 billion citation links between publications. The transition to OpenAlex offers access to an expanded dataset, enhancing the availability and accuracy of influence statistics. However, some papers and authors may have missing data due to indexing delays or challenges with name entity resolution. Additionally, scoring is based on a dataset snapshot, while entity names are retrieved directly from the OpenAlex API, potentially causing occasional discrepancies between API results and scoring data.