Skip to content

This portfolio provides a collection of Jupyter Notebooks designed to facilitate the use of open bibliometric data sources for higher education institutions. The goal is to offer tools for cleaning, analysing and improving open data for bibliometric analyses.

Notifications You must be signed in to change notification settings

eth-library/tobi-portfolio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TOBI Portfolio

About this Portfolio

Description

This portfolio provides a collection of Jupyter Notebooks designed to facilitate the use of open bibliometric data sources for higher education institutions. The goal is to offer tools for cleaning, analysing and improving open data for bibliometric analyses.

Benefits of Open Data Sources

Using open data sources for research assessment enables more compliance with DORA (San Francisco Declaration on Research Assessment), CoARA (Coalition for Advancing Research Assessment) and POSI (The Principles of Open Scholarly Infrastructure) compared to licensed commercial databases. Open data are also aligned with FAIR principles (Findable, Accessible, Interoperable, Reusable). In comparison to established commercial databases, they usually provide:

  • Unpaywalled access to the data
  • More transparency and possibilities for reuse due to less restrictive licensing
  • Broader inclusivity and diversity
  • Community-driven improvements

Included data sources

The portfolio currently includes two open bibliometric data sources, OpenAlex and OpenAIRE.

  • OpenAIRE
    • Note: The data extraction has been updated to Walden, the full rewrite of OpenAlex. If you wish to know more or to access older versions of OpenAlex, check out this link
  • OpenAlex

Portfolio Scope

This portfolio offers scripts to start using the API of open bibliometric data sources like OpenAIRE and OpenAlex by providing:

  • Tutorials in form of Jupyter Notebooks.
  • Tools to identify quality issues (in preparation).
  • Scripts to perform customized bibliometric analyses.

Repository Contents

This repository contains Jupyter Notebooks, Python scripts and dependencies for conducting initial bibliometric analyses.

Files:

  • requirements.txt - List of required Python packages (install using pip install -r requirements.txt).
  • open_colors.py - Open source color scheme we use for most the visualizations.
  • language_abbreviations.py - A dictionary mapping of language abbreviations to full language names.
  • swiss_universities.py - A dictionary mapping of Swiss higher education institutions to OpenAlex IDs.
Starter Notebooks
  • openalex_first_analyses.ipynb - Initial analyses of bibliometric data from OpenAlex.
  • openaire_first_analyses.ipynb - Initial analyses of bibliometric data from OpenAIRE.
  • openalex_oa_analyses.ipynb - Analysis of Open Access (OA) published article trends using OpenAlex.
  • openaire_oa_analyses.ipynb - Analysis of Open Access (OA) published article trends using OpenAIRE.
Advanced Notebooks
  • openalex_language_distr_single_institutions.ipynb - Analysis of non-English articles published by a selected institution.
  • openalex_language_distr_multi_institutions.ipynb - Analysis of non-English articles across Swiss HEIs.
  • openalex_research_domains_nonenglish.ipynb - Analysis of the research domain distribution of non-English articles.
  • openalex_landing_page_domains.ipynb - Analysis of the most frequent landing page domains and prefixes across Swiss HEIs.

Getting Started

Prerequisites

Ensure you have Python installed (preferably version 3.8+).

We recommend using a virtual environment to manage dependencies.

Installation

  1. Clone this repository:
    git clone https://github.com/eth-library/tobi-portfolio
  2. Create a virtual environment and activate it:
    python -m venv venv  # Create a virtual environment
    source venv/bin/activate  # Activate on macOS/Linux
    venv\Scripts\activate  # Activate on Windows
  3. Install dependencies:
    pip install -r requirements.txt

Running the Notebooks

Launch Jupyter Notebook and open the desired .ipynb file:

jupyter notebook

Or open the Notebook file in your IDE.

Contributions

Contributions are welcome! If you find issues or want to suggest improvements, please open an issue or send us an E-Mail.

License

Disclaimer: The exclusive right of use and the exclusive exploitation rights of the software belong to ETH Zurich (Federal Act on the Federal Institutes of Technology, Art. 36, para. 2).

We are in the process of registering the software under an Open Source License.

Contributors

  • Elisabeth Giryes

  • Simon Willemin

  • Dr. Julian Dederke

  • Dr. Teresa Kubacka

About TOBI

Towards Open Bibliometric Indicators (TOBI) is a project co-founded by the ETH Library and swissuniversities within the Open Science Program.

More on the TOBI Website: https://eth-library.github.io/tobi/

About

This portfolio provides a collection of Jupyter Notebooks designed to facilitate the use of open bibliometric data sources for higher education institutions. The goal is to offer tools for cleaning, analysing and improving open data for bibliometric analyses.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published