Overview
The bdi-kit is a library that assist users in performing data harmonization. It provides state-of-the-art tools to streamline the process of integrating and transforming disparate datasets (with a focus on biomedical data), and includes APIs for performing tasks such as:
Schema matching
Value matching
Data transformation to a target schema/standard
Warning: bdi-kit is currently in alpha stage and under heavy development. Expect APIs to change.
You can find the source code in our GitHub repository.
Getting Started
- Installation
- Getting Started
- Matching the table schema to GDC standard vocabulary
- Generating a harmonized table
- Generating a harmonized table with value mappings
- Verifying the schema mappings
- Viewing the column domains
- Finding correct value mappings
- Verifying multiple value mappings at once
- Fixing remaining value mappings
- Combining custom user mappings with suggested mappings
- Examples
API Reference