BDI-Kit Documentation
Version: 0.10.0
BDI-Kit is a toolkit for data harmonization (see our GitHub repository). It provides tools for integrating and transforming disparate datasets, with a particular focus on biomedical data. BDI-Kit includes methods for tasks such as:
Schema matching
Value matching
Data transformation to a target table or data model
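To make the three tasks above concrete, here is a toy sketch in plain Python. It does not use BDI-Kit and does not reflect its API or its matching methods: the function names (`toy_match_columns`, `toy_match_values`, `toy_transform`), the sample columns, and the simple token-overlap and prefix heuristics are all assumptions chosen purely for illustration.

```python
def tokens(name):
    """Split a column name into a set of lowercase tokens on underscores."""
    return set(name.lower().split("_"))


def jaccard(a, b):
    """Token-set Jaccard similarity between two column names."""
    ta, tb = tokens(a), tokens(b)
    return len(ta & tb) / len(ta | tb)


def toy_match_columns(source_columns, target_columns):
    """Schema matching (toy): map each source column to its most similar target column."""
    mapping = {}
    for src in source_columns:
        best = max(target_columns, key=lambda tgt: jaccard(src, tgt))
        if jaccard(src, best) > 0:  # skip columns with no token overlap at all
            mapping[src] = best
    return mapping


def toy_match_values(source_values, target_values):
    """Value matching (toy): map a source value to the unique target value it prefixes."""
    mapping = {}
    for value in source_values:
        prefix = str(value).lower()
        candidates = [t for t in target_values if t.lower().startswith(prefix)]
        if len(candidates) == 1:  # keep only unambiguous matches
            mapping[value] = candidates[0]
    return mapping


def toy_transform(rows, column_map, value_maps):
    """Data transformation (toy): rename columns and rewrite values for the target table."""
    harmonized = []
    for row in rows:
        new_row = {}
        for src, tgt in column_map.items():
            value = row[src]
            # Values without a mapping are passed through unchanged.
            new_row[tgt] = value_maps.get(tgt, {}).get(value, value)
        harmonized.append(new_row)
    return harmonized


# Hypothetical source table and target schema.
source_rows = [
    {"patient_age": 54, "patient_gender": "F"},
    {"patient_age": 61, "patient_gender": "M"},
]
column_map = toy_match_columns(["patient_age", "patient_gender"], ["age", "gender"])
value_maps = {"gender": toy_match_values(["F", "M"], ["female", "male"])}
harmonized = toy_transform(source_rows, column_map, value_maps)
```

Heuristics this simple break down quickly on real data (for example, a source column named `sex` shares no tokens with a target column named `gender`), which is the kind of gap that dedicated matching methods are meant to close.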
BDI-Kit can be used in two complementary ways:
🐍 Python API — Programmatic data harmonization workflows
🤖 AI Agent — Conversational data harmonization using natural language
The following quick demo illustrates how BDI-Kit can be used through both the Python API and the AI agent:
For more details about the design and capabilities of BDI-Kit, see our papers:
BDI-Kit: An AI-Powered Toolkit for Biomedical Data Harmonization (preferred citation)
BDI-Kit Demo: A Toolkit for Programmable and Conversational Data Harmonization