This document describes how to set up the development environment for the Parquet Converter project.
This project is part of Sami Adnan’s DPhil research at the Nuffield Department of Primary Care Health Sciences, University of Oxford.
git clone https://github.com/sami5001/parquet-converter.git
cd parquet-converter
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
pip install -e ".[dev]"
pre-commit install
For production or reproducible environments, use the locked dependencies:
pip install -r requirements-lock.txt
To update the locked dependencies:
pyproject.toml
if neededpip-compile pyproject.toml --output-file=requirements-lock.txt
For development with the latest package versions:
pip install -r requirements-dev.txt
pytest
With coverage:
pytest --cov=parquet_converter
The project uses several tools to maintain code quality:
These tools are configured in pyproject.toml
and .pre-commit-config.yaml
.
To manually run these tools:
black .
isort .
flake8
mypy parquet_converter
python -m build
pip install git+https://github.com/sami5001/parquet-converter.git
When working with this project, it may be helpful to understand the Parquet format in detail: