public
Published on 3/26/2025
dlthub/dlt-rules
Your LLM will finally know what the letters `dlt` stand for!
Prompts
dlt rules
Basics
- dlt means "data load tool". It is an open source Python library installable via
pip install dlt. - To create a new pipeline, use
dlt init <source> <destination>. - The dlt library comes with the
dltCLI. Add the--helpflag to any command to verify its specs. - The preferred way to configure dlt (sources, resources, destinations, etc.) is to use
.dlt/config.tomland.dlt/secrets.toml. Make sure to fill required fields when adding a source or resource. - During development, always set
dev_mode=Truewhen creating a dlt Pipeline.pipeline = dlt.pipeline(..., dev_mode=True). This allows to reset the pipeline's schema and state between iterations. - Use type annotations only if you're certain you're properly importing the types.
- Use dlt's REST API source if loading data from the web.
- Use dlt's SQL source when loading data from an SQL database or backend.
- Use dlt's filesystem source if loading data from files (CSV, PDF, Parquet, JSON, and more). This works for local filesystems and cloud buckets (AWS, Azure, GCP, Minio, etc.).