how I write tests for dagster
unit-testing data engineering
It is vital that your data pipelines work as intented, but it is also easy to make mistakes. That is why we write tests.
Testing in Airflow was a fucking pain. The best you could do was create a complete deployment in a set of containers and test your work in there. Or creating python packages, test those and hope it would work in the larger airflow system. In large deployments you could not add new dependencies without breaking other stuff, so you have to either be creative with the python /airflow standard library or run your work outside of airflow.
[Read More]