Evolution of Our Dagster File Organization
File structures should make your work easier

Whenever you take on intelligent data engineering tasks, such as refreshing tables in order, running Python processes, or ingesting and outputting data, you need a scheduler. Airflow is the best known of these beasts, but I have a fondness for Dagster.
Dagster focuses on the results of computations (tables, trained model artefacts) rather than on the process itself, which works really well for data science solutions.
Dagster also makes it quite easy to swap implementations: you can read from disk while testing or developing, and read from a database in production.
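
To make the idea of swapping implementations concrete, here is a minimal sketch in plain Python. It deliberately does not use Dagster's API (in Dagster this role is played by resources and I/O managers); every name in it (`TableReader`, `DiskReader`, `DatabaseReader`, `order_count`, the `orders` table) is hypothetical and chosen only for illustration, and SQLite stands in for a production database.

```python
from __future__ import annotations

import csv
import sqlite3
import tempfile
from pathlib import Path
from typing import Protocol


class TableReader(Protocol):
    """Hypothetical interface: anything that can return a table as rows."""

    def read(self, table: str) -> list[dict[str, str]]: ...


class DiskReader:
    """Reads <table>.csv from a folder; meant for testing and development."""

    def __init__(self, folder: Path) -> None:
        self.folder = folder

    def read(self, table: str) -> list[dict[str, str]]:
        with open(self.folder / f"{table}.csv", newline="") as handle:
            return list(csv.DictReader(handle))


class DatabaseReader:
    """Reads a table from a database; SQLite stands in for production here."""

    def __init__(self, connection: sqlite3.Connection) -> None:
        self.connection = connection

    def read(self, table: str) -> list[dict[str, str]]:
        cursor = self.connection.execute(f'SELECT * FROM "{table}"')
        columns = [column[0] for column in cursor.description]
        # Convert values to strings so both readers return the same shape.
        return [dict(zip(columns, map(str, row))) for row in cursor]


def order_count(reader: TableReader) -> int:
    """The computation only knows the interface, not where the data lives."""
    return len(reader.read("orders"))


def make_disk_reader(folder: Path) -> DiskReader:
    # Write a tiny fixture file so the example is self-contained.
    (folder / "orders.csv").write_text("id,amount\n1,10\n2,25\n")
    return DiskReader(folder)


def make_database_reader() -> DatabaseReader:
    connection = sqlite3.connect(":memory:")
    connection.execute("CREATE TABLE orders (id INTEGER, amount INTEGER)")
    connection.executemany(
        "INSERT INTO orders VALUES (?, ?)", [(1, 10), (2, 25), (3, 5)]
    )
    return DatabaseReader(connection)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        print(order_count(make_disk_reader(Path(tmp))))
    print(order_count(make_database_reader()))
```

The point of the sketch is that `order_count` never changes: only the reader handed to it differs between a development run and a production run.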