Blog
Faster tests with PySpark
Unit testing and data warehouse queries
Slicing polygons with linestrings
Adapting split strategies from Shapely to TurfJS
External DAG triggers in Airflow
Executing DAGs from within other DAGs and managing shared context
Localized GeoDataFrame UTM reprojection
Programmatically identify UTM zones during meter-projection conversion
Testing patterns with PySpark
Unit testing multistep transformation pipelines
Identifying representative core of diverging trace set
Heuristic methods to extract a representative core segment from a corridor trace set
Dynamic bytestring-encoded arrays
Using construct-based declarative structure with variable-lengthed arrays
Hierarchical clustering of trip traces
Dynamic resolution of S2 cells to represent trip paths