Tag: Apache Arrow
All the articles with the tag "Apache Arrow".
Single-Node Data Engineering: DuckDB, DataFusion, Polars, and LakeSail
Published: at 02:00 PMOptimize single-node data engineering with DuckDB, DataFusion, Polars, and LakeSail. Compare architectures and learn when to transition to Dremio MPP.
Agentic Analytics on the Apache Lakehouse
Published: at 12:00 PMIf you grant a Large Language Model direct access to a raw Amazon S3 bucket filled with Parquet files, it will fail to answer your business questions....
What is Apache Arrow? Erasing the Serialization Tax
Published: at 12:00 PMIf you pull a million records from a database into a Python notebook, the query runs instantly, but the transfer feels endlessly slow. Your compute en...