Skip to content

Working with Arrow in Python

Apache Arrow IPC file handling with high-performance columnar data processing.

Terminal window
pip install fairspec

The Arrow plugin provides:

  • load_arrow_table - Load Arrow IPC files into tables
  • save_arrow_table - Save tables to Arrow IPC files
  • ArrowPlugin - Plugin for framework integration

For example:

from fairspec import load_arrow_table, Resource
table = load_arrow_table(Resource(data="table.arrow"))
# High-performance columnar format
from fairspec import load_arrow_table, Resource
# Load from local file
table = load_arrow_table(Resource(data="data.arrow"))
# Load from remote URL
table = load_arrow_table(Resource(data="https://example.com/data.arrow"))
# Load multiple files (concatenated)
table = load_arrow_table(Resource(data=["file1.arrow", "file2.arrow"]))
from fairspec import save_arrow_table
# Save with default options
save_arrow_table(table, path="output.arrow")
from fairspec import load_arrow_table, Resource
# Load from URL
table = load_arrow_table(Resource(data="https://example.com/data.arrow"))
# Load multiple remote files
table = load_arrow_table(Resource(data=[
"https://api.example.com/data-2023.arrow",
"https://api.example.com/data-2024.arrow",
]))