Convert JSON Lines to ORC

Convert your JSON Lines / NDJSON data to Apache ORC quickly

JSON Lines / NDJSON

JSON Lines (also known as NDJSON — Newline Delimited JSON) is a text format where each line is a self-contained, valid JSON value, typically a JSON object. It is widely used for streaming data, log files, and large data exports because files can be read and written one record at a time without loading the entire document into memory. Common file extensions are .jsonl and .ndjson.
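
The record-at-a-time property can be illustrated with a minimal Python sketch that uses only the standard library. The helper name iter_json_lines and the sample data are hypothetical, chosen for illustration; skipping blank lines is an assumption about how lenient a reader should be, not a requirement of the format.

```python
import io
import json
from typing import Iterator, TextIO


def iter_json_lines(stream: TextIO) -> Iterator[object]:
    """Yield one parsed JSON value per non-blank line of the stream."""
    for line_number, line in enumerate(stream, start=1):
        line = line.strip()
        if not line:
            continue  # assumption: tolerate blank lines such as a trailing newline
        try:
            yield json.loads(line)
        except json.JSONDecodeError as exc:
            # Report which line failed so a bad record is easy to locate.
            raise ValueError(f"line {line_number}: invalid JSON: {exc}") from exc


# Hypothetical sample input; a real file would be opened with open(path, encoding="utf-8").
sample = io.StringIO('{"id": 1, "name": "a"}\n{"id": 2, "name": "b"}\n')
records = list(iter_json_lines(sample))
```

Because the function is a generator, only one line is held in memory at a time; collecting the results into a list, as done here, is only reasonable for small inputs.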

Apache ORC

Apache ORC (Optimized Row Columnar) is a self-describing, columnar file format that supports high compression ratios and fast data retrieval. ORC supports complex types, including structs, lists, maps, and unions. ORC files are divided into blocks of data called stripes. Each stripe carries statistics (such as min, max, sum, and count) and lightweight indexes that readers can use to skip irrelevant data during queries. ORC also supports predicate pushdown: filters are applied as the data is read from disk, which reduces the amount of data loaded into memory and processed. Because it combines strong compression with fast access, ORC is particularly well suited to read-heavy workloads and is commonly used in data warehousing and analytics applications.
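
The idea behind stripe statistics and predicate pushdown can be sketched in plain Python. This is a conceptual toy, not ORC's actual file layout or reader: the Stripe class, the function names, and the stripe size are all hypothetical, and only a single integer column with min/max statistics is modeled.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Stripe:
    """Toy stand-in for a stripe: a chunk of values plus min/max statistics."""
    values: List[int]
    minimum: int
    maximum: int


def build_stripes(values: List[int], stripe_size: int) -> List[Stripe]:
    """Split a column into fixed-size chunks and record each chunk's statistics."""
    stripes = []
    for start in range(0, len(values), stripe_size):
        chunk = values[start:start + stripe_size]
        stripes.append(Stripe(chunk, min(chunk), max(chunk)))
    return stripes


def scan_greater_than(stripes: List[Stripe], threshold: int) -> Tuple[List[int], int]:
    """Return values above the threshold and the number of stripes actually read."""
    matches: List[int] = []
    stripes_read = 0
    for stripe in stripes:
        if stripe.maximum <= threshold:
            continue  # statistics prove no value can match, so skip the whole stripe
        stripes_read += 1
        matches.extend(v for v in stripe.values if v > threshold)
    return matches, stripes_read


stripes = build_stripes(list(range(1, 13)), stripe_size=4)
matches, stripes_read = scan_greater_than(stripes, threshold=8)
print(matches, stripes_read)  # [9, 10, 11, 12] 1
```

In this example the filter touches one of three stripes. The benefit depends on how the data is ordered: statistics on sorted or clustered columns exclude far more stripes than statistics on randomly distributed values.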