Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
ARROW-4708: [C++] add multithreaded json reader
- add Converter for conversion of arrays from the parser - add ChunkedArrayBuilder for multithreaded conversion of arrays produced by BlockParser - extract BlockParser's builder management code into a separate class - add serial and threaded implementations of TableReader for parsing from an InputStream @pitrou the sequel, unless you'd like me to break this one up as well Author: Benjamin Kietzman <[email protected]> Author: Wes McKinney <[email protected]> Closes apache#4165 from bkietz/4708-Add-multithreaded-JSON-reader.2 and squashes the following commits: 673ecdd <Benjamin Kietzman> run clang-format e2ad676 <Benjamin Kietzman> Merge branch '4708-Add-multithreaded-JSON-reader.2' of https://github.com/bkietz/arrow into 4708-Add-multithreaded-JSON-reader.2 dba28b8 <Benjamin Kietzman> clearing up conversion errors 90a017d <Wes McKinney> Use real time instead of CPU time 7e43c8e <Benjamin Kietzman> resolve CI failures 74cfd06 <Benjamin Kietzman> Adding further tests and benchmarks for the JSON reader 41d7375 <Benjamin Kietzman> CompareBinary must account for offset when checking emptiness 5b62398 <Benjamin Kietzman> add explicit string conversion for MSVC bc5ec1a <Benjamin Kietzman> re-add partial/completion processing 0b3dbe1 <Benjamin Kietzman> rewrite converter tests 8793c30 <Benjamin Kietzman> comment: conversion errors caught at parse time 4703cd0 <Benjamin Kietzman> refactor to remove chunk_lengths from ...Builder.Finish 1bba412 <Benjamin Kietzman> ensure dictionary arrays in scalar conversion, init null bitmaps 93856af <Benjamin Kietzman> simplify TableReader impl e0b35c9 <Benjamin Kietzman> add new factories to Table fee9dc4 <Benjamin Kietzman> chunked-array-builder: fix converter_ race, store unconverted_fields 6c78b2d <Benjamin Kietzman> fix merge error b6697b3 <Benjamin Kietzman> refactor RawArrayBuilder management into a separate class f31e079 <Benjamin Kietzman> adding reader, converter, and chunked-builder back ebad2ff <Benjamin Kietzman> #include sse-utils in rapidjson-def for sse macros 4984604 <Benjamin Kietzman> use arrow sse macros e12ee75 <Benjamin Kietzman> correct SSE detection 47d37f7 <Benjamin Kietzman> address review comments b3b7f5d <Benjamin Kietzman> fix build error ef624d0 <Benjamin Kietzman> refactoring JSON parser to prepare for multithreaded impl
- Loading branch information