Skip to content

Latest commit

 

History

History
 
 

rfc

RFCs

The RFC process is documented on our site. Please familiarize yourself with it, before working a new RFC.

Status can be one of these values.

Status Meaning
UNDER REVIEW RFC has been proposed and community is actively debating the design/proposal.
IN PROGRESS The initial phase of implementation is underway.
ONGOING Some or most work has landed; community continues to improve or build follow on phases.
ABANDONED The proposal was not implemented, due to various reasons.
COMPLETED All work is deemed complete.

The list of all RFCs can be found here.

Older RFC content is still here.

RFC Number Title Status
1 CSV Source Support for Delta Streamer COMPLETED
2 ORC Storage in Hudi ONGOING
3 Timeline Service with Incremental File System View Syncing COMPLETED
4 Faster Hive incremental pull queries COMPLETED
5 HUI (Hudi WebUI) ABANDONED
6 Add indexing support to the log file ABANDONED
7 Point in time Time-Travel queries on Hudi table COMPLETED
8 Record level indexing mechanisms for Hudi datasets ONGOING
9 Hudi Dataset Snapshot Exporter COMPLETED
10 Restructuring and auto-generation of docs COMPLETED
11 Refactor of the configuration framework of hudi project ABANDONED
12 Efficient Migration of Large Parquet Tables to Apache Hudi COMPLETED
13 Integrate Hudi with Flink COMPLETED
14 JDBC incremental puller COMPLETED
15 HUDI File Listing Improvements COMPLETED
16 Abstraction for HoodieInputFormat and RecordReader COMPLETED
17 Abstract common meta sync module support multiple meta service COMPLETED
18 Insert Overwrite API COMPLETED
19 Clustering data for freshness and query performance COMPLETED
20 handle failed records IN PROGRESS
21 Allow HoodieRecordKey to be Virtual COMPLETED
22 Snapshot Isolation using Optimistic Concurrency Control for multi-writers COMPLETED
23 Hudi Observability metrics collection ABANDONED
24 Hoodie Flink Writer Proposal COMPLETED
25 Spark SQL Extension For Hudi COMPLETED
26 Optimization For Hudi Table Query ONGOING
27 Data skipping index to improve query performance ONGOING
28 Support Z-order curve COMPLETED
29 Hash Index ONGOING
30 Batch operation UNDER REVIEW
31 Hive integration Improvement ONGOING
32 Kafka Connect Sink for Hudi ONGOING
33 Hudi supports more comprehensive Schema Evolution ONGOING
34 Hudi BigQuery Integration COMPLETED
35 Make Flink MOR table writing streaming friendly UNDER REVIEW
36 HUDI Metastore Server IN PROGRESS
37 Hudi Metadata based Bloom Index ONGOING
38 Spark Datasource V2 Integration IN PROGRESS
39 Incremental source for Debezium ONGOING
40 Hudi Connector for Trino IN PROGRESS
41 [Hudi Snowflake Integration] UNDER REVIEW
42 Consistent Hashing Index IN PROGRESS
43 Compaction / Clustering Service UNDER REVIEW
44 Hudi Connector for Presto ONGOING
45 Asynchronous Metadata Indexing ONGOING
46 Optimizing Record Payload Handling IN PROGRESS
47 Add Call Produce Command for Spark SQL ONGOING
48 LogCompaction for MOR tables UNDER REVIEW
49 Support sync with DataHub ONGOING
50 Improve Timeline Server IN PROGRESS
51 Change Data Capture UNDER REVIEW
52 Introduce Secondary Index to Improve HUDI Query Performance UNDER REVIEW
53 Use Lock-Free Message Queue Improving Hoodie Writing Efficiency IN PROGRESS
54 New Table APIs and Streamline Hudi Configs UNDER REVIEW
55 Improve Hive/Meta sync class design and hierachies ONGOING
56 Early Conflict Detection For Multi-Writer UNDER REVIEW
57 DeltaStreamer Protobuf Support UNDER REVIEW
58 Integrate column stats index with all query engines UNDER REVIEW
59 Multiple event_time Fields Latest Verification in a Single Table UNDER REVIEW
60 Federated Storage Layer UNDER REVIEW
61 Snapshot view management UNDER REVIEW
62 Diagnostic Reporter UNDER REVIEW