-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Insights: apache/iceberg
Overview
59 Pull requests merged by 21 people
-
Docs: Add Stackable to the Vendors page
#12344 merged
Feb 25, 2025 -
Arrow, Parquet, Spark 3.5, Flink 1.20: Avoid deprecated method
#11874 merged
Feb 25, 2025 -
Core: Add "volatile" to HadoopFileIO#hadoopConf
#12388 merged
Feb 25, 2025 -
Spec: Allow Equality Deletes with Row Lineage and Define Behavior
#12230 merged
Feb 24, 2025 -
Build: Bump software.amazon.awssdk:bom from 2.30.21 to 2.30.26
#12379 merged
Feb 24, 2025 -
Build: Bump testcontainers from 1.20.4 to 1.20.5
#12380 merged
Feb 24, 2025 -
[1.8.x] Fix versions in LICENSE/NOTICE
#12365 merged
Feb 24, 2025 -
Build: Bump com.google.cloud:libraries-bom from 26.54.0 to 26.55.0
#12382 merged
Feb 24, 2025 -
Build: Bump nessie from 0.102.5 to 0.103.0
#12383 merged
Feb 24, 2025 -
Build: Bump org.awaitility:awaitility from 4.2.2 to 4.3.0
#12384 merged
Feb 24, 2025 -
[1.8.x] Core: Don't remove trailing slash from absolute paths
#12390 merged
Feb 24, 2025 -
Build: Bump org.xerial:sqlite-jdbc from 3.49.0.0 to 3.49.1.0
#12385 merged
Feb 24, 2025 -
Build: Bump mkdocs-material from 9.6.4 to 9.6.5
#12386 merged
Feb 24, 2025 -
Core: Don't remove trailing slash from absolute paths
#12389 merged
Feb 24, 2025 -
Flink: Fix the comment error in SketchDataStatistics
#12375 merged
Feb 24, 2025 -
[1.8.x] Core: Remove namespace/table/view HEAD endpoints from defaults (#12351)
#12368 merged
Feb 22, 2025 -
API: Move Variant interfaces and serialized implementations to API
#12374 merged
Feb 21, 2025 -
[1.7.x] Fix
{LICENSE,NOTICE}
for Spark Runtime#12355 merged
Feb 21, 2025 -
[1.7.x] Fix
{LICENSE,NOTICE}
for Azure Bundle#12361 merged
Feb 21, 2025 -
[1.7.x] Fix
{LICENSE,NOTICE}
for AWS Bundle#12360 merged
Feb 21, 2025 -
[1.7.x] Fix
{LICENSE,NOTICE}
for GCP Bundle#12359 merged
Feb 21, 2025 -
[1.7.x] Fix
{LICENSE,NOTICE}
for Flink Runtime#12358 merged
Feb 21, 2025 -
[1.7.x] Fix
{LICENSE,NOTICE}
for Kafka Connect Runtime#12353 merged
Feb 21, 2025 -
Nit: Remove additional 'Iceberg' in Puffin footer payload
#12369 merged
Feb 21, 2025 -
Core: Remove namespace/table/view HEAD endpoints from defaults
#12351 merged
Feb 21, 2025 -
[1.7.x] Bump Nessie to 0.120.5 to include updated License/Notice
#12356 merged
Feb 21, 2025 -
API: Move variant to API and add extract expression
#12304 merged
Feb 21, 2025 -
[1.7.x] Kafka: Pin Kafka-Connect version to fix integration tests
#12354 merged
Feb 20, 2025 -
Core, Spark: Remove deprecated code for 1.9.0
#12336 merged
Feb 20, 2025 -
Core: Handle partition evolution case in PartitionStatsUtil#computeStats
#12137 merged
Feb 20, 2025 -
Core: Remove deprecated Util.blockLocations method and StructCopy class
#12320 merged
Feb 20, 2025 -
Spark: Remove Spark 3.3 support
#12279 merged
Feb 20, 2025 -
Bump versions in
{LICENSE,NOTICE}
#12337 merged
Feb 20, 2025 -
Parquet: Remove deprecated VectorizedReader.setRowGroupInfo and ParquetValueReader.setPageSource
#12321 merged
Feb 20, 2025 -
[1.8.x] Build: Revert AWS SDK from 2.30.11 to 2.29.52
#12339 merged
Feb 20, 2025 -
Spark 3.5: Fix Incorrect Spec Used With AddFiles Procedure
#12319 merged
Feb 19, 2025 -
Docs: Add documentation for Rate limiting in Spark Structured Streaming
#12217 merged
Feb 19, 2025 -
Docs: Fix link of catalog in terms.md
#12326 merged
Feb 19, 2025 -
[1.8.x] Kafka: Pin Kafka-Connect version to fix integration tests
#12341 merged
Feb 19, 2025 -
Kafka: Pin Kafka-Connect version to fix integration tests
#12340 merged
Feb 19, 2025 -
[1.8.x] Parquet: Fix performance regression in reader init (#12305)
#12338 merged
Feb 19, 2025 -
Checkstyle: Apply the same generic type naming rules to interfaces and classes
#12333 merged
Feb 19, 2025 -
[1.8.x] Core: Adjust Jackson settings to handle large metadata json (#12224)
#12330 merged
Feb 19, 2025 -
[1.8.x] Parquet: Fix performance regression in reader init (#12305)
#12329 merged
Feb 19, 2025 -
Revert "Core: Serialize
null
when there is no current snapshot"#12312 merged
Feb 19, 2025 -
Fix: fix apache amoro ams doc pic ref
#12332 merged
Feb 19, 2025 -
[1.8.x] Core: Fallback to GET requests for namespace/table/view exists checks
#12328 merged
Feb 19, 2025 -
Core: Fallback to GET requests for namespace/table/view exists checks
#12314 merged
Feb 19, 2025 -
[1.8.x] Revert "Core: Serialize
null
when there is no current snapshot"#12313 merged
Feb 19, 2025 -
Parquet: Fix performance regression in reader init
#12305 merged
Feb 19, 2025 -
Docs: add apache amoro(incubating) with iceberg (#11965)
#11966 merged
Feb 19, 2025 -
Parquet: Fix errorprone warning
#12324 merged
Feb 19, 2025 -
Docs: Add rewrite-table-path in spark procedure
#12115 merged
Feb 19, 2025 -
Parquet: Implement Variant readers
#12139 merged
Feb 18, 2025 -
Docs: Refactor site navigation bar
#12289 merged
Feb 18, 2025 -
Fix CI: Update tests with
UnknownType
fromRequired
toOptional
#12316 merged
Feb 18, 2025 -
Core: add variant type support
#11831 merged
Feb 18, 2025 -
API: Fix TestInclusiveMetricsEvaluator notStartsWith tests
#12303 merged
Feb 18, 2025 -
API: Reject unknown type for required fields and validate defaults
#12302 merged
Feb 18, 2025
25 Pull requests opened by 17 people
-
API, Core: Update inclusive metrics evaluator for extract and transforms
#12311 opened
Feb 18, 2025 -
Throw on `{write.folder-storage.path,write.object-storage.path}` properties
#12315 opened
Feb 18, 2025 -
Wrap variant in PrimitiveHoder so serialization can result same instance
#12317 opened
Feb 18, 2025 -
Core: Print un-pretty metadata files without whitespace
#12318 opened
Feb 18, 2025 -
Use delimited column names in `CreateChangelogViewProcedure`
#12322 opened
Feb 18, 2025 -
Parquet: Implement Variant writers
#12323 opened
Feb 18, 2025 -
Spark: Infer partition spec in ADD_FILES procedure for FileTables than taking latest table spec
#12327 opened
Feb 19, 2025 -
Spec: Add implementation note on `current-snapshot-id`
#12334 opened
Feb 19, 2025 -
Core: Write `null` for `current-snapshot-id` for V3+
#12335 opened
Feb 19, 2025 -
API, Core: Add geometry and geography types support
#12346 opened
Feb 20, 2025 -
WIP Parquet: Support reading/writing geometry and geography columns
#12347 opened
Feb 20, 2025 -
Build: remove Hadoop 2 dependency
#12348 opened
Feb 20, 2025 -
Fix versions in LICENSE/NOTICE
#12364 opened
Feb 21, 2025 -
Core: Apply correct metric configs in GenericAppenderFactory
#12366 opened
Feb 21, 2025 -
AWS: fix GlueCatalog name validation
#12367 opened
Feb 21, 2025 -
Docs: Fix lifecycle and versions in multi-engine-support
#12370 opened
Feb 21, 2025 -
Handling no coordinator and data loss in ICR mode
#12372 opened
Feb 21, 2025 -
OpenAPI: Use more clear language in recommending error responses
#12376 opened
Feb 22, 2025 -
Build: Bump junit from 5.11.4 to 5.12.0
#12378 opened
Feb 23, 2025 -
Build: Bump junit-platform from 1.11.4 to 1.12.0
#12381 opened
Feb 23, 2025 -
Build: Bump junit to v5.12.0
#12391 opened
Feb 23, 2025 -
[hive]:Fix Hive table creation syntax errors
#12394 opened
Feb 24, 2025 -
Spark: Bump Spark 3.5 to 3.5.5
#12396 opened
Feb 24, 2025 -
Core: Add support for Avro's timestamp-millis LogicalType in DataReader
#12397 opened
Feb 24, 2025 -
Build: Upgrade to Gradle 8.13
#12398 opened
Feb 25, 2025
16 Issues closed by 5 people
-
Refactor TestIcebergCommitter state recovery unit tests to use checkpointId=1
#10942 closed
Feb 25, 2025 -
Support committed callback
#10936 closed
Feb 25, 2025 -
Support custom spark procedure in plugin mode for iceberg
#10906 closed
Feb 25, 2025 -
Iceberg materialized view
#10890 closed
Feb 25, 2025 -
Iceberg 1.8.0 Breaking oauth2 authentication
#12373 closed
Feb 24, 2025 -
Contribution Proposal: Kafka to Iceberg Tutorial Repo
#10933 closed
Feb 24, 2025 -
Spec: The spec about metadata key `schema-id` in manifest file do not match the lib implementation
#10927 closed
Feb 24, 2025 -
Is dataFiles() Method Retryable?
#10750 closed
Feb 24, 2025 -
Add batch file deletion support to FileIO
#12387 closed
Feb 23, 2025 -
Do not override finalize
#10901 closed
Feb 20, 2025 -
Partition spec mismatch when 'compatibility.snapshot-id-inheritance.enabled' is true
#12273 closed
Feb 19, 2025 -
Nested column filter expression
#12331 closed
Feb 19, 2025 -
Rest catalog: write.metadata.delete-after-commit set true not deleting expired metadata files
#10894 closed
Feb 19, 2025 -
Some schema updates do not support dots inside a field name
#10875 closed
Feb 19, 2025
12 Issues opened by 12 people
-
Add support for Avro's timestamp-millis LogicalType in DataReader
#12395 opened
Feb 24, 2025 -
Dataproc metastore on gRPC for iceberg tables is causing errors
#12377 opened
Feb 22, 2025 -
Spark Iceberg REST Catalog refresh token
#12363 opened
Feb 21, 2025 -
Critical Warnings For Users of Iceberg Kafka Connect Need To Be Documented
#12357 opened
Feb 20, 2025 -
Allow Custom Transactional ID in Iceberg Kafka Connect
#12352 opened
Feb 20, 2025 -
How can we override auth.session-timeout-ms of catalog or avoid cache at all?
#12350 opened
Feb 20, 2025 -
Add spark 4.0.0-preview2 support
#12349 opened
Feb 20, 2025 -
does iceberg support spark connect ?
#12345 opened
Feb 20, 2025 -
Limit the delete file/records
#12343 opened
Feb 19, 2025 -
Add possibility of configuration for Coordinator and Worker prefix
#12342 opened
Feb 19, 2025 -
Add option to provide partition spec in spark ADD_FILES procedure
#12325 opened
Feb 19, 2025 -
Serialize `null` for `current-snapshot-id` when there is no current snapshot for ≥V3
#12310 opened
Feb 18, 2025
74 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
Core: Interface changes for separating rewrite planner and runner
#12306 commented on
Feb 21, 2025 • 42 new comments -
Core: Add Variant logical type for Avro
#12238 commented on
Feb 25, 2025 • 25 new comments -
Data: Add partition stats writer and reader
#11216 commented on
Feb 25, 2025 • 24 new comments -
WIP: Interface based DataFile reader and writer API
#12298 commented on
Feb 25, 2025 • 24 new comments -
support create table like in flink catalog
#12199 commented on
Feb 21, 2025 • 16 new comments -
AWS: Integrate S3 analytics accelerator library
#12299 commented on
Feb 25, 2025 • 14 new comments -
Core,Api: Add overwrite option when register external table to catalog
#12228 commented on
Feb 25, 2025 • 6 new comments -
Fix IndexOutOfBounds exception in FileFormat#fromFileName
#12301 commented on
Feb 21, 2025 • 5 new comments -
Spec additions for encryption
#12162 commented on
Feb 25, 2025 • 5 new comments -
S3: Disable strong integrity checksums
#12264 commented on
Feb 25, 2025 • 4 new comments -
Materialized View Spec
#11041 commented on
Feb 21, 2025 • 3 new comments -
Auth Manager API part 6: API enablement
#12197 commented on
Feb 24, 2025 • 3 new comments -
Spark: Detect dangling DVs properly
#12270 commented on
Feb 19, 2025 • 2 new comments -
Core: Extended header support for RESTClient implementations
#12194 commented on
Feb 24, 2025 • 2 new comments -
Use Snapshot's statistics file in SparkScan
#11040 commented on
Feb 21, 2025 • 1 new comment -
Data: Handle case where partition location is missing for `TableMigrationUtil`
#12212 commented on
Feb 24, 2025 • 1 new comment -
Path parameters should encode spaces as '%20' instead of '+'
#12309 commented on
Feb 18, 2025 • 1 new comment -
List data and metadata directories instead of table root
#12278 commented on
Feb 18, 2025 • 1 new comment -
Missing records after compaction using `rewrite_data_files`
#11014 commented on
Feb 24, 2025 • 0 new comments -
Build: Add plugin to generate license and notice files
#11977 commented on
Feb 23, 2025 • 0 new comments -
Spark3.5: Standardizing Error Handling in Iceberg Spark Module - TestViews
#11993 commented on
Feb 22, 2025 • 0 new comments -
Core, Test: Parsing and Writing Tests for V3 Metadata
#12025 commented on
Feb 22, 2025 • 0 new comments -
ORC: Fix null map values and list elements in vectorized reads
#12030 commented on
Feb 24, 2025 • 0 new comments -
fix(iceberg/kafka-connect): add empty string check before string-to-number conversion
#12063 commented on
Feb 23, 2025 • 0 new comments -
Fix rename then add column with same name failure if the renamed columns was an identity partition key
#12064 commented on
Feb 23, 2025 • 0 new comments -
Kafka Connect: Add delta writer support
#12070 commented on
Feb 24, 2025 • 0 new comments -
Docs: Add warning about `snapshot_ids` arg in `expired_snapshots` procedure
#12291 commented on
Feb 22, 2025 • 0 new comments -
Core: Checks for Equality Delete when Row Lineage is Enabled - Using Snapshot Summary
#12075 commented on
Feb 24, 2025 • 0 new comments -
Backport #11702 to FLink1.19 and 1.18
#12080 commented on
Feb 25, 2025 • 0 new comments -
Core: Select for rewriting the files belonging to old partitioning schemes
#12083 commented on
Feb 25, 2025 • 0 new comments -
Core: Change RemoveSnapshots to remove unused schemas
#12089 commented on
Feb 24, 2025 • 0 new comments -
Spark: Support singular form of years, months, days, and hours functions
#12117 commented on
Feb 18, 2025 • 0 new comments -
Retry on NoSuchNamespaceException not found in rename table for rest catalog
#12159 commented on
Feb 25, 2025 • 0 new comments -
AWS, AZURE: Move docker-based tests to integration test source
#12274 commented on
Feb 19, 2025 • 0 new comments -
Spark: Structured Streaming read limit support follow-up
#12260 commented on
Feb 18, 2025 • 0 new comments -
Spark: support rewrite on specified target branch
#12257 commented on
Feb 18, 2025 • 0 new comments -
SPARK: Remove dependency on hadoop's filesystem class from remove orphan files
#12254 commented on
Feb 25, 2025 • 0 new comments -
Spark: Rewrite V2 deletes to V3 DVs
#12250 commented on
Feb 19, 2025 • 0 new comments -
Core: Remove duplicate definitions of MAX_FILE_GROUP_SIZE_BYTES
#12222 commented on
Feb 19, 2025 • 0 new comments -
Accessing Minio with Pyiceberg
#10709 commented on
Feb 24, 2025 • 0 new comments -
ICEBERG performance is slow when querying tables with a large number of partitions.
#8161 commented on
Feb 24, 2025 • 0 new comments -
Type Promotion: Long to Timestamp
#9065 commented on
Feb 23, 2025 • 0 new comments -
software.amazon.awssdk.services.s3.model.S3Exception: The bucket you are attempting to access must be addressed using the specified endpoint.
#11997 commented on
Feb 22, 2025 • 0 new comments -
[REST Catalog] OAuth 2 grant type "refresh_token" not implemented
#12196 commented on
Feb 21, 2025 • 0 new comments -
Cherrypick the data rows [deleted or old values] from a past snapshot
#12271 commented on
Feb 21, 2025 • 0 new comments -
Allow to configure the tables' namespace when using dynamic routing with Kafka Connect
#12269 commented on
Feb 21, 2025 • 0 new comments -
TestS3FileIO fails locally (on OSX with Docker Desktop) due to missing Content-MD5 header during delete
#12237 commented on
Feb 21, 2025 • 0 new comments -
Support row filter & column masking in REST spec
#10909 commented on
Feb 21, 2025 • 0 new comments -
Extends Iceberg table stats API to allow publish data and stats atomically
#6442 commented on
Feb 21, 2025 • 0 new comments -
RewriteDataFiles maintenance action never converges
#6669 commented on
Feb 20, 2025 • 0 new comments -
MERGE INTO requires sorting in already sorted iceberg tables
#10891 commented on
Feb 19, 2025 • 0 new comments -
Spaces in path parameters are encoded as '+' instead of '%20'
#12308 commented on
Feb 19, 2025 • 0 new comments -
JdbcCatalog fails to initialize with MS SQL Server
#10068 commented on
Feb 19, 2025 • 0 new comments -
Support for Shallow Clone / Zero Copy Cloning in Apache Iceberg
#12263 commented on
Feb 19, 2025 • 0 new comments -
Add properties support for HadoopTables.load()
#12251 commented on
Feb 19, 2025 • 0 new comments -
deadlock when spark call delete row postition
#10987 commented on
Feb 19, 2025 • 0 new comments -
Core: Refactor Table Metadata Tests
#11947 commented on
Feb 21, 2025 • 0 new comments -
Kafka Connect: Add SMTs for Debezium and AWS DMS
#11936 commented on
Feb 24, 2025 • 0 new comments -
AWS: Add support for enabling access to S3 Requester Pays bucket
#11915 commented on
Feb 22, 2025 • 0 new comments -
Use SupportsPrefixOperations for Remove OrphanFile Procedure
#11906 commented on
Feb 25, 2025 • 0 new comments -
Kafka Connect: Add the configuration option to provide a transactional id prefix to use
#11780 commented on
Feb 19, 2025 • 0 new comments -
Core: Unimplement Map from CharSequenceMap to obey contract
#11704 commented on
Feb 23, 2025 • 0 new comments -
Core, Rest: Enable useSystemProperties on RESTClient
#11548 commented on
Feb 19, 2025 • 0 new comments -
Spark: add property to disable client-side purging in spark
#11317 commented on
Feb 21, 2025 • 0 new comments -
API: Define RepairManifests action interface
#10784 commented on
Feb 24, 2025 • 0 new comments -
Core: Add KLL Datasketch as standard blob types to puffin file
#8202 commented on
Feb 20, 2025 • 0 new comments -
Manifest list encryption
#7770 commented on
Feb 25, 2025 • 0 new comments -
Move docker-specific tests to integrationTest configuration
#12236 commented on
Feb 25, 2025 • 0 new comments -
Flink Table Maintenance
#10264 commented on
Feb 25, 2025 • 0 new comments -
Update Table Error: UPDATE TABLE is not supported temporarily.
#9960 commented on
Feb 25, 2025 • 0 new comments -
Proxy Settings for catalog REST API client
#12059 commented on
Feb 24, 2025 • 0 new comments -
Spec and Doc inconsistency about supported schema evolution for list and map types
#11020 commented on
Feb 24, 2025 • 0 new comments -
org.apache.thrift.transport.TSaslTransport: SASL negotiation failure
#11019 commented on
Feb 24, 2025 • 0 new comments -
`expire_snapshots` sudden increase in compute/memory usage
#11017 commented on
Feb 24, 2025 • 0 new comments