Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pull new version #3

Merged
merged 331 commits into from
Nov 2, 2017
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
Show all changes
331 commits
Select commit Hold shift + click to select a range
76d1320
Propagate HAVE_PIPE2 compile time value to files that use it
smukil Aug 17, 2017
605118b
IMPALA-5452: Rewrite test case to avoid 'pos'.
Aug 17, 2017
55e680b
IMPALA-5677: limit clean page memory consumption
timarmstrong Aug 10, 2017
4979a5e
IMPALA-5809: Relax max_minidumps in breakpad test
Aug 17, 2017
ad71e2b
IMPALA-4833: Compute precise per-host reservation size, pt2
Aug 15, 2017
47efbbe
IMPALA-5776: Write partial tuple to the correct mempool
tbobrovytsky Aug 9, 2017
198aa88
IMPALA-5504: Fix TupleIsNullPredicate evaluation.
Aug 18, 2017
ae755dc
IMPALA-5108: idle_session_timeout kicks in later than expected
pranay2017 Aug 17, 2017
fc098fb
IMPALA-5800: [Addendum] Fix bad import of squeasel.c
Aug 21, 2017
ffa307e
IMPALA-5819: DCHECK in HdfsTextScanner::Close()
timarmstrong Aug 21, 2017
9ea0594
Bump Kudu version to 44a820b
Aug 21, 2017
4bc8264
IMPALA-5158,IMPALA-5236: account for unused buffer pool reservations
timarmstrong Jul 7, 2017
c117831
IMPALA-5641: mem-estimate should never be less than mem-reservation
timarmstrong Aug 14, 2017
9af9df3
IMPALA-5644,IMPALA-5810: Min reservation improvements
Aug 15, 2017
5381eb2
IMPALA-5825: Catch exceptions thrown by TSSLSocketFactory c'tor
Aug 22, 2017
42110bb
IMPALA-5612: join inversion should factor in parallelism
timarmstrong Jul 1, 2017
4da9a5d
IMPALA-5820: Fix string format() syntax in test_scanners_fuzz.py
Aug 21, 2017
89beccb
KUDU-2087. Fix failure to map Kerberos principal to username with Fre…
toddlipcon Aug 1, 2017
b4a2d8a
Cherry-pick constant renaming for be/src/kudu from Kudu commit e719b
toddlipcon Jun 12, 2017
7889365
IMPALA-5749: coordinator race hits DCHECK 'num_remaining_backends_ > 0'
twmarshall Aug 3, 2017
4206c48
Minor comment improvements in buffer-pool.h
Aug 22, 2017
e09a1f0
IMPALA-3208: max_row_size option
timarmstrong Aug 9, 2017
1bbe784
IMPALA-992: Rerun past queries from history in shell
glglwty Aug 15, 2017
17623bd
IMPALA-5788: Fix agg node crash when grouping by nondeterministic exprs
Aug 17, 2017
1aa8245
IMPALA-5352: Age out unused file handles from the cache
joemcdonnell Jun 29, 2017
dca9a28
IMPALA-4990: fix run_tests.py --update_results
dhecht Feb 24, 2017
dccc475
IMPALA-5815: right outer join returns invalid memory
timarmstrong Aug 18, 2017
b7a9a6d
IMPALA-5573: Add decimal codegen in text scanner
glglwty Aug 18, 2017
0ab5ccb
IMPALA-5823: fix SET_DENY_RESERVATION_PROBABILITY
timarmstrong Aug 21, 2017
af17906
IMPALA-5798: ASAN use-after-poison in Parquet decoder
twmarshall Aug 22, 2017
45c89f6
IMPALA-5512: [DOCS] Distinguish GROUP BY and ORDER BY in SELECT syntax
Aug 23, 2017
67b4fe3
Hide some deprecated flags
Aug 22, 2017
1d62253
IMPALA-5775: (Addendum) Make SSL cluster actually come up in test_cli…
Aug 18, 2017
730f584
IMPALA-5602: Fix query optimization for kudu and datasource tables
Aug 1, 2017
0729c39
IMPALA-5811: Add 'backends' tab to query details pages
Aug 16, 2017
81be352
IMPALA-5784: Separate planner and user set query options in profile
Aug 18, 2017
b9836a6
IMPALA-5570: fix spilling null-aware anti join
timarmstrong Jun 27, 2017
5c5ab68
IMPALA-5309: [DOCS] Add TABLESAMPLE clause to SELECT statement
Jul 27, 2017
19bcdcc
IMPALA-5780,IMPALA-5779: extra spilling tests
timarmstrong Aug 23, 2017
acbef2a
IMPALA-5648: fix count(*) mem estimate regression
timarmstrong Aug 23, 2017
09c423a
IMPALA-5531: Fix correctness issue in correlated aggregate subqueries
dtsirogiannis Aug 15, 2017
7d20d12
IMPALA-5625: stress test: write profile when queries fail
Jun 30, 2017
a941b70
IMPALA-5799: Kudu DML can crash if schema has changed
twmarshall Aug 16, 2017
7f7aca2
IMPALA-5713: always reserve memory for preaggs
timarmstrong Aug 18, 2017
06428d8
IMPALA-5827: add test for failure to repartition in hash join
timarmstrong Aug 24, 2017
2159f7e
IMPALA-5836: Improvements to Eclipse frontend configuration.
Aug 24, 2017
6eb85dd
IMPALA-5211: Simplifying ifnull/isnull/nvl where conditional is a lit…
Aug 22, 2017
06dd423
IMPALA-4669: [KRPC] Add kudu_rpc library to build
Nov 8, 2016
d305231
IMPALA-5846: Fix output path for kudu libraries
Aug 24, 2017
f5dc719
KUDU-1929: [rpc] Allow using encrypted private keys for TLS
smukil Apr 5, 2017
4040870
Update stress test with admission control rejected messages
timarmstrong Aug 26, 2017
dd35a28
IMPALA-4847: Simplify HdfsTable block metadata loading code
Jul 17, 2017
e235ebf
IMPALA-5108: Followup change to do the cleanup.
pranay2017 Aug 24, 2017
3fe0e4c
IMPALA-5838: Improve errors on AC buffer mem rejection
Aug 25, 2017
2ec7368
IMPALA-5853: fix GetResultSetMetadata() error message for invalid que…
dhecht Aug 28, 2017
2166585
IMPALA-5852: improve MINIMUM_RESERVATION_UNAVAILABLE error
timarmstrong Aug 28, 2017
0ef2fa1
IMPALA-467: enable disabled metric verification
timarmstrong Aug 23, 2017
d4c5ac8
IMPALA-5840: Don't write page-level statistics in Parquet files.
Aug 24, 2017
eb0884b
IMPALA-2234: remove workaround from stress test
timarmstrong Aug 28, 2017
12d7edd
IMPALA-5830: SET_DENY_RESERVATION_PROBABILITY test
timarmstrong Aug 24, 2017
5608a03
IMPALA-4856: Rename thrift-deps to gen-deps
Aug 26, 2017
8363730
IMPALA-5857: avoid invalid free of hedged read metrics
timarmstrong Aug 29, 2017
c12430e
KUDU-2041: Fix negotiation deadlock
Apr 14, 2017
ebd0b24
KUDU-2065: Support cancellation for outbound RPC call
Jul 13, 2017
511491f
IMPALA-5850: Cast sender partition exprs under unions.
Aug 29, 2017
ccaefa9
IMPALA-5863: Include-what-you-use for Kudu client
twmarshall Aug 29, 2017
a762cc7
IMPALA-5855: reserve enough memory for preaggs
timarmstrong Aug 29, 2017
4a6cd01
KUDU-1865: Avoid heap allocation for payload slices
Jul 20, 2017
a7ec36d
IMPALA-5617: Include full workload name in tpch_nested query filenames
Aug 30, 2017
de5bae8
Updates several .gitignore files.
vercegovac Aug 30, 2017
aff3c7e
Bump Kudu version to 1c70e5d
twmarshall Aug 30, 2017
f3642bb
IMPALA-4856, IMPALA-4872: Include KRPC services in plan fragment's de…
Aug 19, 2017
4cfbaad
KUDU-2091: Certificates with intermediate CA's do not work with Kudu
smukil Aug 3, 2017
14a9a83
IMPALA-5816: xfail wildcard TLS cert tests
Aug 30, 2017
49dc4d8
IMPALA-5653: Remove "unlimited" process mem_limit option
timarmstrong Aug 25, 2017
667244a
IMPALA-5854: Update external hadoop versions
Aug 29, 2017
ffc8d81
IMPALA-5873: Check for existence of sync_file_range()
Aug 31, 2017
656ca75
KUDU-1942. Kerberos fails to log in on hostnames with capital letters
toddlipcon Aug 17, 2017
ead02c6
security: only lookup hostname if _HOST substitution is required
toddlipcon Aug 17, 2017
820923c
rpc: move ConnectionId to its own file
toddlipcon Aug 15, 2017
f06d116
rpc: some small cleanup in ConnectionId
toddlipcon Aug 15, 2017
4d55b24
Revert "IMPALA-5854: Update external hadoop versions"
smukil Sep 1, 2017
6fa0d87
Update thirdparty dependencies
Sep 1, 2017
9dc07b9
IMPALA-5871: KuduPartitionExpr incorrectly handles its child types
twmarshall Aug 31, 2017
11fc96b
[security] avoid kerberos ticket renewal and only reacquire
smukil Aug 22, 2017
288e073
KUDU-2032 (part 1): pass pre-resolution hostname into RPC proxies
toddlipcon Aug 16, 2017
50bcc31
IMPALA-5849: Remove compile-time checks for OpenSSL > 1.0.0
Aug 25, 2017
d389be4
IMPALA-5891: fix PeriodicCounterUpdater initialization
timarmstrong Sep 1, 2017
e2189b8
IMPALA-5210: Count rows and collection items in parquet scanner separ…
glglwty Aug 24, 2017
db6636a
IMPALA-5885: free runtime filter allocations in Parquet
timarmstrong Aug 31, 2017
330d8b0
IMPALA-2810: Remove column stats restoration when altering table
glglwty Aug 26, 2017
7ef97bc
IMPALA-5589: change "set" in impala-shell to show empty string for un…
Aug 29, 2017
923a629
IMPALA-4826: Fix error during a scan on repeated root schema in Parquet.
gaborkaszab Aug 28, 2017
8847e4a
IMPALA-5892: Allow reporting status independent of fragment instance
joemcdonnell Sep 1, 2017
5e7430b
IMPALA-5812: Fix NPE when joining on empty const select
Sep 5, 2017
512a94c
IMPALA-5888: free other local allocations in Parquet
timarmstrong Sep 1, 2017
b794a25
IMPALA-5902: add ThreadSanitizer build
timarmstrong Sep 1, 2017
4af9bac
IMPALA-5317: add DATE_TRUNC() function
sakinapelli Jun 16, 2017
1630a54
IMPALA-4620: Refactor evalcost computation in query analysis
glglwty Aug 30, 2017
6a5052b
IMPALA-5750: Catch exceptions from boost thread creation
joemcdonnell Aug 17, 2017
ff80fe9
Bump Kudu version to a71ecfd
Sep 7, 2017
2eb84f9
IMPALA-2107: [DOCS] Document base64*code() functions
Aug 30, 2017
63be9a2
IMPALA-5867: Fix bugs parsing 2-digit year
glglwty Aug 30, 2017
57aa8aa
IMPALA-5912: fix crash in trunc(..., "WW") in release build
timarmstrong Sep 8, 2017
934df8a
IMPALA-3897 Codegen null-aware constant in PHJ::ProcessBuildBatch()
Aug 19, 2017
5b5ba78
IMPALA-3642: Adding backend addresses to error statuses for some scra…
Aug 24, 2017
d88444f
IMPALA-5894: [DOCS] Clarify placement of STRAIGHT_JOIN hint
Sep 11, 2017
38fdc08
IMPALA-4082: Remove todo item in getRegionsInRange
glglwty Sep 9, 2017
816c9a6
Bump Kudu version to 3f49724
twmarshall Sep 12, 2017
b10cbe2
IMPALA-4987: Fix flaky test test_row_availability.py
glglwty Sep 12, 2017
7269e95
IMPALA-3516: Avoid writing to /tmp in testing
glglwty Sep 11, 2017
bf5decb
IMPALA-5597: Try casting targetExpr when building runtime filter plan
glglwty Sep 1, 2017
aa597be
IMPALA-5890: Abort queries if scanner hits IO errors
Sep 5, 2017
55a00c3
IMPALA-5905: add script for all-build-options job
timarmstrong Sep 12, 2017
7d9fc00
IMPALA-5923: Print binary ID as hex in ChildQuery::Cancel()
Sep 13, 2017
c0330c4
IMPALA-5905: build-all-flag-combinations addendum
timarmstrong Sep 13, 2017
2f7f9c5
IMPALA-5926: Avoid printing expensive stack when closing a session
ttmahdy Sep 13, 2017
83985a1
IMPALA-5417: make I/O buffer queue fixed-size
timarmstrong Jun 14, 2017
ed8e086
IMPALA-5856: Fix outer join predicate assignment.
Sep 11, 2017
ad3b1d8
IMPALA-3437: DECIMAL_V2: avoid implicit decimal->double conversion
timarmstrong Aug 30, 2017
0dd1b00
Increment version to 2.11.0-SNAPSHOT
Sep 15, 2017
23cf51d
IMPALA-5211: Simplifying nullif conditional.
Aug 24, 2017
48ecce1
IMPALA-3877: support unpatched LLVM
timarmstrong Aug 25, 2017
638675d
IMPALA-5199: prevent hang on empty row batch exchange
timarmstrong Sep 7, 2017
6a92031
Revert "IMPALA-5589: change "set" in impala-shell to show empty strin…
Sep 15, 2017
be0a46c
IMPALA-5860: upgrade to LLVM 3.9.1
timarmstrong Aug 30, 2017
7fbfbc8
IMPALA-5920: Remove admission control dependency on YARN RM jar
Sep 11, 2017
25be391
IMPALA-5941: Fix Metastore schema creation in create-test-configurati…
joemcdonnell Sep 15, 2017
cd4a029
IMPALA-5932: Improve transitive closure computation performance in FE
glglwty Sep 14, 2017
55fb4cf
IMPALA-5895: clean up runtime profile lifecycle
timarmstrong Sep 5, 2017
ceeea3f
IMPALA-5920: addendum - add missing RAT check
timarmstrong Sep 20, 2017
89a9371
IMPALA-5927: Fix enable_distcc for zsh
Sep 13, 2017
114ac27
IMPALA-4863/IMPALA-5311: Correctly account the file type and compress…
Jun 19, 2017
797defc
Remove unused MemPool::peak_allocated_bytes_
timarmstrong Sep 18, 2017
1b13925
IMPALA-5966: Fix the result file location of PlannerTest
glglwty Sep 20, 2017
ea9c68a
IMPALA-5416: Fix an impala-shell command recursion bug
glglwty Sep 14, 2017
b02060f
IMPALA-5250: Unify decompressor output_length semantics
glglwty Sep 9, 2017
b3b8285
fix query report generator typo
mikesbrown Sep 21, 2017
1f4d1c1
IMPALA-5954: Set DO_NOT_UPDATE_STATS in alterTable() RPCs to HMS.
Sep 19, 2017
8c1bc76
IMPALA-5955: Use totalSize tblproperty instead of rawDataSize.
Sep 19, 2017
89a790b
IMPALA-5965: avoid per-value switch on NeedsConversionInline() in par…
timarmstrong Sep 20, 2017
16199ef
IMPALA-5949: fix test_exchange_small_delay failure
timarmstrong Sep 20, 2017
18ec1eb
IMPALA-4513: Promote integer types for ABS()
Sep 7, 2017
a7fb10a
IMPALA-5599: Fix for mis-use of TimestampValue
zoramthanga Sep 15, 2017
46288f5
IMPALA-1767 Adds predicate to test boolean values true, false, unknown.
vercegovac Sep 21, 2017
e42d98f
IMPALA-3360: Codegen inserting into runtime filters
twmarshall Sep 6, 2017
effbf18
[DOCS] Fill in release note subtopics for Apache Impala 2.10
Sep 5, 2017
0829a7e
IMPALA-5538: Use explicit catalog versions for deleted objects
dtsirogiannis Aug 14, 2017
30b1077
IMPALA-5975: Work around broken beeline clients
Sep 23, 2017
72ed08d
IMPALA-5870: Improve runtime profile for partial sort
twmarshall Sep 21, 2017
0779b49
Move tests related to the old join node.
Sep 27, 2017
e31b4ad
IMPALA-5307: Part 3: remove TODO from RCFile
timarmstrong Sep 27, 2017
e5836cb
IMPALA-5986: Correct set-option logic to recognize digits in names.
Sep 28, 2017
f32bab1
IMPALA-992: [DOCS] Document impala-shell 'rerun' command
Sep 12, 2017
fc54882
IMPALA-5951: Remove flaky test_catalogd_timeout
twmarshall Sep 27, 2017
e9a30f6
IMPALA-5994: Lower case struct-field names
twmarshall Sep 28, 2017
8af7ff3
IMPALA-4252: Move runtime filters to ScanNode
twmarshall Sep 26, 2017
bd0eb6d
Re-apply: IMPALA-5589: change "set" in impala-shell to show empty str…
Sep 18, 2017
8be2355
IMPALA-4736: Add SIGUSR1 behavior to help string for 'minidump_path' …
csringhofer Sep 28, 2017
924675a
IMPALA-4786: Clean up how ImpalaServers are created
smukil Sep 14, 2017
c274ac6
IMPALA-6009: Upgrade Guava to 14.0.1
Oct 3, 2017
446f80f
IMPALA-5383: [DOCS] Document unpartitioned Kudu tables
Sep 21, 2017
00c6c9c
IMPALA-4951: Fix database visibility for user with only column privilege
Sep 26, 2017
e1a4d94
IMPALA-5174: Suppress kudu flags that aren't relevant to Impala
smukil Sep 14, 2017
fea764c
IMPALA-4939, IMPALA-4940: Decimal V2 multiplication
tbobrovytsky Jul 6, 2017
4e7d302
IMPALA-6012: workaround - downgrade hive temporarily
timarmstrong Oct 4, 2017
7b2924c
IMPALA-5394: Change ThriftServer() to always use TAcceptQueueServer
jfs-arcadia Jun 1, 2017
9127ffc
IMPALA-5988: optimise MemPool::TryAllocate()
timarmstrong Sep 21, 2017
2f815f8
IMPALA-5908: Allow SET to unset modified query options.
Sep 11, 2017
7a4d3ef
CDH-43406: delete unneeded thirdparty components
timarmstrong Oct 2, 2017
4153162
Bump Kudu version to bec2a24
twmarshall Oct 4, 2017
2fc4dd6
Revert "IMPALA-6012: workaround - downgrade hive temporarily"
timarmstrong Oct 5, 2017
a3b22c4
IMPALA-4623: [DOCS] Document file handle caching
Oct 3, 2017
a091d9c
IMPALA-5844: use a MemPool for expr result allocations
timarmstrong Jul 25, 2017
bdc18cd
IMPALA-5525 Extend TestScannersFuzzing to test uncompressed parquet
pranay2017 Sep 13, 2017
b34b56f
IMPALA-6019: Remove dead code parallel-executor*
dhecht Oct 4, 2017
8040298
IMPALA-3316: [DOCS] Add known issue for timezone conversion slowdown
Sep 28, 2017
6420ba1
IMPALA-2190: [DOCS] from_timestamp() and to_timestamp()
Sep 12, 2017
ff0068b
IMPALA-4670: Introduces RpcMgr class
Aug 20, 2017
0125385
IMPALA-4622: [DOCS] New Kudu ALTER TABLE syntax
Oct 2, 2017
2389985
IMPALA-3504: [DOCS] Document utc_timestamp()
Oct 2, 2017
c5d1ecd
IMPALA-6021: Revert "IMPALA-6009: Upgrade Guava to 14.0.1"
mikesbrown Oct 6, 2017
e605f39
IMPALA-5529: [DOCS] New trunc() signatures
Oct 2, 2017
d3707d9
[DOCS] Tighten up advice about first COMPUTE INCREMENTAL STATS
Sep 1, 2017
2832126
IMPALA-3200: [DOCS] Document user-facing aspects of new buffer pool
Aug 15, 2017
84c4190
IMPALA-2636: HS2 GetTables() returns TABLE_TYPE as TABLE for VIEW
sakinapelli Jun 30, 2017
710d982
Download toolchain in parallel.
Oct 9, 2017
478e2a1
IMPALA-5448: fix invalid number of splits reported in Parquet scan node
stiga-huang Sep 26, 2017
f7a5040
IMPALA-5940: Avoid log spew by using Status::Expected.
Sep 15, 2017
af509bc
IMPALA-4682: Remove Preconditions check from analyzeAggregation().
zoramthanga Sep 26, 2017
ad5ad70
IMPALA-6016: Fix logging in TableLoadingMgr class
Oct 4, 2017
46111ab
IMPALA-5425: Add test for validating input when setting query options
glglwty Aug 25, 2017
85fcd0e
IMPALA-6023: Fix broken breakpad test
Oct 9, 2017
41f4537
IMPALA-6027: Retry downloading toolchain components.
Oct 11, 2017
0a200ff
IMPALA-6030: Don't start coordinator specific thread pools if a node …
smukil Oct 9, 2017
1748a80
IMPALA-5940: Avoid log spew by using Status::Expected()
Oct 11, 2017
594c392
IMPALA-6040: skip test_multi_compression_types where Hive isn't suppo…
mikesbrown Oct 11, 2017
41551f3
IMPALA-5983: Fix crash in to/from_utc_timestamp("10:00:00", 'MSK')
csringhofer Sep 26, 2017
33e7e3a
[DOCS] Fix alphabetical ordering of query options
Oct 12, 2017
a3d74a1
IMPALA-6045: Make build scripts more friendly to Ubuntu 16.04
jbapple-cloudera Oct 11, 2017
b6de1a7
IMPALA-6049: breakpad tests: skip all tests with local filesystem
mikesbrown Oct 13, 2017
61702c9
Reduce log spew from rpcz_store.cc
Oct 12, 2017
7b252f2
Making bin/bootstrap_system.sh executable.
Oct 17, 2017
0ffebbd
Allow configuration of values passed into kerberos env vars
smukil Oct 10, 2017
4b838b9
Add 'psmisc' to bootstrap_system.sh.
Oct 17, 2017
6901fad
IMPALA-6002: Add a LLVM diagnostic handler for LLVM linker errors
Oct 6, 2017
a99ce09
Allow the SASL protocol service name to be configurable
smukil Oct 5, 2017
05b56e1
IMPALA-6063: stress test: report test duration
Oct 19, 2017
a16f554
IMPALA-5789: Add always_false flag in bloom filter
glglwty Oct 6, 2017
d6fd413
IMPALA-4918: Support getting column comments via HS2
csringhofer Oct 18, 2017
13497d8
IMPALA-6055: Fix hdfs encryption test far Hadoop 2.8+
Oct 13, 2017
dbf9527
IMPALA-5668: Fix cast(X as timestamp) for negative subsecond Decimals
csringhofer Sep 13, 2017
a8a51e9
IMPALA-6803: [DOCS] Clarify scope of STRAIGHT_JOIN hint
Oct 19, 2017
eb1bc3f
IMPALA-5664: Unix time to timestamp conversions may crash Impala
csringhofer Sep 1, 2017
af5087e
IMPALA-5307: part 1: don't transfer disk I/O buffers out of parquet
timarmstrong May 10, 2017
61799a9
IMPALA-5376: Implement all TPCDS test cases or alternates for Impala.
Aug 30, 2017
77466c4
IMPALA-6100: increase test_exchange_delays timeout on slow builds
timarmstrong Oct 23, 2017
576ac1c
IMPALA-4524: Batch ALTER TABLE...ADD PARTITION calls.
Oct 9, 2017
802047c
IMPALA-6060: Check the return value of JNI exception handling functions
glglwty Oct 18, 2017
d8ab319
IMPALA-5599: Clean up references to TimestampValue in be/src.
zoramthanga Oct 17, 2017
3001a64
IMPALA-6076: Parquet BIT_PACKED deprecation warning
timarmstrong Oct 24, 2017
6188bde
IMPALA-6099: Fix crash in CheckForAlwaysFalse()
glglwty Oct 23, 2017
e04baa1
IMPALA-6070: Parallel compute_table_stats.py
Oct 22, 2017
1d558b8
IMPALA-6070: Parallel data load.
Oct 18, 2017
912f91d
IMPALA-5018: Error on decimal modulo or divide by zero
tbobrovytsky Oct 19, 2017
a8deee5
IMPALA-5957: print memory address, not memory
timarmstrong Oct 24, 2017
45ce4fa
IMPALA-4236: Codegen CopyRows() for select nodes
Sep 6, 2017
a711bf4
IMPALA-6068: Fix dataload for complextypes_fileformat
joemcdonnell Oct 20, 2017
89ff680
IMPALA-6004: Fix test_row_filters failure on ASAN
twmarshall Oct 23, 2017
448c3e3
Remove Parquet "incubating" modifier in README
ebuildy Oct 20, 2017
b6c3684
IMPALA-3998: deprecate --refresh_after_connect
timarmstrong Oct 25, 2017
860aa0d
IMPALA-6106: handle comments before set in test parser
timarmstrong Oct 25, 2017
6a27954
IMPALA-6108: Revert "IMPALA-6070: Parallel data load."
Oct 25, 2017
cdf76e5
IMPALA-3548: Prune runtime filters based on query options in the FE
Jul 18, 2017
f122417
IMPALA-1291: Parquet read fails if io buffer size is less than the fo…
boroknagyz Oct 24, 2017
efef770
IMPALA-5429: Multi threaded block metadata loading
Jul 31, 2017
6858c64
IMPALA-5129: Use Kudu's Kinit code to avoid expensive fork
smukil Mar 28, 2017
ba62034
IMPALA-2758: Remove BufferedTupleStream::GetRows
glglwty Oct 4, 2017
1423b94
IMPALA-6093: diagnostics for flaky TestHashJoinTimer
timarmstrong Oct 26, 2017
b89f99a
IMPALA-1575: Part 1: eagerly release query exec resources
timarmstrong Oct 17, 2017
7b289f0
IMPALA-5243: Speed up code gen for wide Avro tables.
Sep 13, 2017
83eb1b9
IMPALA-6123: Fix column order of a query test in test_inline_view_limit
Oct 28, 2017
9b6cc8e
IMPALA-6124: Fix alter table ddl updates and test
Oct 28, 2017
731472b
IMPALA-6118: Fix assertion failure in coordinator bloom filter updating
glglwty Oct 27, 2017
ccd3ac7
IMPALA-6126: ASAN detects heap-use-after-free in thrift-server-test
smukil Oct 30, 2017
a520b0c
IMPALA-5638: [DOCS] Add known issue for Impala-Kudu-Sentry issue
Oct 30, 2017
1c92dc2
IMPALA-6127: Fix timeout in TestRuntimeFilter.test_wait_time
twmarshall Oct 28, 2017
1f43b7f
IMPALA-5307: Part 2: copy out strings in uncompressed Avro
timarmstrong Sep 22, 2017
d9548ff
IMPALA-6108, IMPALA-6070: Parallel data load (re-instated).
Oct 25, 2017
42f6b8d
IMPALA-5541: Reject BATCH_SIZE greater than 65536
glglwty Oct 28, 2017
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
IMPALA-5749: coordinator race hits DCHECK 'num_remaining_backends_ > 0'
In Coordinator::UpdateBackendExecStatus(), we check if the backend
has already completed with BackendState::IsDone() and return without
applying the update if so to avoid updating num_remaining_backends_
twice for the same completed backend.

The problem is that the value of BackendState::IsDone() is updated by
the call to BackendState::ApplyExecStatusReport() that comes after it,
but these operations are not performed atomically, so if there are
two simultaneous calls to UpdateBackendExecStatus(), they can both
call IsDone(), both get 'false', and then proceed to erroneously both
update num_remaining_backends_, hitting a DCHECK.

This patch modifies ApplyExecStatusReport to return true iff this
report transitioned the backend to a done status, and then only
updates num_remaining_backends_ in this case, ensuring it is only
updated once per backend.

Testing:
- Ran test_finst_cancel_when_query_complete 10,000 times without
  hitting the DCHECK (previously, it would hit about once per 300
  runs).

Change-Id: I1528661e5df6d9732ebfeb414576c82ec5c92241
Reviewed-on: http://gerrit.cloudera.org:8080/7577
Reviewed-by: Dan Hecht <[email protected]>
Tested-by: Impala Public Jenkins
  • Loading branch information
twmarshall authored and jenkins committed Aug 23, 2017
commit 7889365acea1b99ad060598101e60ce2a1644b03
25 changes: 17 additions & 8 deletions be/src/runtime/coordinator-backend-state.cc
Original file line number Diff line number Diff line change
Expand Up @@ -228,20 +228,29 @@ void Coordinator::BackendState::MergeErrorLog(ErrorLogMap* merged) {
if (error_log_.size() > 0) MergeErrorMaps(error_log_, merged);
}

bool Coordinator::BackendState::IsDone() {
lock_guard<mutex> l(lock_);
return IsDoneInternal();
void Coordinator::BackendState::LogFirstInProgress(
std::vector<Coordinator::BackendState*> backend_states) {
for (Coordinator::BackendState* backend_state : backend_states) {
lock_guard<mutex> l(backend_state->lock_);
if (!backend_state->IsDone()) {
VLOG_QUERY << "query_id=" << backend_state->query_id_
<< ": first in-progress backend: " << backend_state->impalad_address();
break;
}
}
}

inline bool Coordinator::BackendState::IsDoneInternal() const {
inline bool Coordinator::BackendState::IsDone() const {
return num_remaining_instances_ == 0 || !status_.ok();
}

void Coordinator::BackendState::ApplyExecStatusReport(
bool Coordinator::BackendState::ApplyExecStatusReport(
const TReportExecStatusParams& backend_exec_status, ExecSummary* exec_summary,
ProgressUpdater* scan_range_progress, bool* done) {
ProgressUpdater* scan_range_progress) {
lock_guard<SpinLock> l1(exec_summary->lock);
lock_guard<mutex> l2(lock_);
// If this backend completed previously, don't apply the update.
if (IsDone()) return false;
for (const TFragmentInstanceExecStatus& instance_exec_status:
backend_exec_status.instance_exec_status) {
Status instance_status(instance_exec_status.status);
Expand Down Expand Up @@ -298,8 +307,8 @@ void Coordinator::BackendState::ApplyExecStatusReport(
VLOG_FILE << "host=" << host_ << " error log: " << PrintErrorMapToString(error_log_);
}

*done = IsDoneInternal();
// TODO: keep backend-wide stopwatch?
return IsDone();
}

void Coordinator::BackendState::UpdateExecStats(
Expand Down Expand Up @@ -327,7 +336,7 @@ bool Coordinator::BackendState::Cancel() {
if (!rpc_sent_) return false;

// don't cancel if it already finished (for any reason)
if (IsDoneInternal()) return false;
if (IsDone()) return false;

/// If the status is not OK, we still try to cancel - !OK status might mean
/// communication failure between backend and coordinator, but fragment
Expand Down
24 changes: 13 additions & 11 deletions be/src/runtime/coordinator-backend-state.h
Original file line number Diff line number Diff line change
Expand Up @@ -67,13 +67,14 @@ class Coordinator::BackendState {
CountingBarrier* rpc_complete_barrier);

/// Update overall execution status, including the instances' exec status/profiles
/// and the error log. Updates the fragment instances' TExecStats in exec_summary
/// (exec_summary->nodes.exec_stats) and updates progress_update, and sets
/// done to true if all fragment instances completed, regardless of status.
/// If any instance reports an error, the overall execution status becomes the first
/// reported error status and 'done' is set to true.
void ApplyExecStatusReport(const TReportExecStatusParams& backend_exec_status,
ExecSummary* exec_summary, ProgressUpdater* scan_range_progress, bool* done);
/// and the error log, if this backend is not already done. Updates the fragment
/// instances' TExecStats in exec_summary (exec_summary->nodes.exec_stats) and updates
/// progress_update. If any instance reports an error, the overall execution status
/// becomes the first reported error status. Returns true iff this update changed
/// IsDone() from false to true, either because it was the last fragment to complete or
/// because it was the first error received.
bool ApplyExecStatusReport(const TReportExecStatusParams& backend_exec_status,
ExecSummary* exec_summary, ProgressUpdater* scan_range_progress);

/// Update completion_times, rates, and avg_profile for all fragment_stats.
void UpdateExecStats(const std::vector<FragmentStats*>& fragment_stats);
Expand Down Expand Up @@ -108,8 +109,9 @@ class Coordinator::BackendState {
/// Only valid after Exec().
int64_t rpc_latency() const { return rpc_latency_; }

/// Return true if execution at this backend is done.
bool IsDone();
/// Print host/port info for the first backend that's still in progress as a
/// debugging aid for backend deadlocks.
static void LogFirstInProgress(std::vector<BackendState*> backend_states);

private:
/// Execution stats for a single fragment instance.
Expand Down Expand Up @@ -217,8 +219,8 @@ class Coordinator::BackendState {
const FilterRoutingTable& filter_routing_table,
TExecQueryFInstancesParams* rpc_params);

/// Return true if execution at this backend is done. Doesn't acquire lock.
bool IsDoneInternal() const;
/// Return true if execution at this backend is done. Caller must hold lock_.
bool IsDone() const;
};

/// Per fragment execution statistics.
Expand Down
57 changes: 23 additions & 34 deletions be/src/runtime/coordinator.cc
Original file line number Diff line number Diff line change
Expand Up @@ -931,59 +931,48 @@ Status Coordinator::UpdateBackendExecStatus(const TReportExecStatusParams& param
params.coord_state_idx, backend_states_.size() - 1));
}
BackendState* backend_state = backend_states_[params.coord_state_idx];
// ignore stray exec reports if we're already done, otherwise we lose
// track of num_remaining_backends_
if (backend_state->IsDone()) return Status::OK();
// TODO: return here if returned_all_results_?
// TODO: return CANCELLED in that case? Although that makes the cancellation propagation
// path more irregular.

bool done;
backend_state->ApplyExecStatusReport(params, &exec_summary_, &progress_, &done);

// TODO: only do this when the sink is done; probably missing a done field
// in TReportExecStatus for that
if (params.__isset.insert_exec_status) {
UpdateInsertExecStatus(params.insert_exec_status);
}

// for now, abort the query if we see any error except if returned_all_results_ is true
// (UpdateStatus() initiates cancellation, if it hasn't already been)
// TODO: clarify control flow here, it's unclear we should even process this status
// report if returned_all_results_ is true
TUniqueId failed_instance_id;
Status status = backend_state->GetStatus(&failed_instance_id);
if (!status.ok() && !returned_all_results_) {
Status ignored = UpdateStatus(status, failed_instance_id,
TNetworkAddressToString(backend_state->impalad_address()));
return Status::OK();
}

// If all results have been returned, return a cancelled status to force the fragment
// instance to stop executing.
if (!done && returned_all_results_) return Status::CANCELLED;
if (backend_state->ApplyExecStatusReport(params, &exec_summary_, &progress_)) {
// This report made this backend done, so update the status and
// num_remaining_backends_.

// for now, abort the query if we see any error except if returned_all_results_ is
// true (UpdateStatus() initiates cancellation, if it hasn't already been)
// TODO: clarify control flow here, it's unclear we should even process this status
// report if returned_all_results_ is true
TUniqueId failed_instance_id;
Status status = backend_state->GetStatus(&failed_instance_id);
if (!status.ok() && !returned_all_results_) {
Status ignored = UpdateStatus(status, failed_instance_id,
TNetworkAddressToString(backend_state->impalad_address()));
return Status::OK();
}

if (done) {
lock_guard<mutex> l(lock_);
DCHECK_GT(num_remaining_backends_, 0);
VLOG_QUERY << "Backend completed: "
<< " host=" << backend_state->impalad_address()
<< " remaining=" << num_remaining_backends_ - 1;
if (VLOG_QUERY_IS_ON && num_remaining_backends_ > 1) {
// print host/port info for the first backend that's still in progress as a
// debugging aid for backend deadlocks
for (BackendState* backend_state: backend_states_) {
if (!backend_state->IsDone()) {
VLOG_QUERY << "query_id=" << query_id() << ": first in-progress backend: "
<< backend_state->impalad_address();
break;
}
}
VLOG_QUERY << "Backend completed: "
<< " host=" << backend_state->impalad_address()
<< " remaining=" << num_remaining_backends_ - 1;
BackendState::LogFirstInProgress(backend_states_);
}
if (--num_remaining_backends_ == 0 || !status.ok()) {
backend_completion_cv_.notify_all();
}
return Status::OK();
}
// If all results have been returned, return a cancelled status to force the fragment
// instance to stop executing.
if (returned_all_results_) return Status::CANCELLED;

return Status::OK();
}
Expand Down