forked from apache/kudu
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathoverview.html
245 lines (231 loc) · 10.5 KB
/
overview.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
---
title: Overview
layout: default
active_nav: overview
TODO: attributions for CC-A
---
<!------------------------------------------------------------->
<div class="row header">
<div class="col-lg-12">
<h2>Apache Kudu Overview</h2>
</div>
</div>
<div class="row overview">
<div class="col-md-6">
<h3>Data Model</h3>
<p>
A Kudu cluster stores tables that look just like tables you're used to from relational (SQL) databases.
A table can be as simple as an binary <code>key</code> and <code>value</code>, or as complex
as a few hundred different strongly-typed attributes.
</p>
<p>
Just like SQL, every table has a <code>PRIMARY KEY</code> made up of one or more columns.
This might be a single column like a unique user identifier, or a compound key such as a
<code>(host, metric, timestamp)</code> tuple for a machine time series database. Rows can be efficiently
read, updated, or deleted by their primary key.
</p>
<p>
Kudu's simple data model makes it breeze to port legacy applications or build new ones:
no need to worry about how to encode your data into binary blobs or make sense of a
huge database full of hard-to-interpret JSON. Tables are self-describing, so you can
use standard tools like SQL engines or Spark to analyze your data.
</p>
<ul>
<li><a href="docs/schema_design.html">Learn more about schema design with Kudu</a></li>
</ul>
</div>
<div class="col-md-6">
<img class="img-responsive" src="{{site.github.url}}/img/twitter-firehose-schema.png" />
</div>
</div>
<div class="row overview">
<div class="col-md-6">
<img class="img-responsive" src="{{site.github.url}}/img/java-code.png" />
</div>
<div class="col-md-6">
<h3>Low-latency random access</h3>
<p>
Unlike other storage for big data analytics, Kudu isn't just a file format. It's a live storage
system which supports low-latency millisecond-scale access to individual rows. For
"NoSQL"-style access, you can choose between Java, C++, or Python APIs. And of course these
random access APIs can be used in conjunction with batch access for machine learning or analytics.
</p>
<p>
Kudu's APIs are designed to be easy to use. The data model is fully typed, so you don't
need to worry about binary encodings or exotic serialization. You can just store primitive
types, like when you use JDBC or ODBC.
</p>
<p>
Kudu isn't designed to be an OLTP system, but if you have some subset of data which fits
in memory, it offers competitive random access performance. We've measured 99th percentile
latencies of 6ms or below using YCSB with a uniform random access workload over a billion
rows. Being able to run low-latency online workloads on the same storage as back-end
data analytics can dramatically simplify application architecture.
</p>
<ul>
<li><a href="apidocs/">View the Java API docs</a></li>
<li><a href="cpp-client-api/">View the C++ API docs</a></li>
<li><a href="docs/developing.html">Learn more about developing applications with Kudu</a></li>
<!-- TODO: include a link to YCSB results -->
</ul>
</div>
</div>
<div class="row overview">
<div class="col-md-6">
<h3>Apache Hadoop Ecosystem Integration</h3>
<p>
Kudu was designed to fit in with the Hadoop ecosystem, and integrating it with other
data processing frameworks is simple. You can stream data in from live real-time data sources
using the Java client, and then process it immediately upon arrival using Spark, Impala,
or MapReduce. You can even transparently join Kudu tables with data stored in other Hadoop
storage such as HDFS or HBase.
</p>
<p>
Kudu is a good citizen on a Hadoop cluster: it can easily share data
disks with HDFS DataNodes, and can operate in a RAM footprint as small as 1 GB for
light workloads.
</p>
<ul>
<li><a href="docs/kudu_impala_integration.html">Learn more about integration with Impala</a>
<li><a href="https://github.com/apache/incubator-kudu/blob/master/java/kudu-client-tools/src/main/java/org/kududb/mapreduce/tools/RowCounter.java">
View an example of a MapReduce job on Kudu
</a></li>
</ul>
</div>
<div class="col-md-6">
<img class="img-responsive" src="{{site.github.url}}/img/impala.png" />
</div>
</div>
<div class="row overview">
<div class="col-md-6">
<img class="img-responsive" src="{{site.github.url}}/img/tracing.png" />
</div>
<div class="col-md-6">
<h3>Built by and for Operators</h3>
<p>
Kudu was built by a group of engineers who have spent many late nights providing
on-call production support for critical Hadoop clusters across hundreds of
enterprise use cases. We know how frustrating it is to debug software
without good metrics, tracing, or administrative tools.
</p>
<p>
Ever since its first beta release, Kudu has included advanced in-process tracing capabilities,
extensive metrics support, and even watchdog threads which check for latency
outliers and dump "smoking gun" stack traces to get to the root of the problem
quickly.
</p>
<ul>
<li><a href="docs/administration.html">Learn more about administering Kudu</a></li>
<li><a href="docs/troubleshooting.html#kudu_tracing">Learn more about Kudu's tracing capabilities</a></li>
</ul>
</div>
</div>
<div class="row overview noborder">
<div class="col-md-6">
<h3><a name="opensource">Open Source</a></h3>
<p>
Kudu is Open Source software, licensed under the Apache 2.0 license and
governed under the aegis of the Apache Software Foundation. We believe
that Kudu's long-term success depends on building a vibrant community of
developers and users from diverse organizations and backgrounds.
</p>
<ul>
<li><a href="{{site.github.url}}/community.html">Learn more about how to contribute</a></li>
<li><a href="https://github.com/apache/incubator-kudu">View the Kudu github repository</a></li>
</ul>
</div>
<div class="col-md-6">
<img class="img-responsive" src="{{site.github.url}}/img/asf_logo.png" size="75%"/>
</div>
</div>
<!------------------------------------------------------------->
<div class="row header">
<div class="col-lg-12">
<h2><a name="architecture">Kudu Architecture</a></h2>
</div>
</div>
<div class="row overview">
<div class="col-md-6">
<h3>Super-fast Columnar Storage</h3>
<p>
Like most modern analytic data stores, Kudu internally organizes its data by column rather than
row. Columnar storage allows efficient encoding and compression. For example, a string field with
only a few unique values can use only a few bits per row of storage. With techniques such as
run-length encoding, differential encoding, and vectorized bit-packing, Kudu is as fast at
reading the data as it is space-efficient at storing it.
</p>
<p>
Columnar storage also dramatically reduces the amount of data IO required to service analytic
queries. Using techniques such as lazy data materialization and predicate pushdown, Kudu can perform
drill-down and needle-in-a-haystack queries over billions of rows and terabytes of data in seconds.
</p>
<ul>
<li><a href="kudu.pdf">Read the Kudu paper for more details and a performance evaluation</a></li>
</ul>
</div>
<div class="col-md-6">
<img class="img-responsive" src="{{site.github.url}}/img/kudu_vs_parquet.png" />
</div>
</div>
<div class="row overview">
<div class="col-md-6">
<img class="img-responsive" src="{{site.github.url}}/img/raft.png" width="400"/>
</div>
<div class="col-md-6">
<h3>Distribution and Fault Tolerance</h3>
<p>
In order to scale out to large datasets and large clusters, Kudu splits tables
into smaller units called <em>tablets</em>. This splitting can be configured
on a per-table basis to be based on hashing, range partitioning, or a combination
thereof. This allows the operator to easily trade off between parallelism for
analytic workloads and high concurrency for more online ones.
</p>
<p>
In order to keep your data safe and available at all times, Kudu uses the
<a href="http://raft.github.io">Raft</a> consensus algorithm to replicate
all operations for a given tablet. Raft, like Paxos, ensures that every
write is persisted by at least two nodes before responding to
the client request, ensuring that no data is ever lost due to a
machine failure. When machines do fail, replicas reconfigure
themselves within a few seconds to maintain extremely high system
availability.
</p>
<p>
The use of majority consensus provides very low tail latencies
even when some nodes may be stressed by concurrent workloads such as
Spark jobs or heavy Impala queries. But unlike eventually
consistent systems, Raft consensus ensures that all replicas will
come to agreement around the state of the data, and by using a
combination of logical and physical clocks, Kudu can offer strict
snapshot consistency to clients that demand it.
</p>
<ul>
<li><a href="http://raft.github.io/">Learn more about Raft Consensus</a></li>
<li><a href="kudu.pdf">Read the Kudu paper for more details on its architecture</a></li>
</ul>
</div>
</div>
<div class="row overview noborder">
<div class="col-md-6">
<h3>Designed for Next-Generation Hardware</h3>
<p>
The Kudu team has worked closely with engineers at Intel to harness the power
of the next generation of hardware technologies.
Kudu's storage is designed to take advantage of the IO
characteristics of solid state drives, and it includes an
experimental cache implementation based on the <a href="http://pmem.io">libpmem</a>
library which can store data in persistent memory.
</p>
<p>
Kudu is implemented in C++, so it can scale easily to large amounts
of memory per node. And because key storage data structures are designed to
be highly concurrent, it can scale easily to tens of cores. With an
in-memory columnar execution path, Kudu achieves good instruction-level
parallelism using SIMD operations from the SSE4 and AVX instruction sets.
</p>
</div>
<div class="col-md-6">
<img class="img-responsive" src="{{site.github.url}}/img/intel_logo.png" size="75%"/>
</div>
</div>
<!------------------------------------------------------------->