Java client for Apache HoraeDB (Incubating).
Important
Apache HoraeDB (incubating) is an effort undergoing incubation at the Apache Software Foundation (ASF), sponsored by the Apache Incubator PMC.
Please read the DISCLAIMER and a full explanation of "incubating".
- With the well-designed SPI, the network transport layer is extensible. And we provide the default implementation which uses the gRPC framework.
- The client provides high-performance async streaming write API.
- The client also collects lots of performance metrics by default. These metrics can be configured to write to local file.
- We can take memory snapshots that contains the status of critical objects. The snapshots can also be configured to write to local file, which helps a lot when we diagnose complex problems.
βββββββββββββββββββββββ
β HoraeDBClient β
βββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββ
β WriteClient βββββ
βββββββββββββββββββββββ β
β Async to retry and merge responses
β β
βββββSplit requests β
β β
β βββββββββββββββββββββββ β βββββββββββββββββββββββ βββββββββββββββββββββββ
βββΆβ RouterClient ββββ΄βββΆβ RouterCache ββββββββΆβ RouterFor β
βββββββββββββββββββββββ βββββββββββββββββββββββ βββββββββββββββββββββββ
β² β
β β
βΌ β
βββββββββββββββββββββββ β
β RpcClient βββββββββββββββββββββββββββββββββββββββββββββββββ
βββββββββββββββββββββββ
β²
β
βΌ
βββββββββββββββββββββββ
β Default gRPC impl β
βββββββββββββββββββββββ
β²
β
βββββββββββββββββββββ΄ β β β β β β β β β β β β β β β β β β β
β β
βΌ βΌ βΌ
βββββββββββββββββββββββ βββββββββββββββββββββββ βββββββββββββββββββββββ
β HoraeDB Node1 β β HoraeDB Node2 β β ... β
βββββββββββββββββββββββ βββββββββββββββββββββββ βββββββββββββββββββββββ
βββββββββββββββββββββββ
β HoraeDBClient β
βββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββ
β QueryClient βββββ
βββββββββββββββββββββββ β
β βAsync to retry
β β
ββββββββββββββ β
β β
β βββββββββββββββββββββββ β βββββββββββββββββββββββ βββββββββββββββββββββββ
βββΆβ RouterClient ββββ΄βββΆβ RouterCache ββββββββΆβ RouterFor β
βββββββββββββββββββββββ βββββββββββββββββββββββ βββββββββββββββββββββββ
β² β
β β
βΌ β
βββββββββββββββββββββββ β
β RpcClient βββββββββββββββββββββββββββββββββββββββββββββββββ
βββββββββββββββββββββββ
β²
β
βΌ
βββββββββββββββββββββββ
β Default gRPC impl β
βββββββββββββββββββββββ
β²
β
βββββββββββββββββββββ΄ β β β β β β β β β β β β β β β β β β β
β β
βΌ βΌ βΌ
βββββββββββββββββββββββ βββββββββββββββββββββββ βββββββββββββββββββββββ
β HoraeDB Node1 β β HoraeDB Node2 β β ... β
βββββββββββββββββββββββ βββββββββββββββββββββββ βββββββββββββββββββββββ
- Java 8 or later is required for compilation
<dependency>
<groupId>io.ceresdb</groupId>
<artifactId>horaedb-all</artifactId>
<version>1.0.0-alpha</version>
</dependency>
final HoraeDBOptions opts = HoraeBOptions.newBuilder("127.0.0.1", 8831, DIRECT) // HoraeDB default grpc port 8831οΌuse DIRECT RouteMode
.database("public") // use database for client, can be overridden by the RequestContext in request
// maximum retry times when write fails
// (only some error codes will be retried, such as the routing table failure)
.writeMaxRetries(1)
// maximum retry times when read fails
// (only some error codes will be retried, such as the routing table failure)
.readMaxRetries(1).build();
final HoraeDBClient client = new HoraeDBClient();
if (!client.init(opts)) {
throw new IllegalStateException("Fail to start HoraeDBClient");
}
For more configuration options, see configuration
HoraeDB is a Schema-less time-series database, so creating table schema ahead of data ingestion is not required (HoraeDB will create a default schema according to the very first data you write into it). Of course, you can also manually create a schema for fine grained management purposes (eg. managing index).
The following table creation statementοΌusing the SQL API included in SDK οΌshows all field types supported by HoraeDBοΌ
// Create table manually, creating table schema ahead of data ingestion is not required
String createTableSql = "CREATE TABLE IF NOT EXISTS machine_table(" + "ts TIMESTAMP NOT NULL," + //
"ts TIMESTAMP NOT NULL," +
"city STRING TAG NOT NULL," +
"ip STRING TAG NOT NULL," +
"cpu DOUBLE NULL," +
"mem DOUBLE NULL," +
"TIMESTAMP KEY(ts)" + // timestamp column must be specified
") ENGINE=Analytic";
Result<SqlQueryOk, Err> createResult = client.sqlQuery(new SqlQueryRequest(createTableSql)).get();
if (!createResult.isOk()) {
throw new IllegalStateException("Fail to create table");
}
final Point point = Point.newPointBuilder("machine_table")
.setTimestamp(t0)
.addTag("city", "Singapore")
.addTag("ip", "10.0.0.1")
.addField("cpu", Value.withDouble(0.23))
.addField("mem", Value.withDouble(0.55))
.build();
final CompletableFuture<Result<WriteOk, Err>> wf = client.write(new WriteRequest(pointList));
// here the `future.get` is just for demonstration, a better async programming practice would be using the CompletableFuture API
final Result<WriteOk, Err> writeResult = wf.get();
Assert.assertTrue(writeResult.isOk());
// `Result` class referenced the Rust language practice, provides rich functions (such as mapXXX, andThen) transforming the result value to improve programming efficiency. You can refer to the API docs for detail usage.
Assert.assertEquals(3, writeResult.getOk().getSuccess());
Assert.assertEquals(3, writeResult.mapOr(0, WriteOk::getSuccess).intValue());
Assert.assertEquals(0, writeResult.mapOr(-1, WriteOk::getFailed).intValue());
See write
final SqlQueryRequest queryRequest = SqlQueryRequest.newBuilder()
.forTables("machine_table") // table name is optional. If not provided, SQL parser will parse the `sql` to get the table name and do the routing automaticly
.sql("select * from machine_table where ts = %d", t0) //
.build();
final CompletableFuture<Result<SqlQueryOk, Err>> qf = client.sqlQuery(queryRequest);
// here the `future.get` is just for demonstration, a better async programming practice would be using the CompletableFuture API
final Result<SqlQueryOk, Err> queryResult = qf.get();
Assert.assertTrue(queryResult.isOk());
final SqlQueryOk queryOk = queryResult.getOk();
Assert.assertEquals(1, queryOk.getRowCount());
// get rows as list
final List<Row> rows = queryOk.getRowList();
// get rows as stream
final Stream<Row> rowStream = queryOk.stream();
rowStream.forEach(row -> System.out.println(row.toString()));
See read
HoraeDB support streaming writing and readingοΌsuitable for large-scale data reading and writingγ
final StreamWriteBuf<Point, WriteOk> writeBuf = client.streamWrite("machine_table");
for (int i = 0; i < 1000; i++) {
final Point point = Point.newPointBuilder("machine_table")
.setTimestamp(timestamp)
.addTag("city", "Beijing")
.addTag("ip", "10.0.0.3")
.addField("cpu", Value.withDouble(0.42))
.addField("mem", Value.withDouble(0.67))
.build();
writeBuf.writeAndFlush(Arrays.asList(point));
timestamp = timestamp+1;
}
final CompletableFuture<WriteOk> writeOk = writeBuf.completed();
See streaming
Under Apache License 2.0.
- Join the user group on DingTalk: 44602802