zestdb's Introduction

ZestDB

For further details, please visit here.

zestdb's People

Contributors

jptmoore


zestdb's Issues

Restrict data types allowed in a time-series post

Currently, any JSON can be associated with a timestamp.

I think it makes more sense to restrict values to integers and floats, as this will allow for more efficient data handling.

Not sure if it makes sense to have a timestamp per data point or to allow multiple data points to be associated with a single timestamp, e.g.

[ (1511254543330, 1), (1511254543330, 2), (1511254543330, 3) ]

vs

1511254543330 => [1, 2, 3]
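The two candidate layouts above can be sketched as follows (a Python illustration only; ZestDB itself is written in OCaml/Reason, and neither layout is confirmed as the implementation):

```python
# Illustrative only: two candidate layouts for the same readings.

# Layout A: one (timestamp, value) pair per data point.
pairs = [(1511254543330, 1), (1511254543330, 2), (1511254543330, 3)]

# Layout B: multiple data points grouped under one shared timestamp.
grouped = {}
for ts, value in pairs:
    grouped.setdefault(ts, []).append(value)

# Both carry the same information; B stores the timestamp once.
```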

UTF8 in json data

When receiving data from Twitter, UTF-8 characters are often present in the JSON. This causes the store to return a bad request, and the data is not saved.

see https://pastebin.com/BLxMMMk2

The above is an example of one tweet that is successfully saved and one that is rejected by the store.
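A possible client-side workaround, sketched in Python, is to ASCII-escape the JSON before POSTing. This assumes the store accepts escape sequences like `\u00e9` (an assumption; the real fix belongs in the store's parser):

```python
import json

# A tweet-like payload containing non-ASCII characters.
tweet = {"text": "caf\u00e9 \u2615"}

# ensure_ascii=True (the default) escapes non-ASCII characters;
# ensure_ascii=False emits raw UTF-8, which the store currently rejects.
escaped = json.dumps(tweet)                  # ASCII-only output
raw = json.dumps(tweet, ensure_ascii=False)  # contains raw UTF-8 chars

# Both strings decode back to the same data.
assert json.loads(escaped) == json.loads(raw)
```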

New KV API needs a list keys

The new KV API needs an endpoint to list all the keys available under a datasource.

/<DsId>/list or something similar would be very useful.

Storage performance enhancements

Currently, all data associated with a datasource is written to the same path within the git file store. This can be further divided by another level of key (see #9), but it could also be divided internally through time-based buckets/bins to further restrict the amount of data at any one path. This needs investigating in terms of performance on a Pi 3.
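The time-based bucketing idea can be sketched as follows. The bucket size, path shape, and function name here are all hypothetical, chosen only to illustrate the technique:

```python
# Hypothetical sketch: derive a storage sub-path from a millisecond
# timestamp so each git path holds a bounded time range.
BUCKET_MS = 60 * 60 * 1000  # one bucket per hour (illustrative choice)

def bucket_path(datasource, ts_ms):
    bucket = ts_ms // BUCKET_MS
    return f"/ts/{datasource}/{bucket}"

# Writes within the same hour land in the same path.
assert bucket_path("lightbulb", 1511254543330) == bucket_path("lightbulb", 1511254543331)
```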

Bulk write of time series data

Add support for writing an array of values in a single POST.

This would require a timestamp to be included as part of an individual datapoint, e.g. {"timestamp": 11111111, "value": 42}. Currently we support timestamps being defined for a POST via the URI, which works for single-value POSTs.
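A bulk payload following the proposed per-datapoint shape might look like this (a sketch of what a client could build; the endpoint and accepted array form are assumptions):

```python
import json

# Hypothetical bulk payload: each datapoint carries its own timestamp,
# matching the {"timestamp": ..., "value": ...} shape proposed above.
points = [
    {"timestamp": 1511254543330, "value": 42},
    {"timestamp": 1511254543331, "value": 43},
]
body = json.dumps(points)  # sent as the body of a single POST
```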

Server config file

Currently the server is configured from command-line arguments, but this does not scale well as the options increase. We should replace this with a configuration file in some standard format.
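As one possible shape, a JSON config mirroring the current CLI flags could be loaded like this. The option names below are illustrative only, not an actual ZestDB format:

```python
import json

# Hypothetical config mirroring flags such as --secret-key-file and
# --enable-logging; names and structure are assumptions.
config_text = """
{
  "secret_key_file": "example-server-key",
  "enable_logging": true,
  "port": 5555
}
"""
config = json.loads(config_text)
```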

TimeStamp resolution and fast writes

It is possible to write to the store faster than the smallest resolution of the timestamp in milliseconds.

When this happens, the data returned by the latest/earliest API calls can be out of order, causing tests to fail.
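The problem, and one candidate mitigation, can be sketched in Python. Appending a per-write sequence number is an assumption for illustration, not ZestDB's actual behaviour:

```python
import itertools

# Two writes land within the same millisecond: identical timestamps.
writes = [(1511254543330, "a"), (1511254543330, "b")]
assert writes[0][0] == writes[1][0]  # indistinguishable by timestamp alone

# One possible fix (hypothetical): append a monotonically increasing
# sequence number so ordering is well-defined even within a millisecond.
counter = itertools.count()
keyed = [(ts, next(counter), v) for ts, v in writes]
assert sorted(keyed) == keyed  # order is now deterministic
```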

Query language

The current API is path based and only operates on a single stream of time-series data. This API does not scale well, nor does it support joining together and operating on data from multiple time-series streams. A useful addition would be a simple query language to give this type of flexibility.

Supporting out-of-order writing

The current backend log-based storage is suited to in-order time-series writes. Do we need to support out-of-order time-series writes?

Start up time - with data

Once a store has had a reasonable amount of data written to it, there is a delay at startup.

This is fine, but calls to the API during this time appear to succeed while returning no data.

I'll try to reproduce this outside of Databox so we can test it, and to convince myself that this is what's happening.

I'll update this issue with findings later.

Add server state statistics to hypercat

The hypercat can be used to contain more information on the state of the server. Currently it is just used to log which data sources exist. It could contain more information such as the number of records in the store, read/write access count, details on current observers, uptime etc.

Geotagging

At the moment you can tag a time-series value, but tags are restricted to a name and a string value. It would be good to extend this to allow some kind of geotagging, so that data points can be queried by location.
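A geotagged datapoint might look like the sketch below. The field names (`tags`, `lat`, `lon`) are purely illustrative, not an existing ZestDB schema:

```python
import json

# Hypothetical tagged datapoint extending the current name/string tags
# with latitude/longitude coordinates.
point = {
    "value": 21.5,
    "tags": {"room": "lounge", "lat": 51.4545, "lon": -2.5879},
}
payload = json.dumps(point)
```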

Time series ids

It would be useful to list what time series ids exist in a similar way to listing the keys within the key value store.

Installation on Raspbian OS

ZestDB has a Docker build for HypriotOS. Since Raspbian (the official OS) now supports Docker, is there any chance of a Docker build for it?

Bad requests for /ts sometimes do not respond

For example

A GET request for /ts/foo instead of /ts/foo/latest will hang clients, as will /ts/[key]/[any unsupported string].

To repeat:

docker run -p 5555:5555 -p 5556:5556 -d --name zest --rm jptmoore/zest /app/zest/server.exe --secret-key-file example-server-key --enable-logging

docker run --network host -it jptmoore/zest /app/zest/client.exe --server-key 'vl6wu0A@XP?}Or/&BR#LSxn>A+}L)p44/W[wXL3<' --path '/ts/foo' --payload '{"name":"dave", "age":30}' --mode post

docker run --network host -it jptmoore/zest /app/zest/client.exe --server-key 'vl6wu0A@XP?}Or/&BR#LSxn>A+}L)p44/W[wXL3<' --path '/ts/foo' --mode get

API extension: /ts/[id]/earliest

We have /ts/[id]/latest to get the latest data.

/ts/[id]/earliest would be handy for apps so that they know how much historical data is available.

New KV API Observe all keys

Currently, the observe endpoint will allow you to watch for changes on a single key /observe/<DsId>/<keyName>.

It would be useful to be able to watch for changes on any key, using something like this:

/observe/<DsId>

It should send

{"key":"<name of key>","value":"<data stored at key>"}

Move to Reason 3

Need to reformat syntax for Reason 3 (probably in a new branch)

Add extra key level

Both the KV and TS store could use an extra key to allow collections of data.

e.g. post to /ts/[datasource]/[key] => /ts/lightbulb/lounge

API enhancement: support a passthrough store type

This store type allows drivers to publish data without having it written to disk.

It's useful for larger data sources like IP cameras or high-frequency data.

Drivers can only write to the endpoint (no data is saved).

Apps can only observe the endpoint to get notification of new data.

Using Aggregation on an empty data source crashes the server

Calls to /ts/<id>/last/5/min, for example, before a driver has written any data will cause the following crash:

server.exe: main: report_error:Opps: (Invalid_argument "index out of bounds")Raised at file "format.ml" (inlined), line 239, characters 35-52
server.exe: main: Called from file "format.ml", line 465, characters 8-33
server.exe: main: Called from file "format.ml", line 480, characters 6-24
server.exe: main:
server.exe: main: report_error:Opps: ZMQ.ZMQ_exception(0, "Operation cannot be accomplished in current state")Raised at file "map.ml", line 122, characters 10-25
server.exe: main: Called from file "src0/sexp_conv.ml", line 149, characters 10-37
server.exe: main:
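The "index out of bounds" suggests the aggregation code indexes into an empty series. A minimal sketch of the defensive check implied by the crash (illustrative Python, not the OCaml implementation):

```python
# Guard aggregation against an empty data source instead of indexing
# into it, so a query before any write returns an empty result.
def last_n(series, n):
    if not series:
        return []  # empty source: return empty result rather than crash
    return series[-n:]

assert last_n([], 5) == []
assert last_n([1, 2, 3], 5) == [1, 2, 3]
```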
