awslabs / amazon-kinesis-aggregators Goto Github PK

Amazon Kinesis Aggregators provides a simple way to create real time aggregations of data on Amazon Kinesis.

License: Other

Java 99.22% HTML 0.22% CSS 0.56%

amazon-kinesis-aggregators's Issues

DefaultIdempotencyCheck doesn't actually do anything

The README documents the lastWriteSeq field in the aggregated DynamoDB table, so my impression was that this consumer application implements an idempotency mechanism to handle the scaling and failover cases in a KCL application.

However while browsing the source, I noticed that DefaultIdempotencyCheck's doProcess method returns true unconditionally, and looks like a placeholder implementation of IIdempotencyCheck.

I'm not clear if a full implementation of IIdempotencyCheck was simply omitted by mistake, or if the framework intends for the user to provide her own implementation. If a full implementation that uses lastWriteSeq was omitted, it should be added. If the framework intends for the user to supply an implementation, it should be documented as such.

Without one or the other, users may be lead to believe the aggregators are safe to use in scaling and failover scenarios when they're not.

Examples on S3 giving "access denied"

The examples in the README, such as https://s3-eu-west-1.amazonaws.com/meyersi-ire-aws/KinesisDynamicAggregators/sample/regex-aggregator.json, don't appear to be public and are returning 403.

Dynamo tables created in wrong region.

I have an EC2 instance running an aggregator app. It reads fine from a Kinesis Stream but the aggregated tables are generated in an incorrect region.
How do I specify the region of the DynamoDB tables?

BTW, the KCL app is creating tables in the right region.

SEVERE error when creating cloud watch metrics

I have just cloned the repo as of yesterday, and am trying to run the JAR file directly via command line on my local laptop (for testing).

I have configured the JSON config file to aggregate on MINNUTE, HOUR and FOREVER (see below)

[
{
"namespace":"TestJsonConfigApp",
"labelItems":["EmailAddress"],
"type":"COUNT",
"timeHorizons":["MINUTE","HOUR", "FOREVER"],
"dataExtractor":"JSON",
"dateItem":"EventDateTime",
"tableName":"TestTable",
"emitMetrics" : true,
"readIOPS":20,
"writeIOPS":40,
"IDataStore":"com.amazonaws.services.kinesis.aggregators.datastore.DevNullDataStore"
}
]

I have then added some events through the KCL libraries (which works fine), then I can see errors in the aggregator logs saying:

Unable to Parse Date Value H-1970-01-18 02:00:00

I can see this value in the DynamoDB table, but it looks like the data that the aggregator adds, not from me... what am I doing wrong?

The same happens for the minute aggregations, such as m-1970-01-18 ...

Publish to maven repository

As far as I can see current distribution jars are not published to Maven central repository. Are there any plans to do this?

NoSuchMethodError: com.amazonaws.transform.JsonUnmarshallerContext.getCurrentToken()

From https://forums.aws.amazon.com/thread.jspa?messageID=585971:

Generating Sensor 1
Exception in thread "main" java.lang.NoSuchMethodError: com.amazonaws.transform.JsonUnmarshallerContext.getCurrentToken()Lcom/fasterxml/jackson/core/JsonToken;
at com.amazonaws.services.kinesis.model.transform.PutRecordResultJsonUnmarshaller.unmarshall(PutRecordResultJsonUnmarshaller.java:40)
at com.amazonaws.services.kinesis.model.transform.PutRecordResultJsonUnmarshaller.unmarshall(PutRecordResultJsonUnmarshaller.java:31)
at com.amazonaws.http.JsonResponseHandler.handle(JsonResponseHandler.java:104)
at com.amazonaws.http.JsonResponseHandler.handle(JsonResponseHandler.java:41)
at com.amazonaws.http.AmazonHttpClient.handleResponse(AmazonHttpClient.java:730)
at com.amazonaws.http.AmazonHttpClient.executeHelper(AmazonHttpClient.java:417)
at com.amazonaws.http.AmazonHttpClient.execute(AmazonHttpClient.java:245)
at com.amazonaws.services.kinesis.AmazonKinesisClient.invoke(AmazonKinesisClient.java:2326)
at com.amazonaws.services.kinesis.AmazonKinesisClient.putRecord(AmazonKinesisClient.java:557)
at producer.SensorReadingProducer.run(SensorReadingProducer.java:151)
at producer.SensorReadingProducer.main(SensorReadingProducer.java:167)

Feature Request - Support for processing Dynamo DB Update Streams

Add support for Aggregators to be able to read not only from a Kinesis Stream, but also from a Dynamo DB Update Stream. Support existing serialisation models with content of Dynamo Stream Images.

process base64 encoded object from kinesis

Hi,

Is there a built in class for working with base64 encoded object? If not how would we go about supporting that?

Regards
Paul

Feature / Request for comment

Couple of ideas, In terms of the as you put it, more common "Querying for Data by Date"
What do think about an optional consistent read on the aggregate data like dateQuery?consistent=true
Any thoughts about being able to configure additional indexes in configuration, to query externally?
Or related suggestions for coordinating additional processing on the aggregate data; Just wondering...

FOREVER timeHorizon seems to set dateValue to * and issues this error...

17-Jun-2015 20:19:15.970 SEVERE [pool-1-thread-1] com.amazonaws.services.kinesis.aggregators.cache.AggregateCache.flush Metrics Emitter Exception - Aggregate Cache will NOT terminate
17-Jun-2015 20:19:15.970 SEVERE [pool-1-thread-1] com.amazonaws.services.kinesis.aggregators.cache.AggregateCache.flush java.text.ParseException: Unparseable date: "*"

awslabs / amazon-kinesis-aggregators Goto Github PK

amazon-kinesis-aggregators's Issues

DefaultIdempotencyCheck doesn't actually do anything

Examples on S3 giving "access denied"

Dynamo tables created in wrong region.

SEVERE error when creating cloud watch metrics

Publish to maven repository

NoSuchMethodError: com.amazonaws.transform.JsonUnmarshallerContext.getCurrentToken()

Feature Request - Support for processing Dynamo DB Update Streams

process base64 encoded object from kinesis

Feature / Request for comment

FOREVER timeHorizon seems to set dateValue to * and issues this error...

Config file support for region

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent