cfn-aws-cur-by-tag

Setup

Essentially, you need to setup the cost allocation tag, the CUR report, partitioning of that data and deploy these Lambda functions via CloudFormation to get this running.

1. Cost allocation tag

This could be configured on your AWS Billing Console. See AWS documentation for details.

2. Cost and Usage report (CUR)

This could also be configured on your AWS Billing Console or via the CLI. The important variables:

Report name ($CUR_NAME): Just has to be unique
Time unit ($TIME_UNIT, Values: HOURLY, DAILY): Units to use for report
S3 bucket for CUR ($CUR_BUCKET): Existing bucket for your CUR data

For CLI, this is what it would look like:

# Modify the following env vars
$ CUR_NAME=cur_foobar
$ TIME_UNIT=HOURLY
$ CUR_BUCKET=my_cur_bucket

$ aws cur put-report-definition --report-definition "ReportName=${CUR_NAME},S3Bucket=${CUR_BUCKET},TimeUnit=${TIME_UNIT},Format=textORcsv,Compression=GZIP,AdditionalSchemaElements=RESOURCES,S3Prefix='',S3Region=us-east-1,AdditionalArtifacts=QUICKSIGHT"

3. Athena partitioning of CUR data

See this AWS blog post for how to configure this with a CloudFormation stack that takes your CUR data and partitions it for Athena.

Note that Setting up the Amazon Quicksight visualizations as discussed on the post is not required.

The CloudFormation has 3 parameters that are worth taking a note of:

CostnUsageReport: Name of your CUR report ($CUR_NAME from previous step)
S3BucketName: Bucket where your Athena CUR data is partitioned
s3CURBucket: Bucket where your original CUR reports are saved by AWS ($CUR_BUCKET from previous step)

4. Deploy and configure the Lambda functions

Finally, you could deploy this CloudFormation stack. The stack consist of an S3 bucket and 3 Lambda functions:

get_tags: Exports the values of the cost allocation tag you want to breakup your report by.
query_athena: Queries Athena for line-items with each value of the cost allocation tag you chose.
copy_final_csv: Copies the reports to the S3 bucket you want to store the final output.

There are two parameters that are important:

TagKeyParam: The cost allocation tag you want to breakup your reports by.
OutputBucketParam: The S3 bucket to save the broken up reports.

To create the CloudFormation stack:

# Modify the following env vars
$ TAG_KEY=business_unit
$ OUTPUT_BUCKET=my-cur-reports
$ CFN_STACK=cur-by-tag-stack

# Create stack
aws cloudformation create-stack --template-body file://template.yaml --stack-name ${CFN_STACK} --capabilities CAPABILITY_IAM --parameters ParameterKey=TagKeyParam,ParameterValue="${TAG_KEY}" ParameterKey=OutputBucketParam,ParameterValue=${S3_BUCKET}

5. Create S3 trigger for Lambda function

Finally, once the stack is created, you need to configure the S3 trigger. To do this, you need two items:

$CUR_BUCKET: S3 bucket with the Athena partitioned CUR data
$GET_TAGS_ARN: Lambda ARN of the get_tags function

You could get $CUR_BUCKET from the parameters of the CloudFormation stack brought up in the 3rd section above.

For $GET_TAGS_ARN, you could retrieve that from the GetTagsFunctionArn output of the CloudFormation stack brought up in the 4th section. This could be retrieved via the AWS Console or via the CLI with a command like this:

$ aws cloudformation describe-stacks --stack-name bf-cur-by-tag-sam --query 'Stacks[0].Outputs'

Finally, to setup the S3 trigger, you could configure it via the AWS Console on the Lambda function or by using a command like this:

# Modify the following env vars
$ CUR_BUCKET=my_athena_cur_data
$ GET_TAGS_ARN=arn:aws:lambda:us-east-1:xxxxxx:function:xxxxxx-GetTagsFunction-xxxxxxxxx

# Sets up for the Lambda function to be triggered by new data in the S3 bucket
$ aws s3api put-bucket-notification-configuration --bucket $CUR_BUCKET --notification-configuration '{"LambdaFunctionConfigurations":[{"LambdaFunctionArn":"'${GET_TAGS_ARN}'","Events":["s3:ObjectCreated:*"],"Filter":{"Key":{"FilterRules":[{"Name":"Prefix","Value":"aws-athena-query-results/"},{"Name":"Suffix","Value":".txt"}]}}}]}'

Considerations

Important: Lambda IAM role is way too open, could be locked down significantly
I'm assuming us-east-1 everywhere right now so you may need to change a few things if you're using another region
Currently doesn't clean up Athena query output S3 files ($bucket/athena_out and $bucket/tags)
Doesn't create a report for non-tagged items
Not sure how it'll do with single digit months; we'll find it in January!

belminf / cfn-aws-cur-by-tag Goto Github PK

cfn-aws-cur-by-tag's Introduction

cfn-aws-cur-by-tag

Setup

1. Cost allocation tag

2. Cost and Usage report (CUR)

3. Athena partitioning of CUR data

4. Deploy and configure the Lambda functions

5. Create S3 trigger for Lambda function

Considerations

cfn-aws-cur-by-tag's People

Contributors

Watchers

cfn-aws-cur-by-tag's Issues

Lock down IAM roles for Lambda functions to specific resources {eom}

Race condition: Lambda invoked when new CSV added to Athena bucket, not when partition added

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent