Time series classification and clustering code written in Python.

Python 12.05% Jupyter Notebook 87.95%

time-series-classification-and-clustering's Introduction

Time Series Classification and Clustering

The work of Dr. Eamonn Keogh at University of California Riverside has shown that a good way to classify time series is with a k-NN algorithm using a dynamic time warping similarity measure.

This repo is meant to implement this time series classification method in Python. The same techniques are also extended to clustering time series.

I have also written a tutorial on this subject.

time-series-classification-and-clustering's People

Contributors

Stargazers

Watchers

Forkers

qingkaikong mboraiah mikkab swijal yusuke0519 cmiller8 pramodatre noelhx zangsir sandy4321 fabrol echohenry2006 ml-ai-nlp-ir geetua fs2013 maxtiff gokunwu greatshang kingkastle nionjo cc13ny vivianyang0821 mansimransingh margulies ramchandra94 madinghehe 66ly gershonc mmadsen balajikalluri yelshater spaxfiz pablogf75 tcarm002 diguabo kungfupandey mfcardenas testmana2 anuragreddygv323 semanticbeeng farazinux ptee snowdj mathematixy tomfisher antondv235 yue-zhao shubhabrataroy smbeli gnovelli aayush26 99sbr sundisktop muzhenxv paumaury bhaveshoswal diegslva appscluster minvex anhnguyendepocen kroid arunnairid metricle akansal1 lamichhanekamal rsantana-isg codeaudit andreas-koukorinis cyivy1992 simonsleo foxlisimulation vybhavk fengyin123 m7catsue ramaswamym1987 forkbackups e184633 linghongtao arrnos flamingofugang wuliwei9278 wendygao16 kapiya eehlise jazzman37 kylinliu nttrungmt hongminwu joleonar pingjingshensheng vipyoung zhd rickymos klaralenyu hiredd kianqunki codeofgod bearwilliamed cristianokiu xinruichen

time-series-classification-and-clustering's Issues

Error while recalculate centroids of clusters(int is not iterable)

    #recalculate centroids of clusters
        for key in self.assignments:
            clust_sum=0
            for k in self.assignments[key]:
                clust_sum=clust_sum+data[k]
            self.centroids[key]=[m/len(self.assignments[key]) for m in clust_sum]-- error in this line,int is not iterable.

change clust_sum declaration to np.zeros(len(data[0]))
change clust_sum=clust_sum+data[k] to clust_sum=np.add(clust_sum,data[k])

change it to following code

#recalculate centroids of clusters
for key in self.assignments:
clust_sum=np.zeros(len(data[0]))
for k in self.assignments[key]:
clust_sum=np.add(clust_sum,data[k])
self.centroids[key]=[m/len(self.assignments[key]) for m in clust_sum]

the k_means_clust function has a error

if closest_clust in assignments:
    assignments[closest_clust].append(ind)
else:
    assignments[closest_clust]=[]

this code is error! when the element match the clus , the element will not append to the clust ,it make the result error!the right code like this:

assignments.setdefault(closest_clust,[])
assignments[closest_clust].append(ind)

Description of a Data

Hi Alex, Can you please tell about/ describe a bit of data. What number or the row suggest what.?

How to return cluster labels for data sampled.

I see that tslearn.clustering natively supports dynamic time warping via TimeSeriesKMeans.

However, TimeSeriesKMeans is quite slow. I would like to use this implementation which from the code looks like it has more optimization via locality constraints. I'm not sure it's actually faster but I have my fingers crossed it is.

Can someone point me to how I can use @alexminnaar's implementation to output cluster labels per series?

I can see that this implementation robustly outputs the average cluster curves but I don't see it outputting the labels for the entire time series data.

I suspect it is in this block, but I'm having trouble parsing it.
if closest_clust in assignments: assignments[closest_clust].append(ind) else: assignments[closest_clust]=[]

Any pointers would be appreciated.

what is function（“compa_clust”）do？

have a doubt

Thanks for the excellent ipython notebook.
I have a doubt, can you please clarify.
Using k-means clustering, what is the conclusion? there are no clusters as such.
How can we predict cluster number for a new input, based on such results?

Please revert back.

Notebook wont load

Hello,

For some reason I can't load your notebook on

http://nbviewer.ipython.org/

"TypeError: 'NoneType' object is not iterable" ERROR in Clustering

I've followed the exact method (for Kmean Clustering) you've written in the tutorial.
But it generates following error:

100
iteration 1
100
iteration 2
69
iteration 3
48
iteration 4
38
Traceback (most recent call last):
  File "spatio-time cluster.py", line 177, in <module>
    for i in centroids:
TypeError: 'NoneType' object is not iterable

The datasets are same as your Github repo. I am also attaching the code. Please let me know, where I am doing wrong.

alexminnaar / time-series-classification-and-clustering Goto Github PK

time-series-classification-and-clustering's Introduction

Time Series Classification and Clustering

time-series-classification-and-clustering's People

Contributors

Stargazers

Watchers

Forkers

time-series-classification-and-clustering's Issues

Error while recalculate centroids of clusters(int is not iterable)

the k_means_clust function has a error

Description of a Data

How to return cluster labels for data sampled.

what is function（“compa_clust”）do？

have a doubt

Notebook wont load

"TypeError: 'NoneType' object is not iterable" ERROR in Clustering

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent