zeeraktalat / mlearn Goto Github PK
View Code? Open in Web Editor NEWML API in which model building will be streamlined
License: MIT License
ML API in which model building will be streamlined
License: MIT License
Could the encoded data be stored in the object, provided that it is only the indices of the tokens that are stored?
This could potentially save time as indices would not need to be recomputed, only situating it in the tensor would need to be done.
Set up Hyper parameter fees for (hyper-)parameter optimisation for neural nets and sklearn models.
GeneralDataset.load_labels is not implemented.
In 2 weeks
Tomorrow
We want to update the prediction writer so that it it writes a file with format
docID, original_Text, model-1 prediction, model-2 prediction, ..., model-n prediction
The head for the models should be the name of the model, not the hyper-parameters (hyper-parameters should be available from file detailing which model is the best).
Ensure a log file is written for each module individually as well as a single log for the entire API
Methods to wrap around sklearn API so that they become a single line call.
When using the tensorisation I created, the ML model tends to not learn anything. Should shift to Cython/C implementation of the tensorisation and have python call the C module.
Add wrapper around the GP library so that single line call is all it requires to run a model
Two different logs:
Wednesday 11 AM
In a week
Today
Tomorrow
Add missing tests for modules:
Add tests for missing lines in:
Next week
Error is thrown in loading preprocessor if liwc_dir = None when preprocessor is initialized.
Megan: "can't have lists as default args has to be e.g. None and then at the top of the function: if blah is None: blah = that list"
Add layer over GPy
The LIWC computation takes a subword into account and not just the full word when considering kleene-starred tokens, e.g. abuse*.
#OnlineAbuse
triggers abuse*,AFFECT
, abuse*,NEGEMOTION
etc. It should trigger UNK.
20th of June
Add layer over GPy
Today
Add logging for when exceptions are caught.
In a week
Early stopping will overwrite the best performing model if multiple models are used. E.g. cnn, lstm, mlp are all used then it'll first store the best cnn, then the best lstm, and finally the best mlp in the same file name.
Consider moving away from SpaCy to Ekphrasis
June 13, 2018
17th of June, 2018
14th of June
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.