Giter Club home page Giter Club logo

Comments (12)

abhisaarsharma avatar abhisaarsharma commented on July 27, 2024

Did an initial commit with abstract class files, svm and utility classes. As expected, the svm at this time is giving zero true positive rate in the scikit implementation for rbf kernel.
True Positives 0
False Positives 483
True Negatives 92658
False Negatives 0

from astro_porject.

NobleKennamer avatar NobleKennamer commented on July 27, 2024

Checkout what I just added in the RR_Lyrae ipython notebook. I'm able to get a bit better performance using a linear kernal.

I'll also add a dataset for classification that is a bit easier tomorrow. So that way we can get models with higher classification

from astro_porject.

abhisaarsharma avatar abhisaarsharma commented on July 27, 2024

I tried the linear kernel which gives the same results as we obtained before.

The issue is not with the kernel, but the weights as we discussed earlier yesterday, we need a weight distribution such that we can minimize the false positive misclassification.
Running with linear kernel with 'auto' (deprecated) weights gives
True Positives 478
False Positives 5
True Negatives 89321
False Negatives 3337

Running the rbf kernel with 'balanced' (new heuristic) weights gives
True Positives 479
False Positives 4
True Negatives 89396
False Negatives 3262

Which is better than the linear case. I will try to look more into these weight distributions.

from astro_porject.

NobleKennamer avatar NobleKennamer commented on July 27, 2024

Yes, you are right. I was using auto for weights.

That might actually be a really interesting avenue to explore when writing the report.

from astro_porject.

abhisaarsharma avatar abhisaarsharma commented on July 27, 2024

Hey guys - so i am running into some problems in implementing the svm - the problem is we need to perform an optimization in the dual space for finding the lagrangian multipliers, this involves solving a quadratic convex optimization problem. For this i am using a convex optimizer package cvxopt. In all implementations i have found, we need to feed it an array or size N^2 (since we are in dual space) - but for our dataset this gives an out of memory error (N=number of data points). I will try to look now into scikits own implementation of this - might take me sometime

from astro_porject.

abhisaarsharma avatar abhisaarsharma commented on July 27, 2024

Hey guys - i am having some trouble integrating my python changes in a copied notebook - some of the things seem not to be running - eg. in the rr star - we are using np.load where we have never written import numpy as np, noble did you miss a line while committing? I wrote an import for that

Also the plots after i run a command are not showing up - just a reference to an object. I am not sure if there is a problem with the inline we have specified

from astro_porject.

NobleKennamer avatar NobleKennamer commented on July 27, 2024

That is what the %pylab interactive is doing.

from astro_porject.

abhisaarsharma avatar abhisaarsharma commented on July 27, 2024

Hmm, it's weird - I get np undefined error still if I remove the import.
Can you tell me which version of ipython / Jupyter you are using?
On Dec 3, 2015 11:28 AM, "Noble Kennamer" [email protected] wrote:

That is what the %pylab interactive is doing.


Reply to this email directly or view it on GitHub
#5 (comment)
.

from astro_porject.

NobleKennamer avatar NobleKennamer commented on July 27, 2024

I'm using Ipython 3.0.0. When you execute the cell %pylab inline does it say populating the interactive namespace?

from astro_porject.

abhisaarsharma avatar abhisaarsharma commented on July 27, 2024

Turns out it was a problem with my latest jupter - after uninstalling it, things seem to be working fine. I just made a copy of the RR_lyrae file and added an svm into it. For the coded version of svm, i found something called SMO Sequential minimal optimization that avoids constructing the NxN matrix in the dual space - i should be done with it by today probably. Please tell me when are you guys staring to write

from astro_porject.

abhisaarsharma avatar abhisaarsharma commented on July 27, 2024

I have completed a version of SMO for SVM - it is running but it is slow and takes several hours to converge to the solution. Working on optimizations to make it faster.

from astro_porject.

abhisaarsharma avatar abhisaarsharma commented on July 27, 2024

Optimized the code - however it is still taking a long time to finish execution - Running the code in pypy or cpython is expected to decrease the running time by a large factor - result on a small dataset showed reduction from 10 minutes to approximately a minute. I think i will commit for now and close the issue since we do not have enough time.

from astro_porject.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.