siddsax / xml-cnn
PyTorch implementation of the paper Deep learning for extreme multi-label text classification
Home Page: https://www.getmerlin.in
The K used for the top-K output is currently fixed.
Do you think training a classifier to predict the value of K for each input would be a good solution?
Thank you very much.
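For illustration, the difference between a fixed K and a per-input number of labels can be sketched as follows. This is a minimal sketch and not code from the repository: the function names are hypothetical, and score thresholding is shown only as one simple alternative that yields a variable number of labels per input; the 0.5 cutoff is an arbitrary assumption.

```python
def top_k_labels(scores, k):
    """Return the indices of the k highest scores (fixed K for every input)."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:k]


def threshold_labels(scores, threshold=0.5):
    """Return every index whose score reaches the threshold (K varies per input).

    The default threshold of 0.5 is an assumption, not a value from the repository.
    """
    return [i for i, s in enumerate(scores) if s >= threshold]


# Hypothetical per-label probabilities for one document.
scores = [0.9, 0.1, 0.7, 0.2]

print(top_k_labels(scores, 3))   # always three labels: [0, 2, 3]
print(threshold_labels(scores))  # only labels scoring >= 0.5: [0, 2]
```

With a fixed K of 3, the low-scoring label 3 is returned even though its score is 0.2; the thresholded version returns two labels for this input and could return more or fewer for another.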
Hi siddsax,
I want to ask about the pooling layer, because you are using sliding max pooling. XML-CNN uses dynamic max pooling, which the paper defines as follows: "For a document with m words, we evenly divide its m-dimensional feature map into p chunks, each chunk is pooled to a single feature by taking the largest value within that chunk".
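The quoted definition can be sketched in plain Python as below, next to a sliding-window max pool for comparison. This is a minimal sketch under stated assumptions, not the repository's code: the function names, the floor-based chunk boundaries, and the kernel size and stride values are all illustrative choices.

```python
def dynamic_max_pool(feature_map, p):
    """Split feature_map into p near-equal contiguous chunks; keep each chunk's max.

    The output always has exactly p values, whatever the input length m is.
    """
    m = len(feature_map)
    if not 1 <= p <= m:
        raise ValueError("p must be between 1 and len(feature_map)")
    pooled = []
    for i in range(p):
        start = i * m // p        # floor-based boundaries (an assumption)
        end = (i + 1) * m // p
        pooled.append(max(feature_map[start:end]))
    return pooled


def sliding_max_pool(feature_map, kernel_size, stride):
    """Max over a fixed-size window moved by stride; output length depends on m."""
    last_start = len(feature_map) - kernel_size
    return [max(feature_map[s:s + kernel_size])
            for s in range(0, last_start + 1, stride)]


short_doc = [1, 5, 2, 8, 3, 7]
long_doc = [1, 5, 2, 8, 3, 7, 4, 6]

print(dynamic_max_pool(short_doc, 2))     # [5, 8]
print(dynamic_max_pool(long_doc, 2))      # [8, 7]
print(sliding_max_pool(short_doc, 2, 2))  # [5, 8, 7]
print(sliding_max_pool(long_doc, 2, 2))   # [5, 8, 7, 6]
```

The dynamic version returns p features for both documents, while the sliding version returns more features as the document grows. In PyTorch, `torch.nn.AdaptiveMaxPool1d` plays a similar fixed-output-size role, though its chunk boundaries may differ from the ones assumed here.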
Saving Model to: Gen_data_CNN_Z_dim-100_mb_size-20_hidden_dims-512_preproc-0_loss-BCELoss_sequence_length-500_embedding_dim-300_params.vocab_size=30000
Traceback (most recent call last):
  File "main.py", line 57, in <module>
    x_tr, x_te, y_tr, y_te, params.vocabulary, params.vocabulary_inv, params = save_load_data(params, save=params.load_data)
  File "../utils\futils.py", line 153, in save_load_data
    x_tr = sparse.load_npz(params.data_path + '/x_train.npz')
  File "C:\Users\dc\anaconda\envs\riya\lib\site-packages\scipy\sparse\_matrix_io.py", line 131, in load_npz
    with np.load(file, **PICKLE_KWARGS) as loaded:
  File "C:\Users\dc\anaconda\envs\riya\lib\site-packages\numpy\lib\npyio.py", line 415, in load
    fid = open(os_fspath(file), "rb")
FileNotFoundError: [Errno 2] No such file or directory: '../datasets/rcv/x_train.npz'
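The traceback shows `sparse.load_npz` failing because `../datasets/rcv/x_train.npz` does not exist relative to the working directory. A check along the following lines could report missing files before loading starts. It is a sketch, not part of the repository: `missing_data_files` is a hypothetical helper, and `x_train.npz` is the only file name taken from the traceback.

```python
from pathlib import Path


def missing_data_files(data_path, filenames):
    """Return the names in filenames that are not regular files under data_path."""
    root = Path(data_path)
    return [name for name in filenames if not (root / name).is_file()]


# The path and file name below come from the traceback; a relative path is
# resolved against the current working directory, as in the failing run.
missing = missing_data_files("../datasets/rcv", ["x_train.npz"])
if missing:
    print("Missing data files:", ", ".join(missing))
```

Because the path is relative, the result depends on the directory the script is launched from, which is worth checking before concluding that the dataset itself is absent.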
Hi, when I followed the link to the RCV dataset, I got a "404 not found" error. Could you provide another link to the RCV dataset? If possible, could you also provide the other datasets used in your paper? It is a little hard for me to understand the code without the dataset. Thank you very much!
Hi there,
I am interested in trying XML-CNN on my own dataset. I have a collection of documents and their labels. Could you please help me understand how I can feed them to your tool? If you could provide samples, that would also be helpful. I tried to go through the RCV file mentioned in the README, but it is not really clear. Thanks.
Hello, I want to run XML-CNN on several benchmarks, but I don't know how to prepare the data. Can you provide the script you used for data preprocessing on WIKI31K, AMAZON and the others, and the download source for the datasets?
Thanks~