Giter Club home page Giter Club logo

sc-lstm's People

Contributors

hit-computer avatar smallt-tao avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

sc-lstm's Issues

为什么在generation的时候还是使用input_data?

在generation的时候,input_data应该是不知道的。然而在代码中仍使用:
line62: res_wr = tf.matmul(inputs[:, time_step, :], sc_wr)
line71: (cell_output, state, cell_outputs) = cell(inputs[:, time_step, :], state, sc_vec)

难道不是使用每个时刻预测的结果作为输入吗,是不是写错了?

请问运行preprocess.py报这个错,该怎么解决啊?

Traceback (most recent call last):
File "C:/Users/user/PycharmProjects/textgeneration/Preprocess.py", line 77, in
vocab, _ = Read_WordVec(config)
File "C:/Users/user/PycharmProjects/textgeneration/Preprocess.py", line 36, in Read_WordVec
assert len(wordLS) == config.vocab_size
AssertionError

大小写

DATE文件夹中TrainingData_Keywords.txt=====》需要改成TrainingData_keywords.txt才能运行。

关于文本生成模型的疑惑。

你好~
从之前的 char-rnn 开始关注,到如今的 SC-LSTM,我也一直觉得现在文本生成的质量还不过关,而这步却是很多其他模型的基础。
看到这个模型和那个基于关键字 Attention 的 Issue,有个小疑惑:用 RNN 做文本生成时,能否基于 Attention 考虑词性、位置等信息,或者使用 Beam Search 这样的一些技巧。

对训练的损失函数(cost Funtcion) 的疑问

您好,关于训练使用的损失函数,您是不是没有按照论文中给出损失函数来实现? 我看您是直接使用一个tensorflow 内置的方法 sequence_loss_by_example.
如果是我没注意,能否告诉我你是在哪里改进的损失函数?
如果没有的话,能否给改code实现一下哈? 按照你的这个写法,对我来说感觉不太好改code来实现论文的损失函数.

关于“段错误(核心已转储)”的错误

您好,感谢您的代码。我在运行train.py程序时,程序print出第一个epoch的learning rate,准备计算cost时,程序突然中断,并提示:段错误 (核心已转储)。请问您遇到过这个问题吗?可以提供一下解决思路吗?

关于大量数据训练的问题

您好,感谢您实现的代码。我训练网络的时候训练数据有100 0000条(一百多兆),预处理之后的数据文件就有49G,训练的时候都是out of memory,请问您是怎么解决大数据量的训练问题呢?

关于train.py报错

Epoch: 1 Learning rate: 0.0010
5-step perplexity: 24.645 cost-time: 2.26 s
10-step perplexity: 25.724 cost-time: 0.92 s
15-step perplexity: 26.257 cost-time: 0.96 s
20-step perplexity: 26.411 cost-time: 0.78 s
25-step perplexity: 25.416 cost-time: 0.73 s
在出现以上信息后提示
DataLossError (see above for traceback): truncated record at 3055259
调试了几次还是不行是tensorflow版本问题么?

Beam Search和Score

您好!
我最近看了SC-LSTM的这篇论文,对文章Decoding部分Rerank的内容不太理解,所以上来翻了这份源代码。
代码里面Beam Search部分的评分计算应该是“将每一步选择的词语对应的概率累加起来”,是吗?
我在论文里看到一个score评分项:
Score

想知道这个R在代码里面的哪一部分实现了吗?好像没有找到..

谢谢!

对于数据预处理的疑惑

在preprocess.py文件中:data[i][:_size] = tmp这一行代码,其中_size要小于num_steps,这就限制了关键字对应数据的大小,可不可以直接根据字符的长度进行数据处理呢

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.