Giter Club home page Giter Club logo

deepaligned-clustering's People

Contributors

dependabot[bot] avatar hanleizhang avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

deepaligned-clustering's Issues

最后测试时候为什么不直接用分类呢?

请问一下,在训练完之后,得到了bert分类模型,类别数是self.num_labels。
在测试时,我看代码中是这样处理:先用bert提取测试集的embedding,然后用kmeans聚类,类数是self.num_labels,得到聚类结果以后去和真实标签计算最优匹配。
那么请问测试时为什么不直接对测试集分类呢?直接分类应该也可以得到对应的label

您好,请问有尝试过预训练+Kmeans的方法吗

作者您好,最近在做新意图发现的工作,在复线这篇工作时,我发现直接在预训练模型的基础上+Kmeans在ACC和ARI两个数据上可以达到比预训练+DAC基本上高2-3个点,MNI基本上差不多。目前我也不确定这是否是正常的现象还是我使用有误,如果您之前有类似的经验,希望可以解答一下这个现象,十分感谢!
我的实验设置是banking+0.75(known)+0.1(labeled),K设置为77,对Kmeans的实现方法为直接在ModelManager的train之前调用test函数得到Kmeans结果。

Dataset

Hi,
I notice that this paper uses CLINC and BANKING dataset. Your previous work (Discovering new intents via constrained deep adaptive Clustering with Cluster Refinement) uses SNIPS, DBPedia, StackOverflow dataset. It seems that this two studies study the same task? And what is the benchmark dataset which used in the future from your perspective?

关于BERT参数冻结问题

感谢作者大大开源您的代码,我最近看了您的CDAC+和这篇模型,都发现您对BERT的前11层参数都采取了冻结的做法,我想问一下如果不冻结前11层参数,效果会是怎样的?还是说这样做是考虑到训练显存的问题,冻结前11层的参数可以在训练时设置更大的batch_size? 希望得到您的解答

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.