Giter Club home page Giter Club logo

deepaligned-clustering's Introduction

Discovering New Intents with Deep Aligned Clustering

A deep aligned clustering method to discover new intents.

The proposed method together with baselines are also integrated into the open intent discovery module in our another scalable framework TEXTOIR, enjoy it!

Introduction

This repository provides the official PyTorch implementation of the research paper Discovering New Intents with Deep Aligned Clustering (Accepted by AAAI2021).

Related works can refer to the reading list.

Dependencies

We use anaconda to create python environment:

conda create --name python=3.6

Install all required libraries:

pip install -r requirements.txt

Model Preparation

Get the pre-trained BERT model and convert it into Pytorch.

Set the path of the uncased-bert model (parameter "bert_model" in init_parameter.py).

Usage

Run the experiments by:

sh scripts/run.sh

You can change the parameters in the script. The selected parameters are as follows:

dataset: clinc | banking
factor_of_clusters: 1 (default) | 2 | 3 | 4 
known_class_ratio: 0.25 | 0.5 | 0.75 (default)

Model

The model architecture of DeepAligned: Model

Results

The detailed results can be seen in known_intent_ratio_results.csv and k_results.csv.

Main experiments

CLINC BANKING
Method NMI ARI ACC NMI ARI ACC
KM 70.89 26.86 45.06 54.57 12.18 29.55
AG 73.07 27.70 44.03 57.07 13.31 31.58
SAE-KM 73.13 29.95 46.75 63.79 22.85 38.92
DEC 74.83 27.46 46.89 67.78 27.21 41.29
DCN 75.66 31.15 49.29 67.54 26.81 41.99
DAC 78.40 40.49 55.94 47.35 14.24 27.41
DeepCluster 65.58 19.11 35.70 41.77 8.95 20.69
PCK-means 68.70 35.40 54.61 48.22 16.24 32.66
BERT-KCL 86.82 58.79 68.86 75.21 46.72 60.15
BERT-MCL 87.72 59.92 69.66 75.68 47.43 61.14
CDAC+ 86.65 54.33 69.89 72.25 40.97 53.83
BERT-DTC 90.54 65.02 74.15 76.55 44.70 56.51
DeepAligned 93.89 79.75 86.49 79.56 53.64 64.90

Ablation study

Method CLINC BANKING
Method NMI ARI ACC NMI ARI ACC
w/o Pre + Reinit 57.80 9.63 23.02 34.34 4.49 13.67
w/o Pre + Align 62.53 14.10 28.63 36.91 5.23 15.42
Pre + Reinit 82.90 45.67 55.80 68.12 31.56 41.32
Pre + Align 93.89 79.75 86.49 79.56 53.64 64.90

If you are insterested in this work, and want to use the codes or results in this repository, please star this repository and cite by:

@article{Zhang_Xu_Lin_Lyu_2021, 
    title={Discovering New Intents with Deep Aligned Clustering}, 
    volume={35}, 
    number={16}, 
    journal={Proceedings of the AAAI Conference on Artificial Intelligence}, 
    author={Zhang, Hanlei and Xu, Hua and Lin, Ting-En and Lyu, Rui}, 
    year={2021}, 
    month={May}, 
    pages={14365-14373}
}

Acknowledgments

This paper is founded by seed fund of Tsinghua University (Department of Computer Science and Technology)- Siemens Ltd., China Joint Research Center for Industrial Intelligence and Internet of Things.

deepaligned-clustering's People

Contributors

hanleizhang avatar dependabot[bot] avatar

Stargazers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.