Giter Club home page Giter Club logo

ie-semparse's Introduction

Alt IE-SemPArse

IE-SemParse

Implementation of the translation pipeline, automatic sampling and scoring,human evaluation and experiments of our NLP4ConvAI@ACL2023 paper: IE-SemParse: Evaluating Interbilingual Semantic Parsing for Indian Languages. To explore the dataset online visit dataset page.

Below are the details about the IE_SemParse datasets and scripts for reproducing the results reported in the NLP4ConvAI@ACL2023 Paper.

Task Description

Alt interbilingual_TOP

In this paper we proposed a novel task for Inter-Bilingual semantic parsing task where the utterance is in indic language and the model is required to generate logical form with english slot values.

Task Motivation

Alt Task Motivation

  1. Approach A: Translate to English then parse to logical form.
  2. Approach B: Separate parser and dialogue manager for each language
  3. Approach C: Inter-bilingual Semantic Parsing.

Inter-bilingual Semantic Parsing is a good middle ground approach to enhance model’s multilingual semantic parsing ability and reduce system latency and redundancy.

Translation pipeline

The code is present in the translation_notebooks folder containing 2 notebooks, namely dataset_translation.ipynb and post_processing.ipynb.

The Dataset IE-mTOP, IE-multilingualTOP and IE-multiATIS++ are present i huggingface datasets dataset page.

Experiments

We Experiment with 4 approached of train test strategies better described in the paper.

  1. Indic train: file
  2. Translate Test: file
  3. English+Indic Train: file
  4. Train All: file

To run all experiments in your setup just run the following

bash setup.
bash run_tests.sh

Results and Analysis

All the analysis notebooks are in analysis_notebooks folder.

LICENSE

All the datasets created as part of this work will be released under a CC-0 license and all models & code will be release under an MIT license

Citation

@misc{aggarwal2023evaluating,
      title={Evaluating Inter-Bilingual Semantic Parsing for Indian Languages}, 
      author={Divyanshu Aggarwal and Vivek Gupta and Anoop Kunchukuttan},
      year={2023},
      eprint={2304.13005},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

ie-semparse's People

Contributors

divyanshuaggarwal avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.