Giter Club home page Giter Club logo

master-thesis's Introduction

GitHub contributors GitHub issues GitHub forks

Exploring the Potential of Quantum NLP for Non-Functional Requirements Classification

Introduction

Repository of the research study conducted for the Software Engineering for Artificial Intelligence course 2022-23 and for my Master Thesis.

Quantum Natural Language Processing (QNLP) is an emerging field that leverages quantum computing principles to process and analyze natural language text. This thesis delves into the potential of QNLP within the domain of Requirements Engineering, specifically focusing on the classification of non-functional requirements (NFRs). In our approach, we leverage the DisCoCat framework to capture both distributional and compositional aspects of meaning. The former suggests that words with similar meanings statistically exhibit similar distributions in text, while the latter explains the sentence meaning through the combination of the word composing it according to grammatical rules.

The QNLP pipeline involves preprocessing the requirements description, parsing each sentence into string diagram using the DisCoCat framework, and parameterizing the diagrams as tensor networks or quantum circuits respectively for classical and quantum experiments.

This study aims to assess the classification capabilities of the quantum approach and the comparison with shallow ML models. Furthermore, the study aims to explore the comparison with other compositional models that are not grammar-based in order to assess the relevance of grammar in this context.

The empirical study conducted demonstrates the effectiveness of the DisCoCat framework in accurately classifying NFRs, showing that the grammar was an added value in this particular scenario. The comparison with classical language models reveals that quantum approach outperforms traditional approaches such as TF-IDF Naive Bayes, while exhibiting similarities to Word2Vec, albeit with fewer parameters.

Overall, this research contributes to the understanding of QNLP's applicability in real-world scenarios and lays the foundation for future advancements in the field.

Instructions

No installation is required, just open with Google Colab the notebooks in the /notebooks folder and everything will be ready to run experiments in a reproducible environment.

Programming languages and technologies

Python Badge

Lambeq Badge PyTorch Badge

Dataset

License

Distributed under the MIT License. See LICENSE for more information.

master-thesis's People

Contributors

mcalenda avatar

Stargazers

Salvatore avatar Leopoldo Todisco avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.