Mihir's Projects
A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations
The idea behind this project is to improve speech recognition for people with speech pathologies.
A systematically developed human-annotated dataset consisting of coherent summaries for five publicly available datasets and natural language user feedback, offering valuable insights into how to improve coherence in extractive summaries.
Genetic Algorithm (GA) for solving the Richardson arms race model for three countries
An evaluation dataset comprising 274 grid-based puzzles of varying complexity
In-BoXBART: Get Instructions into Biomedical Multi-task Learning
Data and code for Logic2Text dataset
This repo contains all the code required to run the experiments for our LogicAttack paper.
LogicBench is a natural language question-answering dataset consisting of 25 different reasoning patterns spanning propositional, first-order, and non-monotonic logics.
LongBoX: Evaluating Transformers on Long-Sequence Clinical Tasks
Dataset and code for “Going on a vacation” takes longer than “Going for a walk”: A Study of Temporal Commonsense Understanding, EMNLP 2019.
A comprehensive evaluation dataset encompassing multi-step logical reasoning with various inference rules and depths
Expanding natural instructions
CLP 576 SIA Project Fall 20
Open Review Toolkit: Better books, Higher sales, Increased access to knowledge
Solving a real-life problem in the speech pathology domain using machine learning techniques
A first-of-its-kind system in the speech domain that helps users train state-of-the-art classification models without hand-labeling training data.
The aim of this project is to make voice assistants somewhat more responsive to whispered speech.