AI Sight Quest

AI Sight Quest is an iOS application that combines machine learning models with Apple's frameworks. Users can take a photo or select one from their gallery within the app, capture its text using Apple's Vision framework, and locate relevant passages in the document by asking a question of the Bidirectional Encoder Representations from Transformers (BERT) model.

Demo

Discover AI Sight Quest

Demo In Light Mode

Overview

All computation and processing happen locally on the device, so the app makes no network API calls and does not rely on external servers. User data stays on the device, which protects privacy and avoids network round trips.
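As one illustration of on-device processing, text capture with the Vision framework might look like the sketch below. The function name `recognizeText(in:)` is hypothetical and not taken from the project, and the typed `results` property assumes an iOS 15 or later SDK.

```swift
import CoreGraphics
import Vision

/// Hypothetical helper: recognizes text in an image on the device
/// and returns the recognized lines joined by newlines.
func recognizeText(in image: CGImage) throws -> String {
    let request = VNRecognizeTextRequest(completionHandler: nil)
    request.recognitionLevel = .accurate      // favor accuracy over speed
    request.usesLanguageCorrection = true     // apply language-based correction

    // perform(_:) runs synchronously, so call this off the main thread.
    let handler = VNImageRequestHandler(cgImage: image, options: [:])
    try handler.perform([request])

    // Each observation is one detected text region; keep its best candidate.
    let observations = request.results ?? []
    let lines = observations.compactMap { $0.topCandidates(1).first?.string }
    return lines.joined(separator: "\n")
}
```

No network request is involved at any point; the image and the recognized text never leave the device.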

The app finds answers directly within the scanned text using the Core ML framework and a question-answering AI model.
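Extractive question-answering models of the BERT SQuAD kind typically score every token as a possible start and end of the answer. The sketch below shows how an answer span could be chosen from such scores. It assumes the model output has already been converted to two plain arrays; the function name `bestSpan` and the sample numbers are illustrative and not taken from the project.

```swift
/// Picks the token span with the highest combined start + end score.
/// `maxLength` caps the number of tokens in the answer.
func bestSpan(startLogits: [Double],
              endLogits: [Double],
              maxLength: Int) -> (start: Int, end: Int)? {
    guard startLogits.count == endLogits.count,
          !startLogits.isEmpty,
          maxLength > 0 else { return nil }

    var best: (start: Int, end: Int, score: Double)? = nil
    for start in startLogits.indices {
        // The end token may not precede the start or exceed the length cap.
        let lastEnd = min(start + maxLength - 1, endLogits.count - 1)
        for end in start...lastEnd {
            let score = startLogits[start] + endLogits[end]
            if let current = best, current.score >= score { continue }
            best = (start: start, end: end, score: score)
        }
    }
    return best.map { (start: $0.start, end: $0.end) }
}

// Usage with made-up scores for a five-token document.
let sampleTokens = ["the", "app", "runs", "on", "device"]
let sampleSpan = bestSpan(startLogits: [0.1, 0.2, 0.3, 2.0, 0.5],
                          endLogits: [0.0, 0.1, 0.2, 0.4, 3.0],
                          maxLength: 3)
if let sampleSpan {
    print(sampleTokens[sampleSpan.start...sampleSpan.end].joined(separator: " "))
    // prints "on device"
}
```

Mapping the chosen token range back to character offsets in the scanned text is what lets the answer be highlighted inside the document.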

Interacting with the captured text is easy: users can edit it or have it read aloud through text playback built on Apple's Speech Framework.
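A minimal sketch of text playback follows. It assumes speech synthesis is done with `AVSpeechSynthesizer` from AVFoundation, which may differ from how the project actually implements playback; the `TextReader` class is a hypothetical name.

```swift
import AVFoundation

/// Hypothetical wrapper that reads captured text aloud.
final class TextReader {
    // Keep a strong reference; speech stops if the synthesizer is deallocated.
    private let synthesizer = AVSpeechSynthesizer()

    func speak(_ text: String, languageCode: String = "en-US") {
        let utterance = AVSpeechUtterance(string: text)
        utterance.voice = AVSpeechSynthesisVoice(language: languageCode)
        utterance.rate = AVSpeechUtteranceDefaultSpeechRate
        synthesizer.speak(utterance)
    }

    func stop() {
        synthesizer.stopSpeaking(at: .immediate)
    }
}
```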

Users can customize their reading experience. They can change the font size or switch between dark and light color schemes, ensuring optimal readability in various environments.
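The following sketch shows one way such reading preferences could be stored and applied. It assumes a SwiftUI interface, which the text above does not confirm; the view name `ReaderView` and the storage keys are hypothetical.

```swift
import SwiftUI

/// Hypothetical reader screen with persisted font size and color scheme.
struct ReaderView: View {
    // @AppStorage persists the values in UserDefaults under these keys.
    @AppStorage("readerFontSize") private var fontSize: Double = 17
    @AppStorage("readerUsesDarkMode") private var usesDarkMode = false

    let text: String

    var body: some View {
        VStack {
            ScrollView {
                Text(text)
                    .font(.system(size: CGFloat(fontSize)))
                    .padding()
            }
            Slider(value: $fontSize, in: 12...32, step: 1)
            Toggle("Dark mode", isOn: $usesDarkMode)
        }
        .padding()
        .preferredColorScheme(usesDarkMode ? .dark : .light)
    }
}
```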

To keep the design versatile without sacrificing performance, we chose vector-based Lottie animations over traditional methods, which keeps the visuals smooth and efficient.

By integrating Apple's TipKit Framework into the application, users benefit from valuable tips and insights, enhancing their experience and effectively guiding them through the app's features.
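As a rough illustration of the TipKit integration, a tip can be declared as a type and attached to a view. The tip text, the type names `ScanDocumentTip` and `ScanButton`, and the helper `configureTips(resetFirst:)` are hypothetical; TipKit requires iOS 17 or later.

```swift
import SwiftUI
import TipKit

/// Hypothetical tip that introduces the scanning feature.
struct ScanDocumentTip: Tip {
    var title: Text { Text("Scan a document") }
    var message: Text? {
        Text("Take a photo or pick one from your gallery to capture its text.")
    }
    var image: Image? { Image(systemName: "doc.text.viewfinder") }
}

struct ScanButton: View {
    private let tip = ScanDocumentTip()

    var body: some View {
        Button("Scan") {
            // Start the capture flow here.
        }
        .popoverTip(tip)  // shows the tip anchored to the button
    }
}

/// Call once at launch. Resetting must happen before configuring,
/// which makes previously dismissed tips eligible to appear again.
func configureTips(resetFirst: Bool = false) {
    do {
        if resetFirst { try Tips.resetDatastore() }
        try Tips.configure()
    } catch {
        print("TipKit setup failed: \(error)")
    }
}
```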

Installation and Requirements

To set up and run the application, please follow these steps:

  1. Begin by cloning the repository onto your local machine.
  2. Because the BERT model is large, it is not included in the repository. Download it from the following link:

BERT Model Download Link

  3. Rename the downloaded BERT model to BERTSQUAD.mlmodel

  4. Open the project in Xcode and add the renamed model to the project at the following location:

    ./AISightQuest/Resources/BERT Core ML Model Resources

  5. Finally, build and run the application on either the iOS simulator or a connected device.

Contribution

Feel free to share ideas or report problems. Pull requests are welcome! For major changes, please open an issue first to discuss what you would like to change.

License

AI Sight Quest is released under the MIT license. See LICENSE for more information.

Contributors

nsswifter

aisightquest's Issues

Implement TipKit

  • Implement TipKit that helps people discover features in the app
  • Add a feature to reset the tips

Question Answering Model Name

Because the question-answering model must be added to the project manually, the name of the model file may change from one download to another. To address this, the project uses a single unified name for the model, and the model file must be renamed before it is imported.
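The sketch below illustrates why a fixed name helps: code that loads the model looks it up by name. It assumes the unified name is `BERTSQUAD`, as in the installation steps, and relies on Xcode compiling an added `.mlmodel` file into an `.mlmodelc` bundle resource. The function and error names are hypothetical.

```swift
import CoreML
import Foundation

enum ModelLoadingError: Error {
    case missingModel(String)
}

/// Hypothetical loader that finds the compiled model by its unified name.
func loadQuestionAnsweringModel(named name: String = "BERTSQUAD",
                                in bundle: Bundle = .main) throws -> MLModel {
    // A renamed or missing model file makes this lookup fail.
    guard let url = bundle.url(forResource: name, withExtension: "mlmodelc") else {
        throw ModelLoadingError.missingModel(name)
    }
    let configuration = MLModelConfiguration()
    configuration.computeUnits = .all  // let Core ML choose CPU, GPU, or Neural Engine
    return try MLModel(contentsOf: url, configuration: configuration)
}
```

Xcode also generates a Swift class named after the model file, so a different file name would change the class name that the rest of the code refers to.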

Implement First Module

  • Main view containing the list of sessions, with support for adding, editing, removing, and renaming list items
  • Choose the app's architecture pattern among MVVM, VIP, or Clean Architecture
