reality-augmentation-using-cognitive-services

Overview

Augmented reality is hot. Artificial intelligence is hot. Combining the two to create an augmented view of reality where pictures can be identified, tracked, and labeled with meaningful text is a truly fun and exciting experience. This page describes how to use Unity and Microsoft Cognitive Services to create an application to do just that. The application can be run as-is on the desktop emulator, and it can also be deployed onto the Microsoft HoloLens.

The application uses the Microsoft Computer Vision API, part of Microsoft Cognitive Services, to extract meaningful text from images. Vuforia is used for image detection and tracking. Microsoft Visual Studio is used to create script actions that call out to the Computer Vision API and also to deploy the application to the HoloLens.
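As a rough illustration of what "meaningful text" can look like, the sketch below picks the most confident caption out of a JSON payload shaped like the response of the Computer Vision describe operation. Whether this application calls that particular operation is an assumption, and both the sample payload and the `best_caption` helper are hypothetical.

```python
import json

# Hypothetical sample payload, shaped like a Computer Vision "describe" response.
SAMPLE_RESPONSE = """
{
  "description": {
    "tags": ["dog", "grass", "outdoor"],
    "captions": [
      {"text": "a dog sitting in the grass", "confidence": 0.92},
      {"text": "a dog in a field", "confidence": 0.71}
    ]
  }
}
"""


def best_caption(response_text):
    """Return the highest-confidence caption text, or None if there is none."""
    payload = json.loads(response_text)
    captions = payload.get("description", {}).get("captions", [])
    if not captions:
        return None
    # Pick the caption the service is most confident about.
    return max(captions, key=lambda c: c.get("confidence", 0.0))["text"]


if __name__ == "__main__":
    print(best_caption(SAMPLE_RESPONSE))
```

A label like this is the kind of text that could be attached to a tracked image in the scene.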

Prerequisites

This tutorial requires that you have a 64-bit Windows computer with a camera and an internet connection. The following software needs to be installed on your computer:

  1. Visual Studio 2017

    • If you do not have Visual Studio 2017 installed, download and run the Visual Studio installer
      • If Visual Studio is already installed, open the installer from the Settings > Apps control panel
    • Select the Universal Windows Platform development workload
    • Select the Game Development with Unity workload
      • You may deselect the Unity Editor optional component since you'll be installing a newer version of Unity from the instructions below

    All editions of Visual Studio 2017 are supported (including Community). While Visual Studio 2015 Update 3 is still supported, we recommend Visual Studio 2017 for the best experience.

  2. Unity Editor

    • Navigate to Download Unity
    • Scroll down and under Resources click Older versions of Unity
    • Scroll down and for Unity 2018.1.0 click Downloads (Win) (NOTE: Newer versions have deprecated the .NET scripting backend. We will discuss how to support more recent versions in future releases of this tutorial)
    • Select Unity Editor (64 bit)
    • Save the installer executable to your Downloads folder and double click it
    • Click Next, accept the terms of service, click Next, click Next, select the download folder, click Next, and click Finish

You will also need to set up some cloud services.

  1. Vuforia

  2. Microsoft Azure Resource Group

    • Navigate to Azure Portal
      • Sign in to the Azure Cloud if requested
    • Click Resource groups
    • In the blade that opens on the right, click + Add
    • In the blade on the right, fill in the following fields:
      • Resource group name: myCognitiveResources
      • Subscription: select your subscription
      • Resource group location: select West US 2
    • Click Create
  3. Microsoft Computer Vision API **

    • Navigate to Azure Portal
      • Sign in to the Azure Cloud if requested
    • Click Create a resource
    • Type Computer Vision in the search box
    • Click the item named Computer Vision
    • Click Create
    • In the dialog, fill in the following fields:
      • Name: Type in myComputerVision
      • Subscription: select your subscription
      • Pricing tier: select F0 for free
      • Resource group: select myCognitiveResources from the drop-down
    • Click Create
    • When it's ready, go to the resource group and click myComputerVision
    • Click Overview and copy Endpoint to a text file (you will use this later)
    • Click Keys and copy KEY 1 to a text file (you will use this later)
  4. Microsoft Translator Text API **

    • Navigate to Azure Portal
      • Sign in to the Azure Cloud if requested
    • Click Create a resource
    • Type Translator Text in the search box
    • Click the item named Translator Text
    • Click Create
    • In the dialog, fill in the following fields:
      • Name: Type in myTranslatorText
      • Subscription: select your subscription
      • Pricing tier: select F0 for free
      • Resource group: select myCognitiveResources from the drop-down
    • Click Create
    • When it's ready, go to the resource group and click myTranslatorText
    • Click Keys and copy KEY 1 to a text file (you will use this later)
  5. Microsoft Face API **

    • Navigate to Azure Portal
      • Sign in to the Azure Cloud if requested
    • Click Create a resource
    • Type Face in the search box
    • Click the item named Face
    • Click Create
    • In the dialog, fill in the following fields:
      • Name: Type in myFace
      • Subscription: select your subscription
      • Pricing tier: select F0 for free
      • Resource group: select myCognitiveResources from the drop-down
    • Click Create
    • When it's ready, go to the resource group and click myFace
    • Click Overview and copy Endpoint to a text file (you will use this later)
    • Click Keys and copy KEY 1 to a text file (you will use this later)

** You should hold off doing this until a few days before the tutorial, because free keys expire after one week. Otherwise, you might have to refresh your keys beforehand, which can take up to 10 minutes.
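
The endpoints and keys saved in the steps above are what a script sends along with each request. Here is a minimal sketch, assuming the key travels in the `Ocp-Apim-Subscription-Key` header and the image is posted as raw bytes. The `describe` path segment, the placeholder endpoint and key, and the `build_vision_request` helper are illustrative assumptions. The request is only constructed here, not sent.

```python
import urllib.request


def build_vision_request(endpoint, key, image_bytes, operation="describe"):
    """Build (but do not send) a POST request for a Computer Vision operation.

    `operation` is an assumed path segment; adjust it to the operation you call.
    """
    url = endpoint.rstrip("/") + "/" + operation
    headers = {
        "Ocp-Apim-Subscription-Key": key,            # KEY 1 from the Keys page
        "Content-Type": "application/octet-stream",  # image sent as raw bytes
    }
    return urllib.request.Request(url, data=image_bytes, headers=headers, method="POST")


if __name__ == "__main__":
    # Placeholder values; substitute the Endpoint and KEY 1 you copied earlier.
    request = build_vision_request(
        "https://example.invalid/vision/v2.0/", "<your-key>", b"placeholder image bytes"
    )
    print(request.get_method(), request.full_url)
```

Keeping the endpoint and key as parameters, rather than hard-coding them, makes it easier to swap in fresh keys when the free ones expire.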

Demos

Contributing

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Contributors

marabout2015, mobatmedia, roalexan
