geo-mllm's Introduction

Files Description

  1. Configuration: config.yaml
  2. Run inference with a multimodal LLM: infer.py
  3. Extract specific fields from the results (e.g., latitude and longitude): extract_info.py
  4. Check and refine the results with ChatGPT: check_GPT.py
  5. Change the prompting method: method.py
  6. Calculate the distance between prediction and ground truth: calc_dist.py
  7. Automation script: run.py
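
To illustrate step 3, here is a minimal sketch of pulling a latitude/longitude pair out of free-form model output. It is not the code in extract_info.py: the function name `extract_lat_lon`, the regular expression, and the assumption that the model writes coordinates as two comma-separated decimal numbers are all hypothetical.

```python
import re
from typing import Optional, Tuple

# Assumed format: a signed decimal number such as "48.8584" or "-2.2945".
_NUMBER = r"[-+]?\d+(?:\.\d+)?"
# Two such numbers separated by a comma, e.g. "48.8584, 2.2945".
_PAIR = re.compile(rf"({_NUMBER})\s*,\s*({_NUMBER})")


def extract_lat_lon(text: str) -> Optional[Tuple[float, float]]:
    """Return the first (latitude, longitude) pair in text that is in range."""
    for match in _PAIR.finditer(text):
        lat, lon = float(match.group(1)), float(match.group(2))
        # Reject pairs that cannot be coordinates.
        if -90.0 <= lat <= 90.0 and -180.0 <= lon <= 180.0:
            return lat, lon
    return None  # nothing usable was found
```

Returning `None` rather than raising lets a later step (such as the ChatGPT check in step 4) decide how to handle outputs without usable coordinates.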

JSON File Description

[
    {
        "image_file": "file_name",
        "gt": {
            "latitude": "value",
            "longitude": "value"
        },
        "model_1": {
            "method_1": {
                "output": " ",
                "latitude": " ",
                "longitude": " ",
                "location": " ",
                "xxx": " "
            }
        },
        "model_2": {
            "method_1": {
                "output": " ",
                "xxx": " "
            }
        }
    },
    ......
]

Field Description

  • image_file: The name of the image file.
  • gt: A nested object containing the ground-truth location.
    • latitude: The ground-truth latitude.
    • longitude: The ground-truth longitude.
  • model_name (e.g., model_1): A nested object containing the results for one model, keyed by method.
    • method (e.g., method_1): A nested object containing the results for one prompt method.
      • output: The output produced with this method.
      • latitude, longitude, location: Values extracted from the output, when available.
      • xxx: Other relevant information.
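
The structure above can be traversed as in the following sketch, which computes the distance between each prediction and the ground truth. It assumes great-circle (haversine) distance in kilometres, which may differ from what calc_dist.py actually does; the names `haversine_km` and `distances` and the sample record are hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    # Clamp to 1.0 to guard against floating-point overshoot before asin.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


def distances(records):
    """Yield (image_file, model, method, distance_km) for each usable prediction."""
    for record in records:
        gt_lat = float(record["gt"]["latitude"])
        gt_lon = float(record["gt"]["longitude"])
        for model, methods in record.items():
            if model in ("image_file", "gt"):
                continue  # every other key is treated as a model name
            for method, result in methods.items():
                try:
                    lat = float(result["latitude"])
                    lon = float(result["longitude"])
                except (KeyError, TypeError, ValueError):
                    continue  # coordinates missing or not numeric (e.g. " ")
                yield (record["image_file"], model, method,
                       haversine_km(gt_lat, gt_lon, lat, lon))


# Hypothetical record following the layout described above.
sample = [
    {
        "image_file": "example.jpg",
        "gt": {"latitude": "48.8584", "longitude": "2.2945"},
        "model_1": {"method_1": {"output": "...", "latitude": "48.8606", "longitude": "2.3376"}},
        "model_2": {"method_1": {"output": "..."}},
    }
]

for row in distances(sample):
    print(row)
```

Entries without parseable coordinates are skipped rather than counted as errors, so any aggregate statistic should also report how many predictions were usable.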

Demo

View Interactive Map on local browser

To try a simple demo, run:

python infer_demo.py --image-path /path/to/image

or open the notebook infer_demo.ipynb.

(Figures: demo image, demo map)

geo-mllm's People

Contributors

onlyairnopods
