
SimChart9K

SimChart9K: An LLM-based Simulated Visual Chart Understanding Benchmark

We augment data for chart perception and reasoning with an LLM-based self-inspection data production scheme. The resulting SimChart9K dataset contains 9,536 simulated chart images with associated data annotations in CSV format. We also observe that StructChart's chart perception performance improves continuously as more simulated charts are used for pre-training.
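Since each chart image comes with a CSV annotation, a loader typically needs to pair the two. The following is a minimal sketch of one way to do that, not the official loader. It assumes a hypothetical layout in which images and CSV files live in two directories and share a filename stem; the function names and the directory layout are illustrative assumptions, so adjust them to the structure of the downloaded archive.

```python
import csv
from pathlib import Path

# Image extensions to consider; an assumption, adjust to the actual files.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def pair_charts_with_annotations(image_dir, csv_dir):
    """Match each chart image to the CSV file that shares its filename stem.

    Hypothetical helper: assumes `image_dir/<name>.png` corresponds to
    `csv_dir/<name>.csv`. Images without a matching CSV are skipped.
    """
    csv_by_stem = {path.stem: path for path in Path(csv_dir).glob("*.csv")}
    pairs = []
    for image_path in sorted(Path(image_dir).iterdir()):
        if image_path.suffix.lower() not in IMAGE_SUFFIXES:
            continue
        csv_path = csv_by_stem.get(image_path.stem)
        if csv_path is not None:
            pairs.append((image_path, csv_path))
    return pairs


def read_annotation(csv_path):
    """Read one CSV annotation into a list of rows (each a list of strings)."""
    with open(csv_path, newline="", encoding="utf-8") as handle:
        return list(csv.reader(handle))
```

Calling `pair_charts_with_annotations("images", "annotations")` returns a list of `(image_path, csv_path)` tuples, where both directory names are placeholders.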

SimChart9K Dataset Download from Google Drive

Download the official SimChart9K dataset from Google Drive.

SimChart9K Dataset Download from OpenDataLab

a. Register an account on the OpenXLab website:

https://openxlab.org.cn/home

b. Install the dependencies and log in:

  • Install the openxlab package.
      pip install openxlab
  • Obtain your Access Key and Secret Key on the OpenXLab website under Account Security.
  • Log in to OpenXLab with the Access Key and Secret Key.
      openxlab login

c. Download the SimChart9K dataset with the following command:

openxlab dataset get --dataset-repo Lonepic/SimChart9K
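
For scripted setups, the download step can be wrapped in Python. This is a minimal sketch that simply shells out to the same CLI command shown above; the helper names `build_download_command` and `download_dataset` are hypothetical, and the sketch assumes `openxlab login` has already been completed in the same environment.

```python
import shutil
import subprocess

# Dataset repository name, taken from the command in this README.
DATASET_REPO = "Lonepic/SimChart9K"


def build_download_command(dataset_repo=DATASET_REPO):
    """Return the CLI invocation from step (c) as an argument list."""
    return ["openxlab", "dataset", "get", "--dataset-repo", dataset_repo]


def download_dataset(dataset_repo=DATASET_REPO):
    """Run the download, failing early if the openxlab CLI is not installed."""
    if shutil.which("openxlab") is None:
        raise RuntimeError(
            "openxlab CLI not found; run `pip install openxlab` first"
        )
    # check=True raises CalledProcessError if the CLI exits with an error.
    subprocess.run(build_download_command(dataset_repo), check=True)
```

Calling `download_dataset()` runs the same command as step (c), so it behaves exactly like the manual invocation.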

t-SNE Comparisons with Real Chart Datasets

Feature distribution (t-SNE) of real datasets.

Feature distribution (t-SNE) of both real datasets and SimChart9K.

Visualization Examples

Visualization results using the proposed StructChart on different chart-related reasoning tasks including Question Answering (QA), Summarization, and Redrawing.


Citation

Please consider citing our work if this dataset is helpful for your research:

@article{xia2023structchart,
  title={StructChart: Perception, Structuring, Reasoning for Visual Chart Understanding},
  author={Xia, Renqiu and Zhang, Bo and Peng, Haoyang and Ye, Hancheng and Yan, Xiangchao and Ye, Peng and Shi, Botian and Yan, Junchi and Qiao, Yu},
  journal={arXiv preprint arXiv:2309.11268},
  year={2023}
}
