esg-data-extraction's Introduction

Information extraction pipeline from ESG documents

The project requires the latest versions of conda and Python to be installed and available on your PATH.

1. Create a new environment in Conda using the command:
conda create -n <env> python==3.8

2. Install the dependencies using the requirements.txt file:

conda activate <env>
pip install -r requirements.txt

3. Create an instance in Neo4j AuraDB, get your database credentials, and save them in credentials.env.
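
As an illustration of how the saved credentials might be read back, here is a minimal sketch that parses a KEY=VALUE style file using only the standard library. The helper name load_credentials and the KEY=VALUE layout are assumptions, not taken from the project; use whatever keys and format the project's code expects.

```python
from pathlib import Path


def load_credentials(path="credentials.env"):
    """Parse KEY=VALUE lines from an env-style file into a dict."""
    credentials = {}
    for raw_line in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and lines without a KEY=VALUE shape.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        credentials[key.strip()] = value.strip().strip("'\"")
    return credentials
```

Calling load_credentials() from the root directory returns a plain dictionary, so a missing key shows up as a KeyError before any connection to the database is attempted.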

4. Download the pretrained ESG BERT model weights used for classification from the link below. Create a folder named bert-models in the root directory and place the contents of the Drive folder in it.

https://drive.google.com/drive/folders/1N7Biv16TCoK3LTFYihSPwHU6ZNrM6rvn?usp=sharing

5. Upload final_output.csv, which the pipeline writes to the outputs folder, to a cloud platform of your choice, then update neo1.txt in the neofiles directory with the link to the uploaded file. The upload can be automated with a function.
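
The last part of this step, rewriting the link inside neo1.txt, could be scripted along these lines. This is only a sketch: it assumes neo1.txt contains an http(s) URL pointing at the CSV, which the project does not confirm, and the function name update_csv_link is hypothetical. The upload itself is left to whichever cloud SDK you use.

```python
import re
from pathlib import Path

# Matches an http(s) URL up to the next whitespace or quote character.
URL_PATTERN = re.compile(r"https?://[^\s'\"]+")


def update_csv_link(query_file, new_url):
    """Replace the first URL found in query_file with new_url.

    Returns True if a URL was replaced, False if none was found.
    """
    path = Path(query_file)
    text = path.read_text(encoding="utf-8")
    # A function replacement keeps any backslashes in new_url literal.
    updated, count = URL_PATTERN.subn(lambda _match: new_url, text, count=1)
    if count:
        path.write_text(updated, encoding="utf-8")
    return bool(count)
```

For example, update_csv_link("neofiles/neo1.txt", link) would be called with the link returned by your upload function. A False return value means no URL was found and the file was left untouched.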

6. Start the web application on localhost from the Model directory using the following commands:
cd Model
uvicorn api:app --reload

The web application is now hosted on localhost.

Enter your database credentials to connect to your database.

You should be redirected to the Uploadfiles route, where you need to upload all the input files. Input files:

  • corpus.txt (should be sentence tokenized)
  • relations.txt
  • props.txt
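
The requirement on corpus.txt can be illustrated with a small sketch. It assumes "sentence tokenized" means one sentence per line, which the project does not spell out, and it uses a naive regular-expression splitter rather than a proper tokenizer, so abbreviations such as "e.g." will be split incorrectly. The function name split_into_sentences is hypothetical.

```python
import re

# Split after '.', '!' or '?' when followed by whitespace.
SENTENCE_BOUNDARY = re.compile(r"(?<=[.!?])\s+")


def split_into_sentences(text):
    """Return the sentences in text, one string per sentence."""
    # Collapse line breaks and repeated spaces before splitting.
    normalized = " ".join(text.split())
    if not normalized:
        return []
    return SENTENCE_BOUNDARY.split(normalized)
```

Writing "\n".join(split_into_sentences(raw_text)) to corpus.txt then gives a file with one sentence per line.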

You will then be redirected to AuraDB, where you can explore the knowledge graph using Cypher commands.

7. Deactivate the environment once you are done:
conda deactivate 

esg-data-extraction's People

Contributors

akashvana, preetam-kotteda
