Giter Club home page Giter Club logo

pemt's People

Contributors

agiani99 avatar yojanagadiya avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

pemt's Issues

chembl to surechembl mapping file source and mapping method question

Hello,

The chembl to surechembl mapping file used in the patent_chemical_harmonizer.py file from the Fraunhofer-ITMP/PEMT Github repository, as mentioned in the paper, seems to have been created based on cross-reference information between the SureChEMBL database and the ChEMBL database.
(https://github.com/Fraunhofer-ITMP/PEMT/blob/main/src/pemt/patent_extractor/patent_chemical_harmonizer.py#:~:text=https%3A//raw.githubusercontent.com/Fraunhofer%2DITMP/PEMT/main/data/mapper/chemical_mapper.json)

However, neither Chebml nor surechembl can find a connection to each other.

If you have used crawling or used other DBs or APIs, please let me know the URL link of the database or API you used.
Also, it would be appreciated if you could tell us the mapping method.

Using this, if what I test produces good results,
I plan to publish it as a paper.
Then, the PEMT papers you published can also benefit from being cited.

thank you

[BUG]: Missing data in PyPI package

Describe the bug
The chemical extractor fails due to a missing chembl_uniprot_mapping.txt file. It seems installing from PyPI omits the data directory.
After manually copying data files from the github repo to the expected directory, the error is gone (and the tool runs into a probably unrelated 500 internal server error by ebi.ac.uk).

To Reproduce
Steps to reproduce the behavior:

conda create --prefix /tmp/pemt python=3.8
conda activate /tmp/pemt
pip install PEMT
pemt run-chemical-extractor --name=symbol --data=/tmp/symbol.tsv  --uniprot

Expected behavior
Program run with exit code 0.

Error

Starting the chemical extractor pipeline for symbol
Traceback (most recent call last):
  File "/tmp/pemt/bin/pemt", line 8, in <module>
    sys.exit(main())
  File "/tmp/pemt/lib/python3.8/site-packages/click/core.py", line 829, in __call__
    return self.main(*args, **kwargs)
  File "/tmp/pemt/lib/python3.8/site-packages/click/core.py", line 782, in main
    rv = self.invoke(ctx)
  File "/tmp/pemt/lib/python3.8/site-packages/click/core.py", line 1259, in invoke
    return _process_result(sub_ctx.command.invoke(sub_ctx))
  File "/tmp/pemt/lib/python3.8/site-packages/click/core.py", line 1066, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/tmp/pemt/lib/python3.8/site-packages/click/core.py", line 610, in invoke
    return callback(*args, **kwargs)
  File "/tmp/pemt/lib/python3.8/site-packages/pemt/cli.py", line 98, in run_chemical_extractor
    gene_chemical_dict = extract_chemicals(
  File "/tmp/pemt/lib/python3.8/site-packages/pemt/chemical_extractor/experimental_data_extraction.py", line 138, in extract_chemicals
    chembl_mapper = pd.read_csv(
  File "/tmp/pemt/lib/python3.8/site-packages/pandas/util/_decorators.py", line 311, in wrapper
    return func(*args, **kwargs)
  File "/tmp/pemt/lib/python3.8/site-packages/pandas/io/parsers/readers.py", line 586, in read_csv
    return _read(filepath_or_buffer, kwds)
  File "/tmp/pemt/lib/python3.8/site-packages/pandas/io/parsers/readers.py", line 482, in _read
    parser = TextFileReader(filepath_or_buffer, **kwds)
  File "/tmp/pemt/lib/python3.8/site-packages/pandas/io/parsers/readers.py", line 811, in __init__
    self._engine = self._make_engine(self.engine)
  File "/tmp/pemt/lib/python3.8/site-packages/pandas/io/parsers/readers.py", line 1040, in _make_engine
    return mapping[engine](self.f, **self.options)  # type: ignore[call-arg]
  File "/tmp/pemt/lib/python3.8/site-packages/pandas/io/parsers/c_parser_wrapper.py", line 51, in __init__
    self._open_handles(src, kwds)
  File "/tmp/pemt/lib/python3.8/site-packages/pandas/io/parsers/base_parser.py", line 222, in _open_handles
    self.handles = get_handle(
  File "/tmp/pemt/lib/python3.8/site-packages/pandas/io/common.py", line 702, in get_handle
    handle = open(
FileNotFoundError: [Errno 2] No such file or directory: '/tmp/pemt/lib/python3.8/site-packages/pemt/../../data/mapper/chembl_uniprot_mapping.txt'

Desktop (please complete the following information):

  • OS: Ubuntu 22.04

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.