retcl's Issues

Are there any performance statistics on large datasets?

Great work!

In addition to the accuracy on USPTO-50K reported in the paper, I am very interested in how this method performs on larger datasets such as USPTO-MIT and USPTO-FULL. Have you run any experiments on them?

Questions about the candidate set and the function "check_molecule_dict()"

According to the paper, the candidate set should consist of all reactants in the entire USPTO database.

However, in the function "check_molecule_dict()" in https://github.com/hankook/RetCL/blob/main/datasets/__init__.py, I found something different.

def check_molecule_dict(mol_dict, datasets):
    for split in ['train', 'val', 'test']:
        for rxn in datasets[split]:
            assert rxn.product in mol_dict
            for reactant in rxn.reactants:
                assert reactant in mol_dict

This function seems to be quite important: the training and evaluation scripts cannot run unless this check passes.
According to the code, should the products also be included in the candidate set?

It would be great if you could make the code used to build the candidate set public.
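To make the question concrete, here is a minimal sketch of what the check appears to require. It is not the authors' code: build_molecule_dict() and the Reaction tuple are hypothetical names introduced only for illustration, and it assumes that mol_dict is a mapping keyed by SMILES strings and that each reaction exposes "product" and "reactants" attributes, as the quoted function suggests. Under those assumptions, the check only passes when every product, as well as every reactant, is a key of mol_dict.

```python
from collections import namedtuple

# Hypothetical stand-in for the repository's reaction objects. Only the two
# attributes used by check_molecule_dict() are assumed to exist.
Reaction = namedtuple("Reaction", ["product", "reactants"])


def build_molecule_dict(datasets):
    """Hypothetical helper: index every product and reactant SMILES.

    This is an assumption about how a dictionary satisfying the check
    could be built, not the repository's actual procedure.
    """
    mol_dict = {}
    for split in ['train', 'val', 'test']:
        for rxn in datasets[split]:
            for smiles in (rxn.product, *rxn.reactants):
                # Assign the next free index only to unseen molecules.
                mol_dict.setdefault(smiles, len(mol_dict))
    return mol_dict


def check_molecule_dict(mol_dict, datasets):
    # Same logic as the function quoted above.
    for split in ['train', 'val', 'test']:
        for rxn in datasets[split]:
            assert rxn.product in mol_dict
            for reactant in rxn.reactants:
                assert reactant in mol_dict


# Toy data, made up for this example only.
datasets = {
    'train': [Reaction('CC(=O)OCC', ['CC(=O)O', 'CCO'])],
    'val': [Reaction('CC(=O)NC', ['CC(=O)O', 'CN'])],
    'test': [],
}

mol_dict = build_molecule_dict(datasets)
check_molecule_dict(mol_dict, datasets)  # passes: products are included

# A dictionary built from reactants alone fails the same check.
reactants_only = {s: i for i, s in enumerate(['CC(=O)O', 'CCO', 'CN'])}
try:
    check_molecule_dict(reactants_only, datasets)
    reactants_only_passes = True
except AssertionError:
    reactants_only_passes = False
```

If the candidate set used for retrieval is meant to contain reactants only, then mol_dict may simply be a larger molecule vocabulary rather than the candidate set itself; the sketch cannot tell these two readings apart.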
