Giter Club home page Giter Club logo

auto-qchem's People

Contributors

b-shields avatar beef-broccoli avatar winstongee avatar yujikaiya avatar zuranski avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

auto-qchem's Issues

Unable to connect to the server

To whom it may concerns,

Hi! I'm a master student from Imperial College London doing a research project involving auto-qchem. I have installed the autoqchem environment to my laptop and so on but unable to connect to your server using 'sm=slurm_manager(user='zuranski', host='della.princeton.edu').' Do your still allow external users? Do I need to do something else to connect?
This is the link of the error video:
https://drive.google.com/file/d/1wgDmkRbFidGKhxFOaYcYcaBwx_IvjG6g/view?usp=share_link

Sincerely,
Chenlin Cai
01708103

Known bug for v1.2.6

bad port number for db connection; change to int.
Did not fix because the transition to Python 3.8 and auto-qchem 1.3.1, but in case older versions are needed this will need to be addressed at helper_classes.py

Slurm manager timeout error

When running the code of the Tutorial_creating_descriptor_sets.ipynb notebook in jupyter-notebooks, the "ServerSelectionTimeoutError" is raised.

I apologize in advance, I am a chemist with little programming experience, so if more detail is needed, please let me know.

dependency issue that involves python 3.7, openbabel 2.4 and numpy 1.21

vulnerability issue with numpy 1.21, 1.22 recommended, which requires python >=3.8

however, openbabel 2.4 seems to be more stable than openbabel 3.x. openbabel 2.4 appears to work only with python <= 3.7

Set python to >= 3.7 for now. In the future, might update to python >=3.8, which requires the use of openbabel 3.x, which needs more testing on stability.

autoqchem.openbabel_functions

Hi,

I am trying to run Fetch log files but I get the following error message:

NameError Traceback (most recent call last)
in
1 # add fs_name to the table
----> 2 mol = input_to_OBMol(can, "string", "smi")
3 mol_df['file_base_name'] = mol_df['can'].map(mol_fs_name)

NameError: name 'input_to_OBMol' is not defined

I am checking in your Github but I can not find the file openbabel_functions. How can I fix this?

Thanks

500 Internal Server Error on Database Website

When visiting https://autoqchem.org/ the navbar is displayed but no content on the site appears.

Checking the console log shows this:

500 Internal Server ErrorInternal Server Error

The server encountered an internal error and was unable to complete your request. Either the server is overloaded or there is an error in the application.

message: "Callback error updating page-content.children"

Abnormal molecule structure

I installed auto-qchem according to installation instructions (Windows), and run example notebook "framework_functionality_test.ipynb", but generated chemical structure is highly distorted.

image

Is it common error when using Openbabel for molecular conformation generaion??
(I do not have experience using Openbabel for such purpose.)

How can I fix it?? (Installing different version of OpenBabel or using Linux instead of Windows?)

Here is my environment.

  • openbabel v2.4.1
  • Python 3.7

How to compute dft on computing cluster?

When I run sm.submit_jobs() in Tutorial_creating_descriptor_sets.ipynb file

An error like the message below occurs.

**UnexpectedExit: Encountered a bad command exit code!

Command: 'cd /scratch/a1422a01/gaussian && sbatch /scratch/a1422a01/gaussian/C15H24N4O6S2_e5f2_conf_0.sh'

Exit code: 1

Stdout:

Stderr:

sbatch: error: Batch job submission failed: Unspecified error**

As a result of checking the computing cluster,

Two files were created.

C15H24N4O6S2_e5f2_conf_0.gif
C15H24N4O6S2_e5f2_conf_0.sh

As a result of checking the contents of the โ€˜C15H24N4O6S2_e5f2_conf_0.shโ€™ file

A file was created with the following contents.

#!/bin/bash
#SBATCH -N 1
#SBATCH --ntasks-per-node=8
#SBATCH -t 23:59:00
#SBATCH --constraint="skylake"

input=C15H24N4O6S2_e5f2_conf_0

#run the code
cd /scratch/a1422a01/gaussian
g16 ${input}.gjf

I want the contents of the C15H24N4O6S2_e5f2_conf_0.sh file to be as follows.

#!/bin/sh
#SBATCH -J test
#SBATCH -p ivy_V100_2 # Neurons use tesla_node partition.
#SBATCH -N 1
#SBATCH -n 20
#SBATCH -o gautest.o%j # Define the filename of the standard output
#SBATCH -e gautest.e%j # Define filename for standard error
#SBATCH -t 12:00:00
#SBATCH --gres=gpu:2
#SBATCH --comment gaussian
module load gaussian/g16

input=C15H24N4O6S2_e5f2_conf_0

#run the code
cd /scratch/a1422a01/gaussian
g16 ${input}.gjf

I would like some help on how to modify the code in slurm_manager.py .

I tried modifying the latter part of the slurm_manager.py file, but didn't get any good results.

I need your help.

Error caused by capital letter (Density/density))

when using get_descriptors(), the error below ocurred.

--> 258 string = re.search("Population.?SCF density.?(\sAlph.?)\n\sCondensed", text, re.DOTALL).group(1)
259 if self.descriptors['multiplicity'] == 1:
260 energies = [re.findall(f"({float_or_int_regex})", s_part) for s_part in string.split("Alpha virt.", 1)]

AttributeError: 'NoneType' object has no attribute 'group'

In the output report of gaussian in our environment (Windows, G16) , the target text is below.

Population analysis using the SCF Density

So, I changed the code

string = re.search("Population.*?SCF [Dd]ensity.*?(\sAlph.*?)\n\s*Condensed", text, re.DOTALL).group(1)

then, error was fixed.

Gaussian inputs for differents computational parameters have same name

Hi,
When trying to generate molecular descriptors for molecules in different solvents, the input generator generates gaussian inputs with a name that is the inchikey and the conformation index. Thus, when trying to launch computations of the same molecules within different solvents the input files are overwritten which causes a problem in the retrieving and uploading processes. I guess it may be possible to add solvent or DFT level information to the input name to avoid this issue but I am not sure if this would cause a problem in the job_retrieving process.
All my best,
Jules

how to get all features like your previous paper provided

Our lab has Gaussian 16 software. So I have tried to run opt freq TD calculations according to the Gaussian gjf file settings you provided (basically just changing the molecule, the settings remain unchanged).

At present, I have used the gaussian_log_extractor you provided to successfully obtain a semi-finished feature.

I would like to ask, what else needs to be done to convert it to a complete feature dataframe(or csv)?

I want to have the molecule feature like this file:
https://github.com/b-shields/edbo/blob/master/experiments/data/aryl_amination/aryl_halide_dft.csv

My Current Result: A dict like below

{
    'descriptors': {'number_of_atoms': 12, 'charge': 0, 'multiplicity': 1......},
    'atom_descriptors': {'X': [-2.241815, -0.719692, 0.0, 1.460496, 2.......},
    'modes': {'Frequencies': [99.7815, 166.9938, 244.151, 347.9461, 393.1......},
    'mode_vectors': {'mode_number': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,......},
    'transitions': {'ES_transition': [161.05, 154.99, 154.72, 134.54, 132.51, 127.49, 127.4......},
    'labels': ['C', 'C', 'C', 'C', 'N', 'H', 'H', 'H', 'H', 'H', 'H', 'H']
}

Previous paper by your group: Bayesian reaction optimization as a tool for chemical synthesis
Corresponding Github: https://github.com/b-shields/edbo

Thank you for your kind assistance.

DB Descriptors: duplicate molecules results in sum of molecule descriptors not Boltzmann average

When duplicate molecules exist, downloading Boltzmann average descriptors results in the sum of both molecule's descriptors being downloaded as a single molecule.

For example, a search for triethylamine SMILES: CCN(CC)CC results in two molecules:
https://autoqchem.org/?tag=ALL&solvent=ALL&functional=ALL&basis_set=ALL&substructure=&smiles=CCN%28CC%29CC

Downloading the global descriptors of these as Boltzmann average gives a single molecule where the descriptors are the sum of both molecules rather than an averaged value. For example: number_of_atoms is 44, not 22, E is -584.151 the sum of -291.95 and -292.2.

Please feel free to correct me if my usage is wrong or this is expected functionality. Fantastic work by the way!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.