Giter Club home page Giter Club logo

bioproviders's People

Contributors

alexkenna avatar n7581769 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

bioproviders's Issues

Add script that updates GenBank file lists

At present, the type provider relies on a list of FTP locations for the different species and assemblies on GenBank. These will become outdated over time as GenBank updates, so it would be useful to include a script in the repository that allows us to update the lists every so often (and then publish a new version of the package).

Alex has noted that the files generated are based off a file on the GenBank FTP server itself; in future we may be able to look into using this directly rather than keeping our own lists. This could help us avoid the problems in #6.

NuGet package data file location

Previously, when building the package for NuGet, the files with the lists of locations on the GenBank FTP server would be placed in the same directory as the DLLs, rather than in the data subfolder. This meant that they could not be found when using the type provider from NuGet.

At present, I have moved the files out of the subfolder to attempt to avoid this problem (though my attempt to use the package locally still did not work, for other reasons). There are a few different ways we could tackle this problem:

  • Find a way to properly put the files in the correct subfolder while letting the type provider still work as a project reference, if that's still desired;
  • Make the data files and access code a separate package, as the files can be more easily put into a subfolder this way; or
  • Remove the dependency on these data files entirely, as mentioned in #5.

docs

Looks good - but time to migrate the docs to the main project README.

Problems with documentation generated from fsdocs

There are a couple of issues with the current generation of documentation using fsdocs. I've spent some time trying to figure these out, but have not had any success yet.

  • When running the build script, the code in GenBankExample.fsx does not run. The output of the build script mentions that the DesignTime assembly is already loaded and without being able to load the BioProviders package, the code naturally doesn't work.
  • I cannot get the License, Release Notes and Source Repository links to have the correct URL. I've tried including a Directory.Build.props like the original FSharp.Formatting in the solution (which isn't committed yet) with the required XML properties but that doesn't seem to work.

Add RefSeq type provider

At present, the type provider only supports GenBank, though there are routes to use RefSeq data that lead to "unsupported" message outputs. Once we are happy with the state of the GenBank type provider, we may look into creating a RefSeq type provider too.

Metadata in GenBankFlatFile

According to Alex, the Metadata in the GenBankFlatFile type does not provide all the available metadata fields. We need to check whether there are any useful ones (such as Locus) that are not included yet and add them to the type.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.