Comments (4)
Hi,
The SNOMED preprocessing procedure expects that you pass it one of the following:
- The folder of the SNOMED release
  - E.g. /path/to/SnomedCT_InternationalRF2_PRODUCTION_20230731T120000Z
- The folder that contains multiple SNOMED releases
  - E.g. the parent folder
  - In this case, the multiple releases will be combined
  - All folders that contain "SnomedCT" will be considered
If this is a UK extension or a UK Drug extension, you would also need to specify uk_ext=True or uk_drug_ext=True in the constructor, respectively.
Other flavours of SNOMED are not currently fully supported. The file naming conventions differ somewhat between flavours, which is why we need to know which one you're attempting to use.
The other assumption is that the files in the release folders follow the standard folder structure (e.g. SnomedCT_InternationalRF2_PRODUCTION_20230731T120000Z/Snapshot/Terminology/sct2_Concept_Snapshot_INT_20230731.txt and similar files exist).
So the error you've received means that the preprocessor was unable to figure out the SNOMED version for one of the releases you're trying to work with. This could be because you didn't specify the correct flavour, because we don't support the flavour you're trying to use, or because the file structure differs from the default. It could also be that the parent folder had a folder that contained "SnomedCT" within it that didn't correspond to a SNOMED release.
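A rough way to check for the last two causes before running the preprocessor is sketched below. It is not MedCAT code: the function names are hypothetical, and it assumes that "SnomedCT" is matched against the folder name and that a usable release has a concept snapshot file under Snapshot/Terminology, as in the example path above.

```python
from pathlib import Path


def find_release_dirs(path):
    """Return the folder(s) this sketch treats as SNOMED releases.

    Assumption: a folder counts as a release if its name contains "SnomedCT".
    """
    root = Path(path)
    if "SnomedCT" in root.name:
        # A single release folder was passed in directly.
        return [root]
    # Otherwise treat `path` as a parent folder holding several releases.
    return sorted(p for p in root.iterdir() if p.is_dir() and "SnomedCT" in p.name)


def missing_concept_snapshots(path, pattern="sct2_Concept_Snapshot_*.txt"):
    """List release folders without a concept snapshot in Snapshot/Terminology."""
    missing = []
    for release in find_release_dirs(path):
        terminology = release / "Snapshot" / "Terminology"
        if not terminology.is_dir() or not any(terminology.glob(pattern)):
            missing.append(release)
    return missing
```

Any folder returned by missing_concept_snapshots is a candidate for the error: either it is not a real release, or its files do not follow the standard naming.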
Which SNOMED release are you trying to use exactly?
from medcat.
Hi,
I'm using the most recent release (38.0) of the SNOMED CT UK Monolith Edition, RF2: Snapshot from NHS TRUD.
I've followed the instructions here: https://htmlpreview.github.io/?https://github.com/CogStack/MedCATtutorials/blob/main/notebooks/specialised/Preprocessing_SNOMED_CT.html
I skipped the "Load the data" section as the SNOMED folder isn't zipped.
Load the data
Assign a path to the zipped SNOMED CT release download. (skip this step if the folder is not zipped)
snomed_path = "SnomedCT_InternationalRF2_PRODUCTION_20210131T120000Z.zip" # Enter your zipped Snomed folder here
!unzip snomed_path
Preprocess the release for MedCAT
Initialise
snomed_filename = "SnomedCT_InternationalRF2_PRODUCTION_20210131T120000Z" # The unzipped SNOMED CT folder
snomed = Snomed(snomed_filename)
Skip this step if your version of snomed is not the UK extension released >2021.
Note: this step will only work with MedCAT v1.2.7+
snomed.uk_ext = True
Unfortunately we don't currently fully support the Monolith edition. Again, its file naming conventions are different from the other releases.
I would recommend using the UK Clinical Edition and/or the Drug extension.
With that said, if you do wish to use the Monolith edition, the changes to the concept snapshot should be doable on your side.
I.e. you should be able to change the setting to concept_snapshot = "sct2_Concept_MONOSnapshot_GB" in the code. You could copy the code from the repo and change this yourself.
EDIT: The above has not been tested. There may be other changes to the release as well.
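To see which naming convention a given release actually uses before editing anything, something like the following sketch could help. It is not part of MedCAT: the function and dictionary names are hypothetical, the Monolith prefix is the untested one mentioned above, and it assumes the concept file sits under Snapshot/Terminology.

```python
from pathlib import Path

# Concept snapshot prefixes mentioned in this thread.
# The Monolith prefix is untested, as noted above.
CONCEPT_PREFIXES = {
    "standard": "sct2_Concept_Snapshot",
    "monolith": "sct2_Concept_MONOSnapshot_GB",
}


def detect_concept_prefix(release_dir):
    """Return (label, prefix) for the first prefix with a matching file, else None."""
    terminology = Path(release_dir) / "Snapshot" / "Terminology"
    if not terminology.is_dir():
        return None
    for label, prefix in CONCEPT_PREFIXES.items():
        # Match any .txt file in the folder whose name starts with the prefix.
        if any(terminology.glob(prefix + "*.txt")):
            return label, prefix
    return None
```

If this returns None for your release, the concept file is named differently again, and other files in the release may differ as well.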
PS:
Supporting other editions, and making it easier for others to use any edition, is on our radar. Unfortunately, there is no ETA for this on our side.
Noted - I'll try with the clinical edition.
Thanks for your help :)