Comments (6)
Please note that py-xbrl only parses XBRL Documents. The files you provided are just regular HTML files and do not follow the XBRL Standard.
You can check if a file follows the iXBRL Standard either manually by looking at the Index file of the submission or programmatically i.e via the Structured Disclosure RSS Feeds.
The reason why the parser is crashing is that the SEC appends a file header to these documents, which can't be processed by the XML Parsing libary py-xbrl uses.
from py-xbrl.
However, perhaps a better error message could be issued, indicating that you should check whether the given file is really an XBRL file 🤔.
from py-xbrl.
SEC told me it wasn't mandatory for small companies to include inline xbrl in the past, but it will be from June 15, 2021.
Which means this lib won't cover every historical cases, and an html greper is also required.
Yeah a simple error msg would be nice that this file doesn't have inline xbrl. Then we can fall back to html greper.
from py-xbrl.
Not exactly. Since 2009 it is mandatory for every company (with assets over 10 Mio USD) to publish the 10-K and 10-Q in XBRL.
In 2019 the SEC began to slowly transition from regular XBRL to inline XBRL (iXBRL).
So yes, there are currently some small companies that still don't file inlineXBRL files (html), but they usually append a seperate XBRL Instance Document (xml) to their submission.
This libary can parse both XBRL and inline XBRL documents.
So instead of the original html filing document:
https://www.sec.gov/Archives/edgar/data/1641631/000149315221014050/form10-k.htm
use the XBRL instance document that was submitted with the filing:
https://www.sec.gov/Archives/edgar/data/1641631/000149315221014050/xair-20210331.xml
from py-xbrl.
What about this 10-K (2021-05-26)?
https://www.sec.gov/Archives/edgar/data/0000017313/000001731321000075/0000017313-21-000075-index.htm
It doesn't have any xml among filling files, neither inline xbrl, only plain html.
from py-xbrl.
Correct, this submission does not contain any XBRL tagging.
Maybe there is an exception rule for this company that they do not have to file in iXBRL.
You can find the original legislation regarding which company has to file in XBRL in RIN 3235-AJ71 page 43 and following.
from py-xbrl.
Related Issues (20)
- Need path or reference to source file of a Linkbase HOT 2
- Standardised Financial Data HOT 5
- Equals method for all fact classes HOT 1
- Solution to frequently missing taxonomy specifications in UK submissions HOT 21
- Parsing Failures for Empty Fact Values and 'nil' Text in XBRL Documents HOT 2
- Support a New Taxonomy? HOT 1
- Parsing filings with empty imports HOT 3
- "Explicit Member"s missing HOT 3
- Add support for Datetime in context duration. HOT 3
- KeyError: 'Unit_sqft' HOT 2
- Add support for the ixt-sec transformations. HOT 1
- unresolved schemas HOT 12
- Not well-formed (invalid token) error for ixblr. HOT 11
- parse_ixbrl should add encoding argument HOT 2
- Be nicer to submissions that do not follow the XBRL standard 100% HOT 6
- New 2022 taxonomies HOT 4
- Bug: instance.json('my-file.json') HOT 1
- Unclear +/- sign of some facts HOT 17
- Space in url creates issues when requesting a taxonomy
- Potential arg bug in transformations __init__ HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from py-xbrl.