Comments (8)
sigh.... it looks like this problem is coming straight from the dbf sqlite db
@zaneselvans any ideas on checking the source of this?
from pudl.
Ooooh, your plot is so pretty.
from pudl.
You could use dbfread
directly to try and pull this table out of say the 2010 archive of the FERC Form 1 inputs and see what you find. The f1_bal_sheet_cr
table corresponds to F1_11.DBF
and looking at the 2010 data, it seems to contain data. The file is about 2.7MB.
from pudl.
hahah 🎨
import mplcyberpunk
plt.style.use("cyberpunk")
and thanks i'll take a peek
from pudl.
I think I see what's happening:
import dbfread
f1_bal_sheet_cr_dbf = dbfread.DBF(
"/tmp/UPLOADERS/FORM1/working/F1_11.DBF",
)
f1_bal_sheet_cr_dbf.load()
f1_bal_sheet_cr_df = pd.DataFrame(f1_bal_sheet_cr_dbf.records)
f1_bal_sheet_cr_df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 23736 entries, 0 to 23735
Data columns (total 16 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 RESPONDENT 23736 non-null int64
1 REPORT_YEA 23736 non-null int64
2 SPPLMNT_NU 23736 non-null int64
3 ROW_NUMBER 23736 non-null int64
4 ROW_SEQ 23736 non-null int64
5 ROW_PRVLG 23736 non-null object
6 BEGIN_YR_B 0 non-null object
7 END_YR_BAL 0 non-null object
8 BEGIN_YR_2 23736 non-null int64
9 END_YR_BA2 23736 non-null int64
10 REPORT_PRD 23736 non-null int64
11 END_QTR_BA 23736 non-null int64
12 PRI_YR_Q4_ 23736 non-null int64
13 END_QTR_B2 23736 non-null int64
14 PRI_YR_Q42 23736 non-null int64
15 _NullFlags 23736 non-null object
dtypes: int64(12), object(4)
memory usage: 2.9+ MB
The starting and ending year balances are indeed NULL
. But there's quarterly data, and there's "end of Q4 last year" data.
f1_bal_sheet_cr_df.sample(20)
RESPONDENT | REPORT_YEA | SPPLMNT_NU | ROW_NUMBER | ROW_SEQ | ROW_PRVLG | BEGIN_YR_B | END_YR_BAL | BEGIN_YR_2 | END_YR_BA2 | REPORT_PRD | END_QTR_BA | PRI_YR_Q4_ | END_QTR_B2 | PRI_YR_Q42 | _NullFlags | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
5407 | 185 | 2010 | 0 | 29 | 29 | 0 | 0 | 6 | 2379398 | 2265830 | 0 | 0 | ||||
20086 | 266 | 2010 | 0 | 28 | 28 | 0 | 0 | 12 | 2611319 | 3610632 | 0 | 0 | ||||
16988 | 11 | 2010 | 0 | 38 | 38 | 0 | 0 | 9 | 17889530 | 14992020 | 0 | 0 | ||||
11405 | 185 | 2010 | 0 | 35 | 35 | 0 | 0 | 9 | 2322518 | 2265830 | 0 | 0 | ||||
1718 | 149 | 2010 | 0 | 16 | 16 | 0 | 0 | 3 | 4339248092 | 4302112149 | 0 | 0 | ||||
22139 | 446 | 2010 | 0 | 48 | 48 | 0 | 0 | 12 | 6282862 | 0 | 0 | 0 | ||||
15758 | 182 | 2010 | 0 | 49 | 49 | 0 | 0 | 9 | 54886 | 51738 | 0 | 0 | ||||
158 | 40 | 2010 | 0 | 63 | 63 | 0 | 0 | 3 | 3068685 | 3130431 | 0 | 0 | ||||
11050 | 111 | 2010 | 0 | 57 | 57 | 0 | 0 | 6 | 3058544 | 3398384 | 0 | 0 | ||||
12784 | 134 | 2010 | 0 | 11 | 11 | 0 | 0 | 9 | 2666784903 | 2225701346 | 0 | 0 |
from pudl.
yea there is nothing in the ending balance in several years I just checked. There a far amount of END_QTR_BA
or end_qtr_bal.
sigh it looks like they reported all of the money as end of quarter balances instead of end of year balances for these years.
from pudl.
I think that means we could integrate the end of quarter data as annual data
from pudl.
Yeah, I think if you select ONLY the report_prd==12
records.
from pudl.
Related Issues (20)
- Issues setting up PUDL dev environment HOT 6
- core_eia860m__changelog_generators contains NULL valid_until_date values
- Run Q1 2024 Updates for EIA923, EIA860m, CEMS, and EIA Bulk Elec
- Q1-2024 Updates to EIA860m
- Q1-2024 Updates to EIA923
- Q1-2024 Updates to EIA Bulk Electricity
- Q1-2024 Updates to CEMS
- Update Slack upload API for PUDL Deployment Bot
- Duplicate generators in `analysis.allocate_gen_fuel` HOT 1
- Investigate lack of monthly year-to-date data in out_eia923__monthly_generation_fuel_by_generator table
- PUDL Release v2024.5.0
- Create resources for non-urgent AEO Table 54 data
- Update DOI automatically for well-behaved datasets HOT 2
- Publish detailed accounting tables in DB
- Builds initiated via workflow dispatch on main deploy inappropriately
- Small tweaks to FERC rate base categories
- `core_eiaaeo__yearly_projected_fuel_cost_in_electric_sector_by_type` HOT 1
- Update XBRL extraction to handle multiple taxonomies per year
- Add 2019 data to EIA861 Short Form table HOT 2
- Nightly Build Failure 2024-05-28
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pudl.