ccao-data / ccao Goto Github PK
View Code? Open in Web Editor NEWR package of functions and datasets used throughout the CCAO assessment pipeline
Home Page: https://ccao-data.github.io/ccao/
License: GNU Affero General Public License v3.0
R package of functions and datasets used throughout the CCAO assessment pipeline
Home Page: https://ccao-data.github.io/ccao/
License: GNU Affero General Public License v3.0
We forgot to add sv_is_outlier
to the package after completing the sales validation pipeline.
The data team has added new employees, the ccao_ids.R
should be updated accordingly.
The vars_dict
dataset in this package is used heavily by ccao-data/model-res-avm and ccao-data/model-condo-avm. However, iterating with the model requires us to constantly update this dictionary (and thus the package). Additionally, keeping the dictionary up-to-date is tedious, manual process.
I propose we automate the creation and maintenance of this dictionary and move it out of this package on to S3. We can use Glue and other AWS APIs to construct it on a schedule, then store it in a public bucket.
We need to add:
meta_sale_count_past_n_years
char_class
prox_airport_dnl_total
prox_nearest_secondary_road_dist_ft
prox_nearest_university_dist_ft
prox_nearest_vacant_land_dist_ft
ccao_is_active_exe_homeowner
ccao_is_corner_lot
ccao_n_years_exe_homeowner
Cut a new release of the package containing the following changes:
meta_card_protation_rate
and loc_tax_municipality_name
to vars_dict
(#4)vars_dict
to focus exclusively on renaming/recoding (#6)styler::style_pkg()
in the consolelintr::lint_package()
in the consoledevtools::document()
in the consoledevtools::test()
in the consolepkgdown::build_site()
in the consoleREADME.Rmd
from within RStudioDESCRIPTION
file appropriately, following the schema laid out in the READMEThe modeling pipeline variable needs to be in the package or ingest will throw an error.
Forgot to update this when I updated vars_dict.csv ๐คฆโโ๏ธ
I incorrectly assumed the ccao
db was in our dag and that is_corner_lot
would exist as a column name because of that. Since the ccao db is not in the dag, this column only appears as ccao_is_corner_lot
and needs to updated in vars_dict.
New columns have been added to the condo modelling view since the last time the pipeline was run. They need to be added to ccao
's var_dict
.
This column name was changed and will throw an error in the modeling pipeline as a result.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.