fivetran / dbt_linkedin_pages_source
Fivetran's LinkedIn Pages source dbt package
Home Page: https://fivetran.github.io/dbt_linkedin_pages_source/
This package's README is not currently in line with our existing standards. Ideally, the README in this repo would be updated to follow our current documentation guidelines.
No response
During these updates, we should also add the identifier variables and ensure the integration_tests/dbt_project.yml references the identifiers as well.
ugc_post_share_content_media was deprecated internally in a recent release of LinkedIn Pages. This is causing errors for customers attempting to use Transformations on LinkedIn Pages.
Examining the new ERD, it's unclear if the staging models built on top of this source can still be utilized. We will likely need to scope whether this model can be modified or needs to be removed.
No response
Transformations should run.
standard to transformations: https://github.com/fivetran/dbt_linkedin_pages/blob/main/dbt_project.yml
packages:
other (mention it in "Additional Context")
What version transformations is running on.
https://fivetran.slack.com/archives/C02919TN9AT/p1695850062879859
The existing logic in the stg_linkedin_pages__organization_ugc_post model only parses the ugc_post_id correctly for share type posts. As a result, downstream models have broken joins for ugcpost type posts. I noticed this when the organization_name came up as null for these types of posts.
The existing logic looks like:
replace(ugc_post_id, 'urn:li:share:', '') as ugc_post_id
Example output in stg_linkedin_pages__organization_ugc_post:
Example output in linkedin_pages__posts:
The logic is correct in stg_linkedin_pages__ugc_post_history since it considers both post types. As a result, the ugc_post_id looks right in linkedin_pages__posts, but the join with organization_ugc_post breaks due to mismatching ugc_post_id.
No response
Proposed logic would look like:
split_part(ugc_post_id, ':', -1) as ugc_post_id
Example output of stg_linkedin_pages__organization_ugc_post with proposed fix:
Example output of linkedin_pages__posts with proposed fix:
This will make the logic more dynamic and effectively parse the ugc_post_id from both share and ugcpost types.
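To illustrate the difference between the two expressions, here is a minimal Python sketch that mimics their behavior. It is not the package's code: the function names and the sample URN values are hypothetical, and the urn:li:ugcPost: prefix is an assumption about how ugcpost type IDs are formatted.

```python
def strip_share_prefix(urn: str) -> str:
    """Mimics replace(ugc_post_id, 'urn:li:share:', '')."""
    return urn.replace("urn:li:share:", "")


def last_urn_segment(urn: str) -> str:
    """Mimics split_part(ugc_post_id, ':', -1) by taking the last ':'-delimited part."""
    return urn.split(":")[-1]


# Hypothetical sample IDs, one per post type (values are made up for illustration).
sample_ids = ["urn:li:share:1234567890", "urn:li:ugcPost:9876543210"]

for urn in sample_ids:
    print(f"{urn} | existing: {strip_share_prefix(urn)} | proposed: {last_urn_segment(urn)}")
```

Under these assumptions, the existing logic leaves the ugcpost type ID unchanged (prefix included), while the proposed logic reduces both IDs to the trailing numeric segment, which is what the join needs.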
vars:
  # Social Media Reporting
  social_media_rollup__linkedin_enabled: True
  # LinkedIn Organic
  linkedin_pages_database: raw
  linkedin_pages_schema: fivetran_linkedin_organic
models:
  # disable both linkedin company pages models if not using linkedin company pages
  linkedin_pages:
    enabled: true
  linkedin_pages_source:
    enabled: true
- package: fivetran/social_media_reporting
  version: [">=0.1.0", "<0.2.0"]
snowflake
installed version: 1.0.3
latest version: 1.0.3
Up to date!
Plugins:
- snowflake: 1.0.0 - Up to date!
No response
When running CircleCI tests, dbt_linkedin_pages_source fails with the following error:
Database Error in model int_linkedin_pages__latest_post_history (models/intermediate/int_linkedin_pages__latest_post_history.sql)
22:25:13 constant expressions are not supported in partition by clauses
22:25:13 compiled SQL at target/run/linkedin_pages/models/intermediate/int_linkedin_pages__latest_post_history.sql
Fix by updating the model configuration in dbt_project.yml to materialize the model as a table.
No response
Should not throw errors
n/a
n/a
redshift
~=1.0.0
No response