Giter Club home page Giter Club logo

data-solution-automation-engine / team Goto Github PK

View Code? Open in Web Editor NEW
33.0 24.0 7.0 15.87 MB

The Taxonomy for ETL Automation Metadata (TEAM) is a tool for design metadata management geared towards data warehouse automation. It is part of the ecosystem for data warehouse automation, alongside the Virtual Data Warehouse template manager and the generic schema for Data Solution Automation.

Home Page: https://github.com/RoelantVos/TEAM

License: GNU Lesser General Public License v3.0

C# 96.34% TSQL 3.66%
datawarehouseautomation metadata etlgeneration source-target-mappings etl-automation-metadata taxonomy etl datavault20 datavault

team's People

Contributors

dependabot[bot] avatar eru-m avatar kjellix avatar lucianobastian avatar martjanpols avatar roelantvos avatar scott-diprose avatar timo2o1o avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

team's Issues

Can't connect metadata repository to any server

When configuring the metadata repository database connection, it will always return an error. When the method TestConnections() is getting called the connection string is not set with the newly set values but still contains the default values. Is there an UPDATE button missing or something?

Handle exception when using physical model but no tables exist

Using Virtual mode, the metadata activation checks the snapshot of the physical model. But, in Physical mode the tool checks the database / catalog for this. If no tables are present in the database for these lookups it creates an unhandled exception.

Very fast update changes not always captured in JSON

Issue with File.WriteAllText. The datatable correctly captures each change, but the IO command doesn't always trigger a change if changes are made rapidly (within second). Only happens with JSON for this reason.

Need redevelopment to move all JSON file writes to a single method so IO speeds can be throttled / wait states can be added.

Remove Show Error Log button

The Show Error Log button is obsolete, and replaced by the event log.

The following should be done:

  • Remove the Show Error Log button from the feedback screens
  • Ensure that meaningful messages are also visible in the event log

Validation does not work yet for STG and PSA sources/targets

  • Source tables will fail the prefix/suffix check
  • STG or PSA as a target fail the 'target' check as this looks at the Integration Layer

Solution is to split out the check by table type / load vector instead of just 'source' and 'target'. This can be done by extending the method to accept the table type. All source and target columns can be handled this way.

In some cases the metadata is not saved, when the application reports it is.

I found out why in some cases a value isn't saved via the TEAM application. It's basically because upon saving a new key is generated which isn't fed back to memory (in the datagrid). So, when making another change to the same record while staying in the screen will cause the change to be lost.

I did end-to-end tests yesterday and this is all I could find. The fix is to re-bind the data to the grid after saving I think.

Add JSON repository sample files

Currently all samples require a SQL Server back-end. The samples are directly created in the database. Creating samples for the JSON would be helpful as well, as currently this only creates a dummy row.

OMD_HASH_FULL_RECORD is being mapped as an attribute

Hello Roelant,

Happy New Year!
I was re-generating some packages today, and I think something changed in the TEAM Activation.
The column OMD_HASH_FULL_RECORD is being mapped as an ATTRIBUTE, which I think it shouldn’t.

PSA Info statments(Views only) generation issue

Issue: wrong table source name in generated info statement
vedw version : 1.3.0

generated info statement for PROFILER PLAN:

-- PSA Insert Into statement for HSTG_PROFILER_PLAN
-- Generated at 31-1-2018 9:01:37

USE [EDW_100_Staging_Area]
GO

INSERT INTO [EDW_150_History_Area].[dbo].[HSTG_PROFILER_PLAN]
(
[ETL_INSERT_RUN_ID],
[LOAD_DATETIME],
[EVENT_DATETIME],
[RECORD_SOURCE],
[SOURCE_ROW_ID],
[CDC_OPERATION],
[HASH_FULL_RECORD],
[PLAN_CODE],
[PLAN_DESC],
[RENEWAL_PLAN_CODE]
)
SELECT
-1 AS ETL_INSERT_RUN_ID,
[LOAD_DATETIME],
[EVENT_DATETIME],
[RECORD_SOURCE],
[SOURCE_ROW_ID],
[CDC_OPERATION],
[HASH_FULL_RECORD],
[PLAN_CODE],
[PLAN_DESC],
[RENEWAL_PLAN_CODE]
FROM HSTG_PROFILER_PLAN

Must be from STG_PROFILER_PLAN

Add delete context menu in the gridviews

Add delete context menu in the gridviews (table and attribute mappings). Currently there is only an export-to-JSON function. Adding delete helps managing the grid, although the del key works also.

Remove MessageBoxes everywhere

There are still a few leftover MessageBox calls in the code to catch errors, which all should be removed as it's not really good user experience.

Validation of Link Key Order relies on ordinal position

In the validation of the Link Key order the business keys need to be selected, and for this all non-business key attributes need to be removed from the selection.

Currently this is done by using the ordinal position, which is not reliable (or conceptually correct) and only supports the setup where the business keys are last in the order.

Example:

                // Select only the business keys in a link table. 
                // Excluding all non-business key attributes
                workingTable = physicalModelDataTable
                    .Select("TABLE_NAME LIKE '" + FormBase.TeamConfigurationSettings.LinkTablePrefixValue +
                            "_%' AND TABLE_NAME = '" + validationObject.Item2 + "' AND **ORDINAL_POSITION > 4**",
                        "ORDINAL_POSITION ASC").CopyToDataTable();

Issue with meta data save

Using the last version downloaded yesterday, there was a null reference exception with the save of the automation meta data.

An unhandled exception of type 'System.NullReferenceException' occurred in TEAM.exe
Additional information: Object reference not set to an instance of an object.

On code;
DataTable dataTableKeyChanges = ((DataTable)_bindingSourceTableMetadata.DataSource).GetChanges();

Change output path in settings is not saved

Issue in saving the output path in settings:
Version 1.3.0

In settings one can change the output path.
after change, save, exit and reopening the program the default path is back again:

C:\Program Files (x86)\Virtual Enterprise Data Warehouse\Output\

The change output path should be saved.

Use of ORDINAL POSITION in determining Hub Keys in Links

Some work to do to remove the ORDINAL POSITION for the metadata activation, as well as making this work against the reverse-engineered metadata as well. Current fix only for default sample setup. This applies to the activation of the metadata into the MD_HUB_LINK_XREF table.

Physical Model Server Name is LinkedServer Option

This option is misleading, because the user might enter the same information in both and doesn't know that the needs to create a LinkedServer in his SQL Server.
Maybe a quick check if the Servers are the same it will simply ignore, or maybe a TipTool?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.