The current standard attempts to address the needs for indicator 6 (Mechanism for Extr

We have expert inputs from <a href="https://www.linkedin.com/in/lea-gimpel/" rel="nofo

Status & Next Steps: Engage an AI expert to take a look at

Prioritization: should come after <a class="issue-link js-issue-link" data-error-text=

We will resolve this on <a class="issue-link js-issue-link" data-error-text="Failed to

[Proposal] Retrofitting the standard for an open AI audience about dpg-standard HOT 4 CLOSED

dpgalliance commented on June 18, 2024

[Proposal] Retrofitting the standard for an open AI audience

from dpg-standard.

Comments (4)

prajectory commented on June 18, 2024 1

We have expert inputs from Lea Gimpel on how the DPG standard can be better retrofitted for Open AI digital solutions.

Reproducibility:
Means that all training details are given: Needless to say, this includes a description of the data, code documentation and tech stack documentation (these can follow the already existing standards and criteria). We think it should also include specific model-training documentation. For instance, what kind of CPU/GPU, OS and platforms (cloud provider, google collab, etc.) were used for the training and listing all training parameters. Ideally, a tech-savvy person should be able to re-train the model with identical evaluation scores, given all information, data and computing power.

A nice way to think about transparent documentation of AI models is also Google’s idea of “model cards” (see here and here the corresponding article; in addition, Timnit Gebru also suggested “datasheets for datasets”, which could be an interesting tool for the discussion around open data as DPG)

Accessibility:
This is quite critical, in our opinion. The model should be easily accessible and usable. A good solution may be the provision of an API. You can send your request and retrieve the prediction outcome through a stable connection in real-time. Here platforms such as Hugging Face are also quite handy since they allow one-liner-code access and usage of trained ML models. (btw they just received 2b$ funding aiming to build the GitHub of Machine Learning)

Interpretability:
We think it is essential that the prediction outcomes of the models are interpretable and understandable, at least through proper documentation and explanation. For traditional ML models, predictions should be accompanied by some sort of intuitive confidence scores. It is a difference if the models predict with 99% confidence or 51% confidence. If such thresholds are set, they need to be clearly stated and explained.
Generally, it should be clear what problem the AI model aims to solve and what realistic outcomes/performance the user can expect.

Independency:
This adds to the point of accessibility. We may also make the model accessible through some sort of package as a collection of modules that can be downloaded and used in a programming language such as python. (pip install our_packaged_model) or as a sub-module in an existing package (this is how it would work if pushed to the Hugging Face model hub). The point of independency is that the dependencies need to follow the same standards as the end-product, but that’s also already outlined in the standard.

from dpg-standard.

Lucyeoh commented on June 18, 2024

Status & Next Steps:

Engage an AI expert to take a look at the current standard and this proposal (specific to non PII data & data privacy).

from dpg-standard.

Lucyeoh commented on June 18, 2024

Prioritization: should come after #59

from dpg-standard.

prajectory commented on June 18, 2024

We will resolve this on #130 latest on the topic of AI as a part of the standard.

from dpg-standard.

[Proposal] Retrofitting the standard for an open AI audience about dpg-standard HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent