boavizta / environmental-footprint-data
Boavizta.org Data repository
Hi,
First of all, thank you for this very useful data !
Some of the data reported in the csv file do not correspond to the data in the sources. For example, the HP ProLiant DL360 Gen10 server reports a gwp_total of 1710 kgeqCO2 (with 77% of this caused by the server usage). However, the corresponding source document (https://assets.ext.hpe.com/is/content/hpedam/a50002430enw) reports 6270 kgeqCO2 (with 87% caused by the server usage).
Some other HP servers have the same problem (e.g. "ProLiant ML30 Gen10 server", "ProLiant DL160 Gen10 server"). HPE probably updated their datasheets.
I think you are missing some values for Apple products, there are some in this report:
https://www.apple.com/environment/pdf/products/desktops/Mac_Pro_PER_Dec2019.pdf
3.5GHz (8-core) processor, Radeon Pro 580X, 32GB memory, and 256GB storage: 2765 kg CO2e
2.5GHz (28-core) processor, dual Radeon Pro Vega II Duo with Infinity Fabric Link, 1.5TB memory, Afterburner card, and 4TB storage: 6994 kg CO2e
Amazing work!
Non-regression testing should be implemented to automatically check that parser results are the same before and after a commit.
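One possible shape for such a check is sketched below: it compares two CSV outputs (for instance one produced before a commit and one after) and lists the rows that were added, removed or changed. The function names and the choice of `manufacturer` and `name` as the row key are assumptions made for illustration, not the repository's actual test code.

```python
import csv
import io

def load_rows(csv_text, key_fields=("manufacturer", "name")):
    """Index CSV rows by a key so that two parser runs can be compared.

    The key fields are an assumption; any stable identifier would do.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    return {tuple(row[k] for k in key_fields): row for row in reader}

def diff_results(before_text, after_text):
    """Return a list of human-readable differences between two parser outputs."""
    before, after = load_rows(before_text), load_rows(after_text)
    changes = []
    for key in sorted(before.keys() | after.keys()):
        if key not in after:
            changes.append(f"removed: {key}")
        elif key not in before:
            changes.append(f"added: {key}")
        elif before[key] != after[key]:
            # List only the fields whose value differs between the two runs.
            changed = sorted(
                f for f in before[key] if before[key][f] != after[key].get(f)
            )
            changes.append(f"changed: {key} fields={changed}")
    return changes
```

A test could then simply assert that `diff_results(baseline, current)` is empty, and print the list when it is not.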
New multicriteria data is available on https://images.samsung.com/is/content/samsung/assets/latin_en/sustainability/environment/environment-data/2022_Life-Cycle_Assessment_for_HHP_220613.pdf and could be added to the database
The memory attribute is expected to be a float number in GB. This means that 1) the "GB" suffix must be removed, and 2) the parsers have to guarantee that they parsed a number in the right unit, which is not the case yet.
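As a rough illustration of both points, the sketch below converts a memory string to a float in GB and refuses values whose unit it cannot recognise. The helper name `parse_memory_gb`, the list of accepted units and the use of binary multiples (1 TB = 1024 GB) are assumptions, not existing parser behaviour.

```python
import re

# Multipliers to convert a parsed value into GB (binary multiples assumed).
_UNIT_TO_GB = {"MB": 1 / 1024, "GB": 1.0, "TB": 1024.0}
_MEMORY_RE = re.compile(r"^\s*(\d+(?:[.,]\d+)?)\s*(MB|GB|TB)\s*$", re.IGNORECASE)

def parse_memory_gb(text):
    """Return the memory size in GB as a float, or None if no unit is recognised."""
    match = _MEMORY_RE.match(text)
    if match is None:
        # A bare number is rejected on purpose: the unit cannot be guaranteed.
        return None
    value = float(match.group(1).replace(",", "."))
    return value * _UNIT_TO_GB[match.group(2).upper()]
```

Returning `None` rather than guessing keeps rows with an unknown unit visible, so they can be reviewed by hand.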
Panasonic product carbon footprint reports are available here
Hello,
I found 4 duplicates in this database:
I see this repository contains manufacturer data, but what about the data for the multicriteria cloud instance impacts in the API?
Currently the database only focuses on carbon footprint, whereas other impacts such as Abiotic Depletion, Primary Energy, Water and Human Toxicity should be assessed and are available in several Life Cycle Assessments provided by manufacturers.
We already identified the following :
The following IBM PCF could be added to the database
IBM Power E1080 https://www.ibm.com/downloads/cas/VGL0LLMZ
IBM z16 https://www.ibm.com/downloads/cas/KLMA1MPR
IBM LinuxONE Emperor 4 https://www.ibm.com/downloads/cas/2JBPXBMK
If you find any data about Nvidia GPUs, can you please let me know? This is something I'm really interested in!
Thank you :)
We need a dedicated tool to merge multiple .csv files while detecting and merging duplicates.
I've started to implement it through a new static method of DeviceCarbonFootprint:
@staticmethod
def merge(device1: 'DeviceCarbonFootprint', device2: 'DeviceCarbonFootprint',
          conflict: Literal['keep2nd','interactive'] = 'keep2nd', verbose: bool = False) -> 'DeviceCarbonFootprint':
and a merge_csv.py file1 file2 standalone script written on top of the above merge function.
By default, priority is given to device2/file2.
Conflicts are detected only for attributes that are provided for both devices and when they are clearly different. If they are close enough, then merge only prints a warning in verbose mode.
Then, there are two modes to resolve the conflicts:
TODO:
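The conflict detection described in this issue could look roughly like the sketch below. It works on plain dictionaries rather than on DeviceCarbonFootprint, and the function name `merge_devices` and the 5% relative tolerance used for "close enough" are assumptions; the actual implementation may differ.

```python
import math

def merge_devices(device1, device2, rel_tol=0.05, verbose=False):
    """Merge two attribute dicts, giving priority to device2.

    Returns (merged, conflicts), where conflicts lists the attributes that
    are set on both sides and clearly different.
    """
    merged, conflicts = dict(device1), []
    for key, value2 in device2.items():
        value1 = device1.get(key)
        if value2 is None:
            continue  # nothing to take from device2, keep device1's value
        if value1 is not None and value1 != value2:
            numeric = all(isinstance(v, (int, float)) for v in (value1, value2))
            if numeric and math.isclose(value1, value2, rel_tol=rel_tol):
                # Close enough: not a conflict, only reported in verbose mode.
                if verbose:
                    print(f"warning: {key}: {value1} ~ {value2}")
            else:
                conflicts.append(key)
        merged[key] = value2
    return merged, conflicts
```

Returning the conflicts separately leaves the resolution policy (keep the second value, or ask the user) to the caller.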
Hewlett Packard Inc. (HP) and Hewlett Packard Enterprise (HPE) are two separate legal entities, and have been since 2015.
It would therefore be better if the products were correctly designated as such in the list.
At line 451, the correct name for the manufacturer is fujitsu instead of fujistu.
Hello,
Apple published GHG emissions from their LCA for the Apple Vision Pro: https://www.apple.com/environment/pdf/products/vision-pro/Apple_Vision_Pro_PER_Feb2024.pdf
335 kg CO2e for the whole life cycle.
Would it be interesting to add this to the dataset?
Alex
The objective is to create a monitoring tool to detect publication of unknown product environmental footprint reports.
We could regularly search for specific keywords and alert when new reports are found.
The search could be built with a combination of:
Currently, the separator used is ; whereas using , gives a very nice preview on GitHub.
It is already done this way in boavizta-data-us.csv.
Please use , in boavizta-data-fr.csv as well.
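A minimal sketch of the conversion, using only the standard csv module, is given below. The function name `semicolon_to_comma` is hypothetical; going through the csv reader and writer (rather than a plain text replacement) means that fields which themselves contain a comma get quoted.

```python
import csv
import io

def semicolon_to_comma(text):
    """Re-encode ';'-separated CSV text as ','-separated, quoting where needed."""
    rows = csv.reader(io.StringIO(text), delimiter=";")
    out = io.StringIO()
    csv.writer(out, delimiter=",", lineterminator="\n").writerows(rows)
    return out.getvalue()
```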
All Philips PCF are available here: https://www.philips.fr/c-w/search.html#q=Philips%20Product%20Carbon%20Footprint&cq=%40ps_contenttype_key%3C%3Eproduct
The Lenovo parser should be a good starting point to create the Philips one.
Most PCF files provide breakdown details for the manufacturing part. They are, however, not always fully consistent in their partitioning. Here is the list I ended up with on ecodiag's side:
This long list is conservative, but that's a lot! So maybe some components could be merged together?
For instance, when the PSU is combined with the chassis, maybe we could just put it under "others", since this does not provide much information.
Some other propositions:
Hello,
We encounter an issue when using this database, as the name does not always match the model name in the device system.
Example:
In this database --> EliteBook ...
In the device registry --> HP EliteBook ...
This makes automation harder, as all inventory software uses the device registry to get that information.
Is this issue known?
Apple hardware PCF documents can be downloaded here
The spider should get all pdf links on the page, as tools/monitoring/apple_check.py does, and simply launch the parser for each of these links.
The parser could be built based on existing parsers such as tools/parsers/hp_workplace.py
ECODIAG parser could also be used to find all needed regex.
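The link-collection step of such a spider could be sketched as follows, using only the standard library. This is not how tools/monitoring/apple_check.py is implemented; the class and function names are hypothetical, and fetching the page and launching the parser are left out.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

class PdfLinkCollector(HTMLParser):
    """Collect the absolute URLs of all <a href="...pdf"> links on a page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Ignore any query string when checking the file extension.
        if href and href.lower().split("?")[0].endswith(".pdf"):
            url = urljoin(self.base_url, href)
            if url not in self.links:  # skip links repeated on the page
                self.links.append(url)

def find_pdf_links(html, base_url):
    """Return the de-duplicated pdf links found in an HTML document."""
    collector = PdfLinkCollector(base_url)
    collector.feed(html)
    return collector.links
```

Each returned URL could then be downloaded and handed to the parser.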
README files should:
For Apple's smartphones and the likes, screen_size corresponds to the screen resolution in pixels, whereas for monitors and laptops it corresponds to inches.
The same locations are sometimes spelled with long names (China) and sometimes as two-letter codes (CN). This needs to be unified.
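One way to unify them would be to map long names to two-letter codes, as in the sketch below. The mapping shown is a small illustrative sample, not the list of locations actually present in the database, and the function name `normalize_location` is hypothetical.

```python
# Illustrative sample only; a real table would cover every location in the data.
_LOCATION_CODES = {
    "china": "CN",
    "united states": "US",
    "germany": "DE",
}

def normalize_location(value):
    """Return a two-letter code when the location is known, else the input unchanged."""
    cleaned = value.strip()
    if len(cleaned) == 2 and cleaned.isalpha():
        return cleaned.upper()  # already a code, only normalise the case
    return _LOCATION_CODES.get(cleaned.lower(), cleaned)
```

Unknown names are returned untouched so that they can be spotted and added to the table.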
Hello,
Framework Laptops published their LCA: https://downloads.frame.work/resources/Framework-Life-Cycle-Report.pdf
And CEPIR published data for VR headsets in the ADEME footprint dataset: https://base-empreinte.ademe.fr/documentation/base-impact?idDocument=167
Would it be interesting to add this to Boavizta?
Thx
Dell's parser assumes that 'Precision' models are desktops, whereas Precision laptops also exist.
In ecodiag, I extract the sub-categories from the main html file itself rather than from the PCF file.
Hi and thanks for sharing this project :)
It is currently without license, so it would be difficult for people to contribute to it.
There is this sentence in the README:
This data can be freely used for any purpose including without using Boavizta's methodology.
In that case, I'd advise you to use a Creative Commons public license. If you agree, I can make the PR.
Thanks and have a nice day!
For the csv, it would be great to add an Id column to be able to track changes in any other column.
Ecodiag collects data with parsers similar to Boavizta's:
https://gitlab.inria.fr/guenneba/ecodiag-data/-/tree/main/pyscripts
Boavizta parsers could be improved at least by using Ecodiag reverse_piechart code to improve all parsers using OCR.
Then Boavizta and Ecodiag parser results could be compared to identify other improvements.
The date format of the initial parsing is 01-11-2020, and manually added rows use the same format, but the automatic parsers use a different one (2022-10-18).
I think it would be easier to change the initial parsing and the manually added rows. This would avoid having to modify all the spiders.
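A conversion of the older rows could be sketched as below. It assumes that 01-11-2020 means day-month-year, which the issue does not state; if the legacy rows are month-first, the second format string would need to change. The function name `normalize_date` is hypothetical.

```python
from datetime import datetime

def normalize_date(value):
    """Return the date as YYYY-MM-DD, accepting both formats found in the data.

    Assumption: legacy dates such as 01-11-2020 are DD-MM-YYYY.
    """
    for fmt in ("%Y-%m-%d", "%d-%m-%Y"):
        try:
            return datetime.strptime(value.strip(), fmt).date().isoformat()
        except ValueError:
            continue  # try the next known format
    raise ValueError(f"unrecognised date format: {value!r}")
```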
HPE hardware PCF documents can be downloaded here
The spider should:
The parser could be built based on the existing HP Workplace parser.
No need for OCR to analyse pie charts as all data is available as text.
Monitoring of all manufacturers' webpages could be automated with GH Actions to:
Warning: some improvements may be needed on enerate-gh-pr.py, as it has not been tested since November 2021.