Comments (14)
@paiva, we use our own internal mechanisms that are baked into the server code. I would not recommend doing that, I would use something that is canned and out of the box. I have used GA before, and it works really nice, I know some may have some issues with it being Google, but you can close off sections of the website to the analytics. I don't have experience with the others.
from conp-portal.
Idea for a metric:
- Number of git clones of the
conp-dataset
repo: https://github.com/CONP-PCNO/conp-dataset/graphs/traffic
from conp-portal.
@shots47s What is CBRAIN doing for tracking analytics?
from conp-portal.
Hey @paiva, as you know there is discussion of adding GA to LORIS. So let's stay in sync. We can definitely limit what pages GA has access over by not loading it on certain pages.
from conp-portal.
Matomo (formerly known as Piwik) is free as in freedom (GPLv3)
from conp-portal.
Discussed at CONP dev meeting on May 20th, 2020.
What would be next highest priority analytics?
-
analytics to display on the portal
- how many datasets available
- number of tools available
- more popular datasets on the portal
-
analytics to collect for reviewers and Brain Canada
- how many datasets visited, how many datasets downloaded etc...?
- number of users, visits etc..
for next week, suggest metrics metrics of interests here.
from conp-portal.
A few ideas of metrics (mostly basic I guess but always good to write them down):
[ ] 1. Number of datasets and tools
[ ] 2. Keyword search (to gather some stats on what people are looking for maybe) - not sure how easy to gather that though....
[ ] 3. Number of git clones of the conp-dataset repo (from @glatard above, just gathering everything in one description so far)
[ ] 4. Number of pipelines run through CBRAIN (and number of datasets processed through CBRAIN)
[ ] 5. Number of uploads through the different upload platforms (Zenodo, OSF, FRDR, DataLad...) (not sure if useful, maybe for Brain Canada)
[ ] 6. Graphical representation of some predefined modality key words. For example, number of dataset with EEG, number of quality control subject datasets, number of dataset with MRI etc...
[ ] 7. Graphical representation of some predefined Tags for the tools. For example, number of bioinformatics tools, neuroinformatics tools, MRI tool etc...
[ ] 8. Number of open datasets, open datasets requiring an account, registered datasets ...
legend:
- bold = general stats that could be represented on the portal (dashboard or somewhere else)
- italic = some stats that could be of interest for Brain Canada
from conp-portal.
Add #views and #download click per dataset
from conp-portal.
- Number of datasets and tools on the dashboard
- Graphical representation of tags key words. For example, number of dataset with EEG, number of quality control subject datasets, number of dataset with MRI etc... (taken from the
keywords
section ofDATS.json
of the datasets) - Graphical representation of Tags for the tools. For example, number of bioinformatics tools, neuroinformatics tools, MRI tool etc..
- Number of views of the different visited pages of the portal
- Keyword search statistics (to gather some stats on what people are looking for on the datasets and tools pages)
- List of institutions that provided the datasets (a little map with dots representing the different institutions)
- need to add an institution field to the DATS schema in order to have that information
- in the meantime could use the conp_status field (current possible values being:
CONP
,Candian
andexternal
)
Long term:
- Number of times download has been clicked on when available
- Number of uploads through the different upload platforms (Zenodo, OSF, FRDR, DataLad...) (not sure if useful, maybe for Brain Canada)
- Number of pipelines run through CBRAIN (and number of datasets processed through CBRAIN)
- Where the users come from (to be checked with terms of use too...)
from conp-portal.
Deadline for phase 1 of analytics: End of July 2020. See which analytics in the comment made 7 days ago.
from conp-portal.
Summary of the discussions that happened during a quick meeting between @3design, @liamocn and @cmadjar on July 8th, 2020.
Overall representation of the analytics:
- in the dashboard:
a) in the card showing the graph, add a drop down where users can select which representation they want to see on that card with the ability to visualize the selected graph in the card
b) at the bottom of the card, add a square button that would lead to an analytics page that would display all the analytics available - create an analytics page that will display all the graphs in the page. When we will have a lot of graphs, they could be organized per subject (data, tools, user's based statistics...)
Different graphs to work on:
- start with the representation of keyword/tags for the datasets and tools present in CONP
- for dataset, representation of the modality keywords: use the
types
section of the DATS model to show the number of each modality keyword across all datasets (aka: number of EEG, number of MRI, number of BIDS, number of behavrioral response...) - for tools: use the
tags
present in the Boutiques descriptor and represent the number of each tag
- for dataset, representation of the modality keywords: use the
- regarding the users experience on the portal (number of views, keyword search hits...), we need to figure out how to track the number of clicks, dataset viewed etc...
- for the list of institutions to be represented on a map, we don't currently have that information available. Will need to update the DATS model to require a field that would describe the institution and its address.
- in the meantime, we could have a representation of the conp_status field if that is of interest (
CONP
,Canadian
orexternal
)
- in the meantime, we could have a representation of the conp_status field if that is of interest (
@3design @liamocn feel free to add or correct my notes on the meeting we just had. We will present that at the next CONP dev meeting so everyone is on board with the plan.
from conp-portal.
Discussed at CONP dev meeting of July 15th, 2020:
-
Regarding the map graph, there is a proposal to broaden the range of values for CONP_status, see CONP-PCNO/conp-dataset#366
-
Webiste traffic: could set up Google analytics to run on the portal if this goes with the CONP terms of use (to be followed up with the ethics and governance committee). Could also parse the logs in the server or set up JS counters...
-
Ideas to count downloads:
- Count the number of access to the dataset web page
- Count them through the web download button
- Count the number of git clones using the GitHub API
- Instrument the DataLad client to report download stats
from conp-portal.
@liamocn updates from today's meeting:
- Match dataset and pipeline colours from the first graph with the dataset and pipeline bubble charts colours
- Use a different colour than grey for pipelines (pick from the CONP palette)
- Fix the
View analytics
button not working on Firefox - Remove the
Select Chart
drop down from the dashboard
Is that something you could do this quickly to have that ready for the next CONP dev meeting?
from conp-portal.
Because this issue is gigantic and vague, it has been split into several smaller tickets:
- Display Matomo widgets into the analytics page, #403 (visitors map, page transitions etc...)
- Display number of visits to the portal across time, #404 (to be discussed)
- Display number of views per datasets/tools, #405
- List popular keyword searches, #406
- Analytics on dataset providers, #407
from conp-portal.
Related Issues (20)
- Look into SHACL schema validation for CONP data in Nexus
- Addition of a number of views and number of downloads sort by functionality
- Have a way to add metadata that we generate to better describe tools and datasets HOT 2
- Marking "coming soon" data in the portal. HOT 1
- DATS Editor: Date pickers should allow shortcut to change year. Dates should also follow ISO-8601 standard. HOT 1
- Dataset size, nb of files, etc. should be optional HOT 8
- extending DATS metadata to enable multiple new features HOT 6
- Add search/filter by age buckets - pedriatric, geriatric, lifespan
- License: Hyperlink / hoverhelp for context
- allow search for eeg OR electroencephalography
- versioned Terms of Use HOT 2
- DATS.json : hover/hyperlink to these in the list of datasets
- Task Executions page: Error message trigger can be stood down/ disabled
- Tool Executions page: add-on wishlist
- Tool License
- Proposal: DATS.json redesign for Interlex-based cross-references with CBRAIN HOT 3
- Revised proposal for Interlex-based cross-references with CBRAIN HOT 3
- Datasets: Option to provide Platform logo for third-party / offsite download platforms (vs. Study/org logo)
- List New/recent tools
- [ Tools ] 2 little important Bugs: Licenses are wrong + popup message glitch
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from conp-portal.