
sparqlextension's Introduction

OpenRefine

Join the chat at https://gitter.im/OpenRefine/OpenRefine

OpenRefine is a Java-based power tool that allows you to load data, understand it, clean it up, reconcile it, and augment it with data coming from the web. All from a web browser and the comfort and privacy of your own computer.

Official website: https://openrefine.org

Community forum: https://forum.openrefine.org

Download

Snapshot releases

You can download snapshots of the development version of OpenRefine. To do so, you need to be logged in to GitHub. Then, click on the first item with a green tick / check mark on this page and scroll down to the Artifacts section to find the version that matches your operating system.

Run from source

If you have cloned this repository to your computer, you can run OpenRefine with:

  • ./refine on Mac OS and Linux
  • refine.bat on Windows

This requires JDK 11 or newer, Apache Maven, and Node.js 16 or newer (which includes npm).

Documentation

Contributing to the project

Contact us

Licensing and legal issues

OpenRefine is open source software, licensed under the BSD license in LICENSE.txt. See the licenses folder for information on the open source libraries that OpenRefine depends on.

Credits

This software was created by Metaweb Technologies, Inc. and originally written and conceived by David Huynh. Metaweb Technologies, Inc. was acquired by Google, Inc. in July 2010 and the product was renamed Google Refine. In October 2012, it was renamed OpenRefine as it transitioned to a community-driven project.

Since 2020, OpenRefine has been fiscally sponsored by Code for Science and Society (CS&S).

See CONTRIBUTING.md for instructions on how to contribute yourself.

sparqlextension's People

Contributors

antoine2711, waltong

sparqlextension's Issues

Linked Data Fragments importer

Some RDF triple stores offer a Linked Data Fragments endpoint. This is a query interface which is much simpler than SPARQL and is therefore often faster and more reliable (since it is cheaper for the service). Importantly, it supports paging, which lets a client download large sets of results without running into timeouts.
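The paging behaviour described above can be sketched as follows. This is a minimal illustration, not the extension's code: the query parameter names (subject, predicate, object, page) are assumptions, since a real Linked Data Fragments client is expected to read them from the hypermedia controls returned by the server, and fetch_page is a hypothetical callback standing in for the HTTP request and RDF parsing.

```python
from typing import Callable, Iterator, List, Optional, Tuple
from urllib.parse import urlencode

Triple = Tuple[str, str, str]


def fragment_url(base: str, subject: Optional[str] = None,
                 predicate: Optional[str] = None, obj: Optional[str] = None,
                 page: int = 1) -> str:
    """Build the URL of one page of a triple-pattern fragment.

    The parameter names used here are assumptions; a real client should
    discover them from the controls embedded in the server's response.
    """
    pattern = {"subject": subject, "predicate": predicate, "object": obj}
    query = {key: value for key, value in pattern.items() if value is not None}
    query["page"] = str(page)
    return f"{base}?{urlencode(query)}"


def iter_triples(fetch_page: Callable[[int], List[Triple]],
                 max_pages: int = 1000) -> Iterator[Triple]:
    """Yield triples page by page, stopping at the first empty page."""
    for page in range(1, max_pages + 1):
        triples = fetch_page(page)
        if not triples:
            return
        yield from triples
```

Because each page is a separate, small request, a client built this way never waits on one long-running query, which is the property that avoids the timeouts mentioned above.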

Proposed solution

It would be nice if OpenRefine supported importing from an LDF endpoint, similarly to how SQL databases are supported.

Alternatives considered

This is similar to the request for a SPARQL importer, OpenRefine/OpenRefine#1212 (because a lot of services offer both endpoints).

Create project from WCQS SPARQL query (special case: requires Wikimedia authentication)

This is a feature request for a specific subcase of OpenRefine/OpenRefine#1212. This will be helpful for users who want to edit structured data of existing Wikimedia Commons files with the help of OpenRefine.

Through the SDC project for OpenRefine, users will be able to edit and upload files with structured data on Wikimedia Commons. See more info about this project on meta.wikimedia.org.

In some cases, it may be very handy for users to start an OpenRefine project with a SPARQL query from the Wikimedia Commons Query Service (WCQS). However, this specific SPARQL endpoint requires Wikimedia OAuth authentication.

It would be great if the work done on OpenRefine/OpenRefine#1212 also includes this use case, or alternatively we add support for WCQS after that general task has been completed.

Proposed solution

I have no idea at all about the technical difficulties re: this request. Curious to hear considerations around this!
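As a starting point for that discussion, here is a minimal sketch of an authenticated SPARQL request. The POST form with a url-encoded query parameter follows the SPARQL 1.1 Protocol. Everything about authentication is an assumption: the sketch supposes the endpoint accepts an OAuth 2.0 bearer token in the Authorization header, which has not been verified for WCQS, and build_sparql_request is a hypothetical helper name.

```python
from typing import Optional
from urllib.parse import urlencode
from urllib.request import Request


def build_sparql_request(endpoint: str, query: str,
                         token: Optional[str] = None) -> Request:
    """Prepare (but do not send) a SPARQL query as a form-encoded POST."""
    headers = {
        "Accept": "application/sparql-results+json",
        "Content-Type": "application/x-www-form-urlencoded",
    }
    if token is not None:
        # Assumption: the service accepts a bearer token. WCQS may instead
        # require a different OAuth flow or a session cookie.
        headers["Authorization"] = f"Bearer {token}"
    body = urlencode({"query": query}).encode("utf-8")
    return Request(endpoint, data=body, headers=headers, method="POST")
```

Keeping the token optional means the same code path could serve both open endpoints and ones that need authentication, which matches the idea of treating WCQS as a special case of the general importer.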

Additional context

Color syntaxing and auto-completion

Description

  • Autocompletion (Ctrl-Return) for entities and properties, as in the WDQS
  • Syntax highlighting, as in the WDQS

Example

Now:
image

Desired:
image

Skip the level selection step

Description

The step where you choose the JSON import level can be skipped because the level is always the same.
The column names should come from the JSON field names of the first record.

The project name and tags could then be asked for in the previous step.
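To illustrate why the level never changes, here is a minimal sketch that assumes the endpoint returns the W3C SPARQL 1.1 Query Results JSON Format, where records always sit under results.bindings. Whether the extension receives exactly this format is an assumption, and results_to_table is a hypothetical name. The sketch prefers the variable list in head.vars and falls back to the field names of the first record, as proposed above.

```python
import json
from typing import List, Optional, Tuple


def results_to_table(results_json: str) -> Tuple[List[str], List[List[Optional[str]]]]:
    """Turn a SPARQL JSON result document into column names and rows."""
    data = json.loads(results_json)
    bindings = data.get("results", {}).get("bindings", [])
    # Prefer the declared variables; otherwise use the first record's fields.
    columns = data.get("head", {}).get("vars")
    if not columns:
        columns = list(bindings[0].keys()) if bindings else []
    # Unbound variables are simply absent from a record, so they become None.
    rows = [[record.get(col, {}).get("value") for col in columns]
            for record in bindings]
    return columns, rows
```

Using head.vars when available keeps columns for variables that happen to be unbound in the first record, which the first-record approach alone would miss.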

Screenshot

image

Provide a list of SPARQL Importer end-points

The current text field for end-points shows no suggestions.
image

It would be good to have a list of suggested end-points.
The suggestions could come from a file in our GitHub repository, or be gathered from the services’ manifests.
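A minimal sketch of the file-based option follows. The catalog format (a JSON list of objects with name and url fields) is hypothetical, as is the function name; the two example entries are only there to make the sketch runnable.

```python
import json
from typing import List

# Hypothetical catalog content, as it might be stored in a repository file.
EXAMPLE_CATALOG = json.dumps([
    {"name": "Wikidata", "url": "https://query.wikidata.org/sparql"},
    {"name": "ArtsData", "url": "http://kg.artsdata.ca/sparql/"},
])


def suggest_endpoints(catalog_json: str, typed: str, limit: int = 5) -> List[str]:
    """Return end-point URLs whose name or URL contains the typed text."""
    entries = json.loads(catalog_json)
    needle = typed.strip().lower()
    matches = [entry["url"] for entry in entries
               if needle in entry["name"].lower() or needle in entry["url"].lower()]
    return matches[:limit]
```

For example, suggest_endpoints(EXAMPLE_CATALOG, "arts") would return only the ArtsData URL, while an empty input returns every entry up to the limit.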

Entity column should be reconciled

Description

If a column contains entities, they should be reconciled instead of showing the URL.

Example

The item_original column shows the entity URL; the second column, item, shows the expected reconciled result.
image
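One way to decide when a column qualifies is sketched below. It assumes the entities are Wikidata entities, whose URIs have the form http://www.wikidata.org/entity/Q42; other services would need their own pattern. The 0.9 threshold and both function names are hypothetical.

```python
import re
from typing import Iterable, Optional

# Wikidata entity URIs: items (Q), properties (P) and lexemes (L).
WIKIDATA_ENTITY = re.compile(r"^https?://www\.wikidata\.org/entity/([QPL]\d+)$")


def entity_id(value: str) -> Optional[str]:
    """Return the entity identifier if the value is a Wikidata entity URI."""
    match = WIKIDATA_ENTITY.match(value.strip())
    return match.group(1) if match else None


def looks_like_entity_column(values: Iterable[str], threshold: float = 0.9) -> bool:
    """Guess whether a column holds entities, ignoring empty cells."""
    non_empty = [value for value in values if value]
    if not non_empty:
        return False
    hits = sum(1 for value in non_empty if entity_id(value) is not None)
    return hits / len(non_empty) >= threshold
```

A column that passes this check could be marked as reconciled using the extracted identifiers, without sending the values through a reconciliation service.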

Trying to query ArtsData SPARQL end-point fails

Description of the problem

End-point: http://kg.artsdata.ca/sparql/

# Upcoming events in Artsdata.ca

PREFIX schema: <http://schema.org/>
SELECT DISTINCT  ?start ?status ?mode (sample(?label) as ?name)  (sample(?url) as ?webpage)
WHERE {
  ?e a schema:Event ;
     schema:startDate ?start ;
     schema:name ?label ;
     schema:url ?url ;
     schema:eventStatus ?status ;
     schema:eventAttendanceMode ?mode .
  FILTER(?start > '2022-08-10T01:47:07.569Z'^^xsd:dateTime)
} 
GROUP BY ?start ?status ?mode
LIMIT 1000

This will trigger an error: « Unable to parse preview ».
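One thing that may be worth ruling out is that the query uses the xsd: prefix without declaring it. Some end-points predefine common prefixes and others reject such a query. Whether this is related to the error above is not established; the sketch below only shows a rough, regex-based way to spot prefixes that are used but never declared. It is not a SPARQL parser, and undeclared_prefixes is a hypothetical helper.

```python
import re
from typing import Set

QUERY = """
# Upcoming events in Artsdata.ca
PREFIX schema: <http://schema.org/>
SELECT DISTINCT ?start ?status ?mode (sample(?label) as ?name) (sample(?url) as ?webpage)
WHERE {
  ?e a schema:Event ;
     schema:startDate ?start ;
     schema:name ?label ;
     schema:url ?url ;
     schema:eventStatus ?status ;
     schema:eventAttendanceMode ?mode .
  FILTER(?start > '2022-08-10T01:47:07.569Z'^^xsd:dateTime)
}
GROUP BY ?start ?status ?mode
LIMIT 1000
"""


def undeclared_prefixes(query: str) -> Set[str]:
    """Return prefixes that appear in prefixed names but have no PREFIX line."""
    text = re.sub(r"<[^<>\s]*>", " ", query)                # drop IRIs
    text = re.sub(r"'[^'\n]*'|\"[^\"\n]*\"", " ", text)     # drop string literals
    text = re.sub(r"#[^\n]*", " ", text)                    # drop comments
    declared = set(re.findall(r"(?i)\bPREFIX\s+([A-Za-z][\w-]*):", text))
    used = set(re.findall(r"\b([A-Za-z][\w-]*):[A-Za-z_]", text))
    return used - declared


print(undeclared_prefixes(QUERY))
```

If the end-point does require the declaration, adding PREFIX xsd: <http://www.w3.org/2001/XMLSchema#> at the top of the query would be the corresponding fix to try.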
