Giter Club home page Giter Club logo

extract-data's Introduction

kofi pdulvp

extract-data's People

Contributors

pdulvp avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

extract-data's Issues

Add an Edit button on rule popup

When editing a rule from the toolbar button, its quite unfriendly to open the first rule at option opening.

Add somehow an edit button to edit the intended one.
image

Ability to defined sub items

On a rule, it is possible to define items, computed from the document root.
Now that is possible to select several elements on an item, it would be nice to be able to compute sub items.

image

The output json may be overhead now though. Maybe change it to something like

{
  "Rule #1": 
  [{
    "Item #1": 
	[
      { "Item #11": [ "https://github.com/pdulvp/extract-data/issues/50#start-of-content" ],  "Item #12": [ "root-link" ] },
      { "Item #11": [ "https://github.com/pdulvp/extract-data" ], "Item #12": [ "root-link-2" ] }
	]
  }]
}

Extract data from URL/Query Parameters

Sometimes identifying information is contained in a URL and not displayed on the page. It would be helpful to also collect URL values, and specifically query parameters to help correlate an Id to the extracted data.

Extract value from input fields

Extracting data from textarea form fields works, I'm assuming because the data is held in between two html tags. It would be handy to be able to extract the value from an input field for sites that return data with a prefilled form.

Add dynamic evaluation on options

When editing a rule, if the opened tab has a new result value, then the option is not updated.

Expected behavior: The value content is updated automatically if the site content change
nutri

Add some predefined expressions unaccessible with XPath or CSS Selectors

As a first step to #34, it is not possible to retrieve document location uri if this value is not available as a meta tag in the html page.

With some predefined expressions, uri might be easily available.

  • document.location
  • document.location.href
  • document.location.protocol
  • document.location.host
  • document.location.hostname
  • document.location.port
  • document.location.pathname
  • document.location.origin

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.