Giter Club home page Giter Club logo

waybacks's Introduction

UKWA (prototype)

This is the initial prototype for the updated UKWA website.

waybacks's People

Contributors

anjackson avatar gilhoggarth avatar min2ha avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

waybacks's Issues

Re-theme Beta Wayback look-and-feel to be consistent with the new website

We need to modernise the look-and-feel of our Wayback playback, and make it more consistent with the new website.

I've started this on the 2017-style-reset branch, which just resets things to use a minimally-modified version of the default OpenWayback theme. This should provide a good basis for development.

  • Switch to a logo consistent with the new UKWA site.
  • For unknown reasons the Welsh language support package for dates etc. is not being used.
  • Reinstate navigation between /archive/ and /archive-cy/ in toolbar
  • There is much un-translated text in the BubbleCalendar page, and no links between Welsh/English versions.

Finish the MissingResourceLoggingFilter implementation

The idea is to log missing resources so we can send them to a 'patch crawl' queue and add them back in again. A partial implementation is here

The idea is to shift to using that for all Wayback instances (not just under wayback-ldwa) and ideally make it POST the 404 URLs to the to-crawl queue. We'd probably want to push them over via in a background thread to avoid 404 responses being delayed and hanging the client.

NPLD playback should indicate licensing terms of current page?

When viewing items in wayback-ld we get very little information or context. For example, should we make the OA/NPLD terms clear, so the user knows they can access it outside the reading room?

Tricky to implement because the system has to somehow check against the whitelist but treat it as advisory rather than dropping the non-OA items. It's not clear if this is possible without significant changes to Wayback.

Add Content Security Policy to avoid comScore web bug and other live leaks

To try to stop leaking to the live web, particularly the comScore web bug, we need to try adding a Content-Security-Policy. To block all, non-www.webarchive.org.uk resource we can use:

Content-Security-Policy: default-src 'unsafe-eval' 'unsafe-inline' 'self' data: blob: mediastream: ws: wss: ; form-action 'self'

Provide links to the secure gateways from the 451 error page

We have reports of people visiting Wayback and finding they get the 451 error page while they are in a reading room. The current implementation expects the reading-room-or-not re-direction to happen upstream in the user journey (as part of the main website). However, it is reasonable to expect that people will end up here sometimes and need a way to get to their secure gateway.

The simplest option is just to add links to each secure gateway into the error page.

A more sophisticated version could try to detect which link is relevant and only provide links that work.

Test and deploy NPLD Wayback with the new back-end configuration

We need to deploy the LD Wayback player with a new backend configuration. I've removed the old custom configuration, so the build should now inherit the new configuration for the back end. i.e. the current set-up should work.

However, this needs to be tested thoroughly, including the LD-specific functionality. So, here's a suggested plan of work:

  • Build this project locally, and check you can deploy wayback-qa to a local Tomcat. @GilHoggarth should be able to supply suitable back-end service endpoints to run like QA Wayback.
  • Then try deploying wayback-ldwa locally, this should work exactly like wayback-qa except if you try to go to the same page in two separate browsers, you should be locked out of one of them.
  • If this seems to be working, check you can visit the lock management UI. IIRC documentation for that is on our internal Wiki.
  • You should also check the 7 day embargo works when configured.

Once this appears to be working, we'll need to merge these changes into the branch for #1.

Don't say '0 captures'

The toolbar says '0 captures', which is silly. We just need a link to the 'calendar' page, so it should just say 'show captures'.

This should be done on the 2017-style-reset branch.

Ensure 404's are logged cleanly.

Can we ensure 404's/missing records are logged cleanly, so we can parse the logs and try to get the missing resources?

Or, is there some other way to do this? Could be part of the screenshotting-of-proxy-playback?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.