I’m curious if there is a standardized way to resolve the URNs found in the lexicon, e

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

I may have found the answer: <a href="http://sites.tufts.edu/perseuscatalog/?page_id=9

I would guess that it's probably just unfinished, but <a class="user-mention notransla

How to use URNs? about lexica HOT 11 OPEN

perseusdl commented on September 4, 2024

How to use URNs?

from lexica.

Comments (11)

balmas commented on September 4, 2024 1

You could probably also use the ScaifeDL CTS API's getCapabilities request:

https://scaife-cts.perseus.org/api/cts?request=GetCapabilities

That gives you the author/work/edition/translation metadata for every URN.

from lexica.

lcerrato commented on September 4, 2024 1

@TinaRussell

I think you want to use something like
http://scaife-cts.perseus.org/api/cts?request=GetLabel&urn=urn:cts:greekLit:tlg0020.tlg001.perseus-grc2 without the passage for that particular call.

A few points to add.

The URNs identified in the LSJ may be incorrect. These were checked against the existing Perseus collections (at the time) and that was done based on whether the link itself was valid. So, if the data was bad, —as was often the case in the "ibid" citations where the wrong antecedent is picked up,— we may not have identified that as a problem. If a URN was included for a work not yet in Perseus, then the problem would have been harder to spot. The quality will be better where an unambiguous reference was given: Plu. Brut. 7 but the data is very tricky in this regard, as you know.
The current Scaife collections do not have all of the texts in Perseus. There are many texts in Scaife not found in Perseus and many works in Perseus not yet moved into Scaife. So there are likely going to be cases where LSJ includes a URN that Scaife does not recognize. (For the most part, the recent additions to Scaife not found in Perseus are post-classical — so they are not generally part of the LSJ canon.)
As works move into Scaife from Perseus the URNs change. So the top level identifier should be consistent but the edition extensions may change. In your example, Scaife features tlg0020.tlg001.perseus-grc2 while Perseus (www) had tlg0020.tlg001.perseus-grc1
The last release of the catalog is several years out of date from the backend data. I do not think you'll see atom feeds for anything added subsequently. We have some tools in development that will better address this hidden data issue.

from lexica.

lcerrato commented on September 4, 2024 1

@TinaRussell
tlg4083 is not in the Scaife Viewer, so I wouldn't expect it to work. It's also not identified in the catalog, although I see an issue that indirectly refers to this.
I see it is the Eustathius Commentary on the Iliad.
I also see this on an old survey of IDs for which no results were returned — which would make sense.

from lexica.

helmadik commented on September 4, 2024 1

hi @TinaRussell , Peter Heslin has incorporated the URNs in his Diogenes application, whose code you can download at https://github.com/pjheslin/diogenes . To accommodate this use in Diogenes, I've done fairly extensive work on the references in LSJ and Lewis & Short (hunting down and repairing where Il. 2.349, 458 becomes Homer-Iliad-2-349, Homer-Iliad-458, or the like). Maybe his code will be helpful? He allows people to type in authors and select works by title, and nobody is confronted with URNs directly, but perhaps you can make use of his code to go in the other direction.

from lexica.

lcerrato commented on September 4, 2024

@TinaRussell
Hi, I know you've been in touch with James Tauber on related issues but I didn't want to leave this unanswered.
I don't know of any converters or other tools for this—we don't host any at Perseus.

The original abbreviations should still be in the data but we don't have a mapping tool for these. The abbreviations in LSJ are fraught with irregularities, though, so this can be a challenge. An early project of mine was cleaning up these links and correcting invalid references, so often times the data itself was either incorrectly entered or inconsistently presented.

I am not aware of a single master list of all of these URNs — particularly the base URNs (such as urn:cts:greekLit:tlg0033.tlg001) but the underlying data is cataloged such as here:

There may be tools or scripts others have created to better address this and James would be the best place to start with that.

FYI, Giuseppe Celano has a Unicode version of the data:
https://github.com/gcelano/LSJ_GreekUnicode

from lexica.