Opened 12 months ago

Last modified 8 weeks ago

#46 reopened Feature request

Variant Namespaces Instead of MIME Type

Reported by: Florian Berger Owned by: florian
Milestone: Document Findings Keywords:
Cc:

Description (last modified by florian)

DE: Varianten: Namespace statt MIME-Typ

Varianten: statt MIME-Typ sowas wie Java/Android?-Namespaces: luna.html, ext.rakewi.quiz -> auch gleich API-Endpunkt-URL? /ext/

Change History (13)

comment:3 Changed 8 months ago by florian

  • _"Namespaces are one honking great idea -- let's do more of those!"_
  • The Java docs say on [Naming a Package](https://docs.oracle.com/javase/tutorial/java/package/namingpkgs.html): _"Companies use their reversed Internet domain name to begin their package names"_
  • The name should serve both _semantic_ and _accountability purposes_, i.e. we should be able to guess what it does, and how I can check it's legit.
  • Finally, what about metadata? The variant type needs a readable name, probably a description etc.
    • A variant definition is part of the code, not of the data.
      • So: no definitions, code, metadata etc. on the _variant type and specification_ should lurk in the course database, in a local folder like icons etc.
      • Instead, it should be provided at the code level, in a folder where Python can load it.
        • That way, again similar to Java, the type name can become the path: import ext.org.example.user , which would imply a ext/org/example/user/__init__.py
      • This module can then register itself with a type, a localised name (or, in Dublin Core, an identifier), and a description
Version 0, edited 8 months ago by florian (next)

comment:2 Changed 8 months ago by florian

The flow:

  • HTTP GET request arrives
  • Decide content type to retrieve:
    • If a specific mode is requested, use as content type
    • Else check Accept: header
    • Else fall back to text/html
  • Try to retrieve content type from storage
    • If content type is not available, fall back to text/html
Last edited 6 months ago by florian (previous) (diff)

comment:1 Changed 8 months ago by florian

Should we leave the MIME notation as is? I.e., use dots as delimiters, but not artificially split up fields?

  • luna.text/html.ar_SY
  • ext.org.example.user.audio/opus.fa_IR

This would also possibly give a hint as to where the domain might end.

comment:3 Changed 6 months ago by florian

https://unicode-org.github.io/icu-docs/apidoc/released/icu4c/classicu_1_1Locale.html#a0638620f0b0c25aad55539f14f739244

>>> import icu
>>> icu.Locale.create
icu.Locale.createCanonical(        icu.Locale.createFromName(         icu.Locale.createKeywords(         icu.Locale.createUnicodeKeywords(
>>> icu.Locale.createFromName("de_DE")
<Locale: de_DE>
>>> str(icu.Locale.createFromName("de_DE"))
'de_DE'
>>> str(icu.Locale.createFromName("de"))
'de'
>>> icu.Locale.createCanonical("de_DE")
<Locale: de_DE>
>>> str(icu.Locale.createCanonical("de_DE"))
'de_DE'
>>> str(icu.Locale.createCanonical("de"))
'de'
>>> str(icu.Locale.createCanonical("fa_IR"))
'fa_IR'
>>> str(icu.Locale.createCanonical("fa-IR"))
'fa_IR'
>>> str(icu.Locale.createFromName("de"))
'de'
>>> str(icu.Locale.createFromName("de").minimizeSubtags())
'de'
>>> str(icu.Locale.createFromName("de_DE").minimizeSubtags())
'de'
>>> str(icu.Locale.createFromName("en-US").minimizeSubtags())
'en'

comment:4 Changed 6 months ago by florian

So.

  • Variants will in principle be described using RDF.
    • Minimum attributes are
    • Variants may not be accessible by a dereferencable URI, so they will be referenced by their ID in a URN, such as urn:luna:variant:contentID:variantID
  • From the attributes, a dot-delimited string is constructed by the above methods. The string serves as the format identifier.
    • It must be suitable for a Python import statement, i.e. obey the rules for Python package and module names.
    • The string may be used when requesting a mode from Luna, such as http://host/courses/view/courseID/contentID?mode=formatID
      • Since the mode string may be very long, verbose, and confusing to outsiders, a hash may be used instead. Currently the hash algorithm is SHA256. A prefix of the hash may be used as long as it is unique regarding the variants in the course.


Last edited 6 months ago by florian (previous) (diff)

comment:5 Changed 6 months ago by florian

I briefly considered using the Accept-Header as the content format, but this is basically just MIME, and supports no further attributes. A language request would be a separate Accept-Language header, and that would become a mess pretty quickly.

comment:6 Changed 6 months ago by florian

https://rdflib.dev/ looks really useful to handle RDF in Python.

comment:7 Changed 6 months ago by florian

Last edited 6 months ago by florian (previous) (diff)

comment:8 Changed 6 months ago by florian

The Luna SQLite database does not store RDF.

This would lead to a possbily awkward duplication of data. Also, if semantic information would be kept in both columns and an RDF dump, how would Luna guarantee these are always in sync?

And if semantic information was exclusively kept in RDF records, then another query language and system would be needed on top of SQLite to retrieve the data. But a Luna storage backend should have a single API, not multiple.

But:

RDF should be easily retrievable from the database.

So probably a table is needed to map column names to URIs.

Consequences for variants:

  • There will be a new table formats.
  • Columns:
    • id – the dot-delimited string, foreign key for variants.format
    • publisher
    • media_type (MIME type)
    • language
    • sha256

comment:9 Changed 6 months ago by florian

Milestone: Demo Course – FeaturesMode Menu – Features
Owner: set to florian
Status: newaccepted
Type: Bug reportFeature request

comment:10 Changed 6 months ago by florian

Resolution: Fixed
Status: acceptedclosed

comment:11 Changed 8 weeks ago by florian

Milestone: Mode Menu – FeaturesDocument Findings
Resolution: Fixed
Status: closedreopened
Note: See TracTickets for help on using tickets.