New License List website features


Gary O'Neall
 

Greetings legal team,

 

I’m working on updating software which generates the website and license list data to include the status of the “Other web pages” links for the licenses.  I have a couple of questions for those interested in the license website look and feel.

 

My plan is if the URL is no longer live, we will remove the hyperlink and only display the text.  I will also append “[No longer live, last checked date]” after the URL text string.  Below is an example:

 

Let me know if this formatting is OK.  I don’t think I’ll have time for fancy HTML formatting, but I can easily change the text.  Of course, if someone with good HTML skills would like to contribute – the template for this file can be found at https://github.com/spdx/LicenseListPublisher/blob/master/resources/htmlTemplate/LicenseHTMLTemplate.html

 

There are several other fields about the URL which is available in the JSON (and other) license data formats.  I do not have any plans to display these values on the website for this release.

 

I’m planning on treating redirected web pages as live pages (turns out there is a very large number of permanent redirects in the URLs).  Let me know if you disagree with this approach.

 

Thanks,
Gary

 

 

-------------------------------------------------

Gary O'Neall

Principal Consultant

Source Auditor Inc.

Mobile: 408.805.0586

Email: gary@...

CONFIDENTIALITY NOTE: The information transmitted, including attachments, is intended only for the person(s) or entity to which it is addressed and may contain confidential and/or privileged material. Any review, re-transmission, dissemination or other use of, or taking of any action in reliance upon this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and destroy any copies of this information.

 


Matija Šuklje
 

Die 13. 11. 20 et hora 07:15 Gary O'Neall scripsit:
My plan is if the URL is no longer live, we will remove the hyperlink and
only display the text. I will also append "[No longer live, last checked
date]" after the URL text string. Below is an example:
Would it make sense to equip it with an Internet Archive link then?


cheers,
Matija
--
gsm: tel:+386.41.849.552
www: https://matija.suklje.name
xmpp: matija.suklje@...
sip: matija_suklje@...


Steve Winslow
 

Gary, thanks for this! I think this will be helpful as we periodically get issues submitted asking about dead links -- will be helpful to have it more visible on the license list page.

Hi Matija, I think that if there is a desire for an Internet Archive link, that's probably most easily handled by a PR to the particular license XML file in https://github.com/spdx/license-list-XML to add to (not replace) the cross-references. I don't expect that the publisher tool will automatically add them in, or that the license list maintainers will hunt for replacement links across all licenses -- but if the community submits PRs to supplement with links for particular licenses, I think those would be merged.

Best,
Steve


On Fri, Nov 13, 2020 at 2:12 AM Matija Šuklje <matija@...> wrote:
Die 13. 11. 20 et hora 07:15 Gary O'Neall scripsit:
> My plan is if the URL is no longer live, we will remove the hyperlink and
> only display the text.  I will also append "[No longer live, last checked
> date]" after the URL text string.  Below is an example:

Would it make sense to equip it with an Internet Archive link then?


cheers,
Matija
--
gsm:    tel:+386.41.849.552
www:    https://matija.suklje.name
xmpp:   matija.suklje@...
sip:    matija_suklje@...









--
Steve Winslow
Director of Strategic Programs
The Linux Foundation


Matija Šuklje
 

Die 13. 11. 20 et hora 15:48 Steve Winslow scripsit:
Hi Matija, I think that if there is a desire for an Internet Archive link,
that's probably most easily handled by a PR to the particular license XML
file in https://github.com/spdx/license-list-XML to add to (not replace)
the cross-references.
Having it as an additional link makes sense to me.

I don't expect that the publisher tool will
automatically add them in, or that the license list maintainers will hunt
for replacement links across all licenses -- but if the community submits
PRs to supplement with links for particular licenses, I think those would
be merged.
I mentioned it because as soon as you have original URL and a timestamp, you
can simply generate an Internet Archive URL as such:

http://web.archive.org/web/{$timestamp}/{$original_url}

Using Gary’s example, all you need to do is to translate:


* https://notvalidopensource.org/licenses/EUPL-1.1 [no longer live -
last checked 2020-11-13 - 05:57:04]

to

http://web.archive.org/web/20201113055704/https://notvalidopensource.org/
licenses/EUPL-1.1

And the cool thing is that if the Internet Archive does not have that exact
timestamp, it will just link to the first snapshot earlier than that.

cheers,
Matija
--
gsm: tel:+386.41.849.552
www: https://matija.suklje.name
xmpp: matija.suklje@...
sip: matija_suklje@...


Till Jaeger
 

Am 13.11.20 um 16:01 schrieb Matija Šuklje:
Die 13. 11. 20 et hora 15:48 Steve Winslow scripsit:
Hi Matija, I think that if there is a desire for an Internet Archive link,
that's probably most easily handled by a PR to the particular license XML
file in https://github.com/spdx/license-list-XML to add to (not replace)
the cross-references.
Having it as an additional link makes sense to me.

I don't expect that the publisher tool will
automatically add them in, or that the license list maintainers will hunt
for replacement links across all licenses -- but if the community submits
PRs to supplement with links for particular licenses, I think those would
be merged.
I mentioned it because as soon as you have original URL and a timestamp, you
can simply generate an Internet Archive URL as such:

http://web.archive.org/web/{$timestamp}/{$original_url}

Using Gary’s example, all you need to do is to translate:


* https://notvalidopensource.org/licenses/EUPL-1.1 [no longer live -
last checked 2020-11-13 - 05:57:04]

to

http://web.archive.org/web/20201113055704/https://notvalidopensource.org/
licenses/EUPL-1.1

And the cool thing is that if the Internet Archive does not have that exact
timestamp, it will just link to the first snapshot earlier than that.
Not sure if it helps but we have many links for licenses to the Internet
Archive which is very interesting vor license archaeologists:

https://github.com/LeChasseur/ifrOSS/blob/master/Lizenzcenter.md


Alexios Zavras
 

"last checked" may or may not mean "last time checked AND found" 😉


-- zvr

-----Original Message-----
From: Spdx-legal@... <Spdx-legal@...> On Behalf Of Matija Šuklje
Sent: Friday, 13 November, 2020 16:02
To: spdx-legal@...
Subject: Re: New License List website features

Die 13. 11. 20 et hora 15:48 Steve Winslow scripsit:
Hi Matija, I think that if there is a desire for an Internet Archive
link, that's probably most easily handled by a PR to the particular
license XML file in https://github.com/spdx/license-list-XML to add to
(not replace) the cross-references.
Having it as an additional link makes sense to me.

I don't expect that the publisher tool will automatically add them in,
or that the license list maintainers will hunt for replacement links
across all licenses -- but if the community submits PRs to supplement
with links for particular licenses, I think those would be merged.
I mentioned it because as soon as you have original URL and a timestamp, you can simply generate an Internet Archive URL as such:

http://web.archive.org/web/{$timestamp}/{$original_url}

Using Gary’s example, all you need to do is to translate:


* https://notvalidopensource.org/licenses/EUPL-1.1 [no longer live -
last checked 2020-11-13 - 05:57:04]

to

http://web.archive.org/web/20201113055704/https://notvalidopensource.org/
licenses/EUPL-1.1

And the cool thing is that if the Internet Archive does not have that exact timestamp, it will just link to the first snapshot earlier than that.

cheers,
Matija
--
gsm: tel:+386.41.849.552
www: https://matija.suklje.name
xmpp: matija.suklje@...
sip: matija_suklje@...







Intel Deutschland GmbH
Registered Address: Am Campeon 10-12, 85579 Neubiberg, Germany
Tel: +49 89 99 8853-0, www.intel.de
Managing Directors: Christin Eisenschmid, Gary Kershaw
Chairperson of the Supervisory Board: Nicole Lau
Registered Office: Munich
Commercial Register: Amtsgericht Muenchen HRB 186928


Matija Šuklje
 

Die 14. 11. 20 et hora 00:35 Gary O'Neall scripsit:
Alexios is correct - the last checked is just the last time the tool was
run. The wording is ambiguous - I think I'll just removed the last checked
string and denote the link as "No Longer Live".
Makes sense to me.

Another option could be to simply store the timestamp of when the license was
suggested/added instead. That one we can be at least fairly sure a human
checked as well.


cheers,
Matija
--
gsm: tel:+386.41.849.552
www: https://matija.suklje.name
xmpp: matija.suklje@...
sip: matija_suklje@...


Alexios Zavras
 

Ehmmm... someone suggests today (and gets accepted next month) a license of a software written in the '90s, and the license text points to a website that has not existed for years.

No current timestamp would be useful.
(this is actually a pretty common case)

-- zvr

-----Original Message-----
From: Spdx-legal@... <Spdx-legal@...> On Behalf Of Matija Šuklje
Sent: Saturday, 14 November, 2020 09:40
To: spdx-legal@...
Subject: Re: New License List website features

Die 14. 11. 20 et hora 00:35 Gary O'Neall scripsit:
Alexios is correct - the last checked is just the last time the tool
was run. The wording is ambiguous - I think I'll just removed the
last checked string and denote the link as "No Longer Live".
Makes sense to me.

Another option could be to simply store the timestamp of when the license was suggested/added instead. That one we can be at least fairly sure a human checked as well.


cheers,
Matija
--
gsm: tel:+386.41.849.552
www: https://matija.suklje.name
xmpp: matija.suklje@...
sip: matija_suklje@...







Intel Deutschland GmbH
Registered Address: Am Campeon 10-12, 85579 Neubiberg, Germany
Tel: +49 89 99 8853-0, www.intel.de
Managing Directors: Christin Eisenschmid, Gary Kershaw
Chairperson of the Supervisory Board: Nicole Lau
Registered Office: Munich
Commercial Register: Amtsgericht Muenchen HRB 186928


Patrice-Emmanuel SCHMITZ
 

Greetings,
The "official" EUPL pages on the European Commission site JOINUP present the text of the current EUPL-1.2
but the EUPL-1.1 may still be used. No automatic switch to 1.2 if no "or later" mention was added. 
The 1.1 text is still published in appendix to the 1.2 page.
P-E


Le ven. 13 nov. 2020 à 07:16, Gary O'Neall <gary@...> a écrit :

Greetings legal team,

 

I’m working on updating software which generates the website and license list data to include the status of the “Other web pages” links for the licenses.  I have a couple of questions for those interested in the license website look and feel.

 

My plan is if the URL is no longer live, we will remove the hyperlink and only display the text.  I will also append “[No longer live, last checked date]” after the URL text string.  Below is an example:

 

Let me know if this formatting is OK.  I don’t think I’ll have time for fancy HTML formatting, but I can easily change the text.  Of course, if someone with good HTML skills would like to contribute – the template for this file can be found at https://github.com/spdx/LicenseListPublisher/blob/master/resources/htmlTemplate/LicenseHTMLTemplate.html

 

There are several other fields about the URL which is available in the JSON (and other) license data formats.  I do not have any plans to display these values on the website for this release.

 

I’m planning on treating redirected web pages as live pages (turns out there is a very large number of permanent redirects in the URLs).  Let me know if you disagree with this approach.

 

Thanks,
Gary

 

 

-------------------------------------------------

Gary O'Neall

Principal Consultant

Source Auditor Inc.

Mobile: 408.805.0586

Email: gary@...

CONFIDENTIALITY NOTE: The information transmitted, including attachments, is intended only for the person(s) or entity to which it is addressed and may contain confidential and/or privileged material. Any review, re-transmission, dissemination or other use of, or taking of any action in reliance upon this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and destroy any copies of this information.

 



--
Patrice-Emmanuel Schmitz
pe.schmitz@...
tel. + 32 478 50 40 65