Question about license matching with SPDX templates
G'day SPDX gurus,
I'm working on a little license matching project [1] leveraging the (great!) SPDX text templates [2], but have noticed that some canonical license texts (e.g. CC-BY-4.0 [3]) don't match the template since they contain a number of horizontal rules (a sequence of '=' characters, in this case). I don't see these in the CC-BY-4.0 text template [4], which means my code doesn't find a match.
Is there a matching guideline [5] or other text processing I'm missing for this case?
Thanks in advance,
Peter
[1] https://github.com/pmonks/lice-comb
[2] https://github.com/spdx/license-list-data/tree/master/template
[3] https://creativecommons.org/licenses/by/4.0/legalcode.txt
[4] https://github.com/spdx/license-list-data/blob/master/template/CC-BY-4.0.template.txt
[5] https://spdx.dev/license-list/matching-guidelines/
I'm working on a little license matching project [1] leveraging the (great!) SPDX text templates [2], but have noticed that some canonical license texts (e.g. CC-BY-4.0 [3]) don't match the template since they contain a number of horizontal rules (a sequence of '=' characters, in this case). I don't see these in the CC-BY-4.0 text template [4], which means my code doesn't find a match.
Is there a matching guideline [5] or other text processing I'm missing for this case?
Thanks in advance,
Peter
[1] https://github.com/pmonks/lice-comb
[2] https://github.com/spdx/license-list-data/tree/master/template
[3] https://creativecommons.org/licenses/by/4.0/legalcode.txt
[4] https://github.com/spdx/license-list-data/blob/master/template/CC-BY-4.0.template.txt
[5] https://spdx.dev/license-list/matching-guidelines/