Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

[Proposal]: Doesn't use any dataset or model that does not compatible with Open Source#725

wannaphong started this conversation inGeneral
Discussion options

I think we should doesn't use any dataset or model that does not compatible with Open Source in our project (PyThaiNLP). If the license that doesn't compatible with open source, we should ignore theirs datasets/models.

You must be logged in to vote

Replies: 5 comments

Comment options

bact
Oct 11, 2022
Maintainer

+1

For data,
see some licenses that compatible with OKFNOpen Definition herehttps://resources.data.gov/open-licenses/

For code,
see all OSI-approved licenses herehttps://opensource.org/licenses

You must be logged in to vote
0 replies
Comment options

wannaphong
Oct 11, 2022
Maintainer Author

+1

For data, see some licenses that compatible with OKFNOpen Definition herehttps://resources.data.gov/open-licenses/

For code, see all OSI-approved licenses herehttps://opensource.org/licenses

@bact Do you think about LST20 Corpus? It is free for open source project only. I thinking remove all LST20 model and replace with Blackboard Treebank.18c8c50

You must be logged in to vote
0 replies
Comment options

wannaphong
Oct 11, 2022
Maintainer Author

Blackboard Treebank license is CC-BY.https://www.facebook.com/dancearmy/posts/10158423653343284?_rdc=1&_rdr

LST20 Corpus (free for non-commercial and open source only).https://www.facebook.com/dancearmy/posts/10157641945708284

You must be logged in to vote
0 replies
Comment options

bact
Oct 11, 2022
Maintainer

NC (non-commercial) clause makes things not fully open. OKFN Open Definition states that to be open it must also be open "for any purpose".

You must be logged in to vote
0 replies
Comment options

wannaphong
Oct 11, 2022
Maintainer Author

I think I will add the source that can’t use in the open source and ours project.

  • ___- Website has legal requirements (Do not forward any information from the web without allowed) and many dataset are CC-BY-SA-NC.
You must be logged in to vote
0 replies
Sign up for freeto join this conversation on GitHub. Already have an account?Sign in to comment
Category
General
Labels
corpuscorpus/dataset-related issuesdocumentationimprove documentation and test cases
2 participants
@wannaphong@bact
Converted from issue

This discussion was converted from issue #704 on October 11, 2022 10:02.


[8]ページ先頭

©2009-2025 Movatter.jp