{"id":13969,"date":"2024-05-06T18:02:27","date_gmt":"2024-05-06T16:02:27","guid":{"rendered":"http:\/\/plus.maciejpiasecki.info\/index.php\/2024\/05\/06\/open-source-ai-definition-weekly-update-may-6\/"},"modified":"2024-05-13T22:39:25","modified_gmt":"2024-05-13T20:39:25","slug":"open-source-ai-definition-weekly-update-may-6","status":"publish","type":"post","link":"https:\/\/plus.maciejpiasecki.info\/index.php\/2024\/05\/06\/open-source-ai-definition-weekly-update-may-6\/","title":{"rendered":"Open Source AI Definition \u2013 Weekly update May 6"},"content":{"rendered":"<p>Definition validation: Seeking volunteers<\/p>\n<p>The process has entered a new phase: We are now seeking volunteers to validate the Open Source AI Definition, using it to review existing AI systems. The objective of the phase is to confirm that the Definition works as intended and understand where it fails.  \u00a0<\/p>\n<p>A spreadsheet is given where you locate and link to the license, research paper, or other document that grants rights or provides information for each required component.\u00a0<\/p>\n<p>Systems include, but are not limited to:<\/p>\n<p>Arctic<\/p>\n<p>BLOOM<\/p>\n<p>Falcon<\/p>\n<p>Grok<\/p>\n<p>Llama 2<\/p>\n<p>Mistral<\/p>\n<p>OLMo<\/p>\n<p>OpenCV<\/p>\n<p>Phi-2<\/p>\n<p>Pythia<\/p>\n<p>T5<\/p>\n<p>To volunteer by May 20th, please contact Mer on the forum<\/p>\n<p>Summary of comments received on the Definition draft<\/p>\n<p>Grammatical and wording corrections\u00a0<\/p>\n<p>Some minor grammatical suggestions were made. These change and order the layout slightly differently, though the overall message remains.\u00a0<\/p>\n<p>One user suggested to explain what Open Source  is under the \u201cpreamble\u201d and \u201cWhy we need open source AI\u201d. Instead of speaking about why Open Source is important, the section should rather be an introduction to what it is and why it matters for AI.<\/p>\n<p>Under \u201cPreferred form to make modifications to machine-learning systems\u201d and \u201cdata information\u201d, clarification is needed regarding \u201cthe training data set used\u201d. It is not clear whether this means that all training data must be open source for the whole model to be.<\/p>\n<p>Stefano Maffulli added here that the intention is to know what dataset was used, not to necessarily have it made available, and that it indeed seems to need clarification<\/p>\n<p>Technical points<\/p>\n<p>Under \u201cPreferred form to make modifications to machine-learning systems\u201d the release of checkpoints is mentioned as an example of required components, under \u201cmodel parameters\u201d. An objection was raised, arguing that this poses an unnecessary burden: It\u2019d be like requiring that for software to be Open Source, it should include past versions of the program.<\/p>\n<p>Maffulli reiterated that this was merely an example but that this might need to be a submission to the FAQ page<\/p>\n<p>\u201cPreferred form to make modifications to machine-learning systems\u201d and \u201cdata information\u201d, a \u201cskilled person\u201d is mentioned in the context of requiring sufficient information about the training data used to create a model. Question regarding why skill has to do with acquiring data<\/p>\n<p>Clarification was given by Maffulli, pointing out that this is in the context of getting information about the data so that a \u201cskilled person\u201d can use, study, share and modify the AI system.<\/p>\n<p>A user suggested that this confusion can be solved by changing the context of the wording \u201ca skilled person can recreate\u201d. From \u201cusing the same or similar data\u201d to \u201cif able to gain access to the same or similar data\u201d.<\/p>\n<p>A user points out that \u201cskilled person\u201d as a legal term used in patent law might not be appropriate as it has different legal connotations and precedence in different countries.<\/p>\n<p>Discussion on why specifically we focus on machine learning (ML) as an AI system<\/p>\n<p>A question was raised regarding why we explicitly mention ML systems under \u201cpreferred form to make modification to an ML system\u201d and subsequently the \u201cchecklist\u201d, pointing out that not all AI systems are ML.<\/p>\n<p>Maffulli replied that we address ML as they need special and urgent attention as rule-based AI systems can fit under the open source definition. This needs to be addressed in the FAQ<\/p>\n<p>Town hall announcement\u00a0<\/p>\n<p>The 9th town hall meeting was held on the 3d of May.\u00a0Access the recording here if you missed it!<br \/>\n&#013;<br \/>\n&#013;<br \/>\nSource: opensource.org&#013;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Definition validation: Seeking volunteers The process has entered a new phase: We are now seeking volunteers to validate the Open [&hellip;]<\/p>\n","protected":false},"author":63,"featured_media":0,"comment_status":"false","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3],"tags":[],"class_list":["post-13969","post","type-post","status-publish","format-standard","hentry","category-mp"],"_links":{"self":[{"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/posts\/13969","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/users\/63"}],"replies":[{"embeddable":true,"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/comments?post=13969"}],"version-history":[{"count":1,"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/posts\/13969\/revisions"}],"predecessor-version":[{"id":13970,"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/posts\/13969\/revisions\/13970"}],"wp:attachment":[{"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/media?parent=13969"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/categories?post=13969"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/tags?post=13969"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}