{"id":12795,"date":"2023-06-06T20:29:49","date_gmt":"2023-06-06T18:29:49","guid":{"rendered":"http:\/\/plus.maciejpiasecki.info\/index.php\/2023\/06\/06\/the-ai-renaissance-and-why-open-source-matters\/"},"modified":"2023-06-06T22:17:21","modified_gmt":"2023-06-06T20:17:21","slug":"the-ai-renaissance-and-why-open-source-matters","status":"publish","type":"post","link":"https:\/\/plus.maciejpiasecki.info\/index.php\/2023\/06\/06\/the-ai-renaissance-and-why-open-source-matters\/","title":{"rendered":"The AI\u00a0renaissance and why Open Source matters"},"content":{"rendered":"<p>Sharing knowledge and sharing code has always been a key driver for innovation in Artificial Intelligence. Researchers have gathered together since AI was established as a field to develop and advance novel techniques, from Natural Language Processing to Artificial Neural Networks, from Machine Learning to Deep Learning.<br \/>\nThe Open Source community has played a key role in advancing AI and bringing it to solve real-world problems. Libraries and frameworks like TensorFlow,\u00a0PyTorch, Keras, and Scikit-learn, for example, have allowed researchers and data scientists to study and make use of AI.<br \/>\nIn recent years, we have seen the emergence of LLMs (Large Language Models). While most of the early models such as BERT\u00a0and GPT-2\u00a0were commercial friendly, proprietary models like OpenAI\u2019s GPT-4\u00a0have gained wide popularity. Models like LLaMA, Alpaca, Vicuna, and Koala\u00a0have not yet adopted a license that allows for commercial use.<br \/>\nFortunately in these past few years, we have seen the research community\u00a0build several important Open Source models, such as EleutherAI\u2019s GPT-Neo, GPT-J, GPT-NeoX, and Pythia, as well as models from BLOOM\u00a0(BigScience Large Open-science Open-access Multilingual Language Model) and LAION\u00a0(Large-scale Artificial Intelligence Open Network).<br \/>\nMost recently, we have seen Open Source models like DataBricks Dolly 2.0, Stability Diffusion\u2019s StableLM, Cerebras GPT, h2oGPT, RedPajama, UC Berkeley\u2019s OpenLLaMA, MosaicML\u2019s MPT, NVIDIA\u2019s NeMo, Hugging Face\u2019s StarCoder, and TII\u2019s\u00a0Falcon\u00a0emerge and gain traction.<br \/>\nAdditionally, it\u2019s important to note that smaller and more specific models are outperforming larger models, allowing small organizations and even individuals to innovate. Fine-tuning these models using techniques like LoRA\u00a0(Low-Rank Adaptation of Large Language Models) and \u00a0QLoRA\u00a0have allowed for faster iterations.<br \/>\nRecently, users have been able to run and even fine tune these smaller models from their desktop and mobile (e.g. GPT4All\u00a0and PrivateGPT) and their browser (e.g. WebGPT). This will unleash a new wave of applications, where users\u2019 privacy will be respected while allowing them to take advantage of AI. Businesses will also gain full control over these models and be able to run them on prem or on the public cloud using privacy-enhancing technologies. Additionally, these models will also be able to run from IoT devices and on the edge.<br \/>\nThe world of AI is at an important crossroads. There are two paths forward: one where highly regulated proprietary code, models, and datasets are going to prevail, or one where Open Source dominates. One path will lead to a stronghold of AI by a few large corporations where end-users will have limited privacy and control, while the other will democratize AI, allowing anyone to study, adapt, contribute back, innovate, as well as build businesses on top of these foundations with full control and respect for privacy.<br \/>\nIf you are interested in learning more about Open Source and AI, please join our \u201cDeep Dive: Defining Open Source AI\u201d series.<br \/>\nThe post &lt;span class=&#8217;p-name&#8217;&gt;The AI\u00a0renaissance and why Open Source matters&lt;\/span&gt; appeared first on Voices of Open Source.&#013;<br \/>\n<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/plus.maciejpiasecki.info\/wp-content\/uploads\/2023\/06\/oooooo-1024x491-1.png\" width=\"1024\" height=\"491\">&#013;<br \/>\nSource: opensource.org&#013;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Sharing knowledge and sharing code has always been a key driver for innovation in Artificial Intelligence. Researchers have gathered together [&hellip;]<\/p>\n","protected":false},"author":57,"featured_media":12796,"comment_status":"false","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3],"tags":[],"class_list":["post-12795","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-mp"],"_links":{"self":[{"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/posts\/12795","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/users\/57"}],"replies":[{"embeddable":true,"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/comments?post=12795"}],"version-history":[{"count":1,"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/posts\/12795\/revisions"}],"predecessor-version":[{"id":12797,"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/posts\/12795\/revisions\/12797"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/media\/12796"}],"wp:attachment":[{"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/media?parent=12795"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/categories?post=12795"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/plus.maciejpiasecki.info\/index.php\/wp-json\/wp\/v2\/tags?post=12795"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}