Begun, the open source AI wars have

Tan KW

Publish date: Sat, 14 Sep 2024, 05:39 PM

Opinion The Open Source Initiative (OSI) and its allies are getting closer to a definition of open source AI. If all goes well, Stefano Maffulli, the OSI's executive director, expects to announce the OSI open source AI definition at All Things Open in late October. But some open source leaders already want nothing to do with it.

Let's start with some background. Lots of companies - I'm looking at you, Meta - have been claiming that their AI models are open source. They're not. They're not even close.

So the OSI and a host of other companies and groups have been working on creating a comprehensive open source AI definition. After all, the OSI is the same organization that defines open source software with the Open Source Definition.

In their latest draft, the Open Source AI Definition - draft v. 0.0.9, which was announced at KubeCon and Open Source Summit Asia in Hong Kong, significant changes were made, which grated on the nerves of some open source supporters. These are:

Role of training data: Training data is beneficial but not required for modifying AI systems. This decision reflects the complexities of sharing data, including legal and privacy concerns. The draft categorizes training data into open, public, and unshareable non-public data, each with specific guidelines to enhance transparency and understanding of AI system biases.
Separation of checklist: The license evaluation checklist has been separated from the main definition document, aligning with the Model Openness Framework (MOF). This separation allows for a focused discussion on identifying open source AI while maintaining general principles in the definition.

As Linux Foundation executive director Jim Zemlin detailed at the KubeCon and Open Source Summit China, the MOF "is a way to help evaluate if a model is open or not open. It allows people to grade models."

Within the MOF, Zemlin added, there are three tiers of openness. "The highest level, level one, is an open science definition where the data, every component used, and all of the instructions must go and create your model the same way. Level two is a subset where not everything is open, but most are. Then, on level three, you have areas where the data may not be available, and the data that describe the data sets would be available. And you can understand that - even though the model is open - not all the data is available."

This doesn't fly with some people. Tara Tarakiyee, FOSS Technologist for the Sovereign Tech Fund, writes: "A system that can only be built on proprietary data can only be proprietary. It doesn't get simpler than this self-evident axiom."

Tarakiyee adds: "The new definition contains so many weasel words that you can start a zoo... These words provide a barn-sized backdoor for what are essentially proprietary AI systems to call themselves open source."

Open source leader julia ferraioli agrees: "The Open Source AI Definition in its current draft dilutes the very definition of what it means to be open source. I am absolutely astounded that more proponents of open source do not see this very real, looming risk."

AWS principal open source technical strategist Tom Callaway said before the latest draft appeared: "It is my strong belief (and the belief of many, many others in open source) that the current Open Source AI Definition does not accurately ensure that AI systems preserve the unrestricted rights of users to run, copy, distribute, study, change, and improve them."

Afterwards, in a more sorrowful than angry statement, Callaway wrote: "I am deeply disappointed in the OSI's decision to choose a flawed definition. I had hoped they would be capable of being aspirational. Instead, we get the same excuses and the same compromises wrapped in a facade of an open process."

Chris Short, an AWS senior developer advocate, Open Source Strategy & Marketing, agreed. He responded to Callaway that he: "100 percent believe in my soul that adopting this definition is not in the best interests of not only OSI but open source at large will get completely diluted."

Steve Pousty, a developer advocacy consultant, commented on the OSI AI draft: "This definition does not grant the freedom to modify and is unacceptable as an Open Source Definition. With AI models, the weights are the user interface. I can use them directly as a user. They are what is typically distributed to everyone."

That's all well and good, but Maffulli doesn't feel a purely idealistic approach to the open source AI definition will work because no one will be able to meet the definition. Thus, the OSI's support for the MOF's levels of openness approach.

Callaway concluded: "They had a chance to lead, and they chose not to. I suppose the question is now: who will choose to lead in their place?"

That is indeed the question. Or will the community decide that the OSI AI Definition is the best practical way forward? Stay tuned. I fear this debate is going to last for years.

The real question to my mind is whether this will become a meaningless tech argument, such as vi vs EMACS (the answer's vi, by the way), while AI goes its merry way without referencing "open source" except as a marketing term. ®

https://www.theregister.com//2024/09/14/opinion_column_osi/

Discussions

Be the first to like this. Showing 0 of 0 comments

Featured Posts

MQ Trader

Introducing MY's First IPO Fund for Sophisticated Investors!

MQ Chat

New Update. Discover investment communities that resonate with your ideas

MQ Trader

M & A Value Partners IPO Equity Fund has been launched - Targeted 13% Return p.a

Latest Videos

0:17

New IPO: A café chain operator, distributor, and retailer, Oriental Kopi Holdings Berhad aims to list on the ACE Market!

MQ Trader 921 views | 6 d ago

0:17

New IPO: Solar PV EPCC services provider, Northern Solar Holdings Berhad aims to list on the ACE Market!

MQ Trader 675 views | 6 d ago

0:43

M & A Value Partners IPO Equity Fund

MQ Trader 13956 views | 5 mo ago

0:15

MQ Market Updates - 13 January 2025

MQ Trader 188 views | 1 d ago

Apps

MQ Chat

Send individual or group chats with anyone on i3investor

MQ Trader

Earn MQ Points while trading with MQ Trader

MQ Affiliate

Earn side income from Affiliate Program

MQdemy

Online learning and teaching marketplace

Hot Stocks Today >

YTLPOWR

YTL POWER INTERNATIONAL BHD

1000

YTL

YTL CORPORATION BHD

606

NATGATE

NATIONGATE HOLDINGS BERHAD

405

GAMUDA

GAMUDA BHD

375

SUPERMX

SUPERMAX CORPORATION BHD

361

SAPNRG

SAPURA ENERGY BERHAD

334

GENTING

GENTING BHD

287

SET

SWIFT ENERGY TECHNOLOGY BERHAD

285

BAUTO

BERMAZ AUTO BERHAD

269

GENM

GENTING MALAYSIA BERHAD

267

Daily Stocks

HSI-CWA1

0.125

0.00

99,559,600

EAH

0.005

0.00

98,891,500

HSI-PWB1

0.105

-0.005

87,836,700

MYEG

0.895

-0.04

48,733,100

GAMUDA-C2R

0.16

-0.025

46,053,600

YTLPOWR-C64

0.04

-0.005

33,511,300

HSI-CWAY

0.08

-0.005

30,321,200

VELOCITY

0.08

0.00

26,506,000

YTL

2.16

-0.02

26,051,900

HSI-PWB4

0.065

0.00

26,030,100

More active Stocks

HLBANK

20.00

+0.10

660,400

PERSTIM

2.30

+0.10

76,000

NESTLE

92.80

+0.10

16,900

HSI-CWA5

0.105

+0.095

620,100

PMETAL

4.76

+0.09

3,036,700

CHINAETF-MYR

4.49

+0.07

500

RHBBANK

6.33

+0.06

2,507,800

PIE

5.79

+0.06

153,100

HLFG

17.96

+0.06

75,100

TECGUAN

1.65

+0.05

500

More gainer Stocks

PETDAG

19.08

-0.44

81,200

MPI

22.80

-0.42

114,400

ORIENT

7.15

-0.41

2,132,800

ALLIANZ

19.98

-0.38

39,800

SUNCON

4.00

-0.35

5,524,700

UTDPLT

31.00

-0.30

59,500

VSTECS

3.50

-0.24

879,100

TENAGA

13.60

-0.22

2,601,300

CARLSBG

20.12

-0.22

53,300

VITROX

3.76

-0.20

929,400

More loser Stocks

MQ Trading Signals

BUY
SELL

HARVEST MIRACLE CAPITAL BERHAD

2025-01-15 12:00:00

ADX

30 Mins

FIAMMA

FIAMMA HOLDINGS BHD

2025-01-15 11:30:00

ADX

30 Mins

FIAMMA

FIAMMA HOLDINGS BHD

2025-01-15 11:30:00

TURTLE SYSTEM 20

30 Mins

XINHWA

XIN HWA HOLDINGS BERHAD

2025-01-15 11:30:00

EMA 5

30 Mins

T7GLOBAL

T7 GLOBAL BERHAD

2025-01-15 11:30:00

ADX

30 Mins

More Trading Signals

XDL

XIDELANG HOLDINGS LTD

2025-01-15 12:00:00

TURTLE SYSTEM 20

30 Mins

XL HOLDINGS BERHAD

2025-01-15 12:00:00

TURTLE SYSTEM 20

30 Mins

XL HOLDINGS BERHAD

2025-01-15 12:00:00

TURTLE SYSTEM 55

30 Mins

OMESTI

OMESTI BHD

2025-01-15 12:00:00

TURTLE SYSTEM 20

30 Mins

HONGSENG

HONG SENG CONSOLIDATED BERHAD

2025-01-15 12:00:00

EMA 5

30 Mins

More Trading Signals

Featured Advertisers / Partners

Top Brokers >

AmEquities

Affin Hwang

Rakuten Trade

Hong Leong Bank

Books Review >

Ride The Bull Short The Bear

CS Tan

4.9 / 5.0

This book is the result of the author's many years of experience and observation throughout his 26 years in the stockbroking industry. It was written for general public to learn to invest based on facts and not on fantasies or hearsay....

Read More