
Big Tech's eventual response to my LLM-crasher bug report was dire

Column: Found a bug? It turns out that reporting it with a story in The Register works remarkably well ... mostly. After publication of my "Kryptonite" article about a prompt that crashes many AI chatbots, I began to get a steady stream of emails from readers - many times the total of all reader emails I'd received in the previous decade.

Disappointingly, too many of them consisted of little more than a request to reveal the prompt so that they could lay waste to large language models.

If I were of a mind to hand over dangerous weapons to anyone who asked, I'd still be a resident of the United States.

While I ignored those pleas, I responded to anyone who seemed to have an actual need - a range of security researchers, LLM product builders, and the like. I thanked each for their interest and promised further communication - when Microsoft came back to me with the results of its own investigation.

As I reported in my earlier article, Microsoft's vulnerability team opined that the prompt wasn't a problem because it was a "bug/product suggestion" that "does not meet the definition of a security vulnerability."

Following the publication of the story, Microsoft suddenly "reactivated" its assessment process and told me it would provide analysis of the situation in a week.

While I waited for that reply, I continued to sort through and prioritize reader emails.

Trying to exert an appropriate amount of caution - even suspicion - provided a few moments of levity. One email arrived from an individual - I won't mention names, except to say that readers would absolutely recognize the name of this Very Important Networking Talent - who asked for the prompt, promising to pass it along to the appropriate group at the Big Tech company at which he now works.

This person had no notable background in artificial intelligence, so why would he be asking for the prompt? I felt paranoid enough to suspect foul play - someone pretending to be this person would be a neat piece of social engineering.

It took a flurry of messages to another, verified email address before I could feel confident the mail really came from this eminent person. At that point - plain text seeming like a very bad idea - I requested a PGP key so that I could encrypt the prompt before dropping it into an email. Off it went.
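For readers unfamiliar with the mechanics, that step amounts to importing the recipient's public key and encrypting the message to it before it ever touches a mail client. Here is a minimal sketch in Python, assuming a local GnuPG installation and the python-gnupg package; the key file name and the prompt placeholder are illustrative, not details from the actual exchange:

```python
# Minimal sketch: encrypt a sensitive prompt to a recipient's PGP public key
# so only ASCII-armoured ciphertext goes into the email body.
# Assumes GnuPG is installed and the python-gnupg package is available.
import gnupg

gpg = gnupg.GPG()  # uses the default local keyring

# Import the public key the recipient supplied (placeholder file name).
with open("recipient_public_key.asc") as key_file:
    import_result = gpg.import_keys(key_file.read())

# Encrypt the prompt to that key; armor=True yields text safe to paste into email.
encrypted = gpg.encrypt(
    "the crash-inducing prompt goes here",  # placeholder - never sent in plain text
    recipients=import_result.fingerprints,
    armor=True,
    always_trust=True,  # skip the web-of-trust check for this sketch
)

if not encrypted.ok:
    raise RuntimeError(f"Encryption failed: {encrypted.status}")

print(str(encrypted))  # the ASCII-armoured ciphertext to drop into the email
```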

A few days later, I received the following reply:

Translated: "It works on my machine."

I immediately went out and broke a few of the LLM bots operated by this luminary's Big Tech employer, emailed back a few screenshots, and soon got an "ouch - thanks" in reply. Since then, silence.

That silence speaks volumes. A few of the LLMs that would regularly crash with this prompt seem to have been updated - behind the scenes. They don't crash anymore, at least not when operated from their web interfaces (although APIs are another matter). Somewhere deep within the guts of ChatGPT and Copilot, something appears to have been patched to prevent the behavior the prompt induces.
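The only way an outsider can tell is to keep replaying the prompt and watch for the failure to disappear. A rough sketch of that sort of regression check follows, written against a generic OpenAI-style chat completions endpoint; the endpoint URL, model name, and the redacted prompt placeholder are all assumptions, not details from the report:

```python
# Rough regression check: replay a suspected crash-inducing prompt against an
# LLM HTTP API and record whether the service still fails. The endpoint, model
# name, and prompt placeholder below are illustrative assumptions only.
import os
import requests

API_URL = "https://api.example.com/v1/chat/completions"  # hypothetical endpoint
API_KEY = os.environ["LLM_API_KEY"]
CRASH_PROMPT = "<redacted - the prompt is deliberately not published>"

def still_crashes(timeout: float = 60.0) -> bool:
    """Return True if the API still misbehaves on the prompt (error or hang)."""
    try:
        resp = requests.post(
            API_URL,
            headers={"Authorization": f"Bearer {API_KEY}"},
            json={
                "model": "some-model",  # placeholder model name
                "messages": [{"role": "user", "content": CRASH_PROMPT}],
            },
            timeout=timeout,
        )
    except requests.Timeout:
        return True   # hung request: the old symptom
    except requests.RequestException:
        return True   # connection dropped mid-response
    return resp.status_code >= 500  # server-side error also counts as a failure

if __name__ == "__main__":
    print("still crashes" if still_crashes() else "appears patched")
```

Run periodically, a check like this is the closest an outsider gets to a changelog when fixes land silently.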

That may be why, a fortnight after reopening its investigation, Microsoft got back to me with this response:

This reply raised more questions than it offered answers, as I indicated in my reply to Microsoft:

That went off to Microsoft's vulnerability team a month ago - and I still haven't received a reply.

I can understand why: Although this "deficiency" may not be a direct security threat, prompts like these need to be tested very broadly before being deemed safe. Beyond that, Microsoft hosts a range of different models that remain susceptible to this sort of "deficiency" - what does it intend to do about that? Neither of my questions has an easy answer - and likely nothing a three-trillion-dollar firm would want to commit to in writing.

I now feel my discovery - and subsequent story - highlighted an almost complete lack of bug reporting infrastructure from the LLM providers. And that's a key point.

Microsoft comes closest to having that sort of infrastructure, yet can't see beyond its own branded products to understand why a problem that affects many LLMs - including plenty hosted on Azure - should be dealt with collaboratively. This failure to collaborate means fixes - when they happen at all - take place behind the scenes. You never find out whether the bug's been patched until a system stops showing the symptoms.

I'm told security researchers frequently encounter similar silences only to later discover behind-the-scenes patches. The song remains the same. If we choose to repeat the mistakes of the past - despite all those lessons learned - we can't act surprised when we find ourselves cooked in a new stew of vulnerabilities. ®

 

https://www.theregister.com//2024/07/10/vendors_response_to_my_llmcrasher/
