> The Jeopardy! match with Watson was essentially rigged due to the computer’s superhuman reaction time.
From many anecdotes I've heard from previous Jeopardy contestants, buzzer technique is more than half the battle. That's part of the reason Ken Jennings was able to go on winning as long as he did - he continuously refined and developed better intuition for when to buzz.
Many contestants (a large portion of whom likely participated in college Trivia Bowls and were seasoned trivia buffs) could answer the vast majority of Jeopardy clues with relative ease.
The reason for this is that Jeopardy, in order to have household appeal, cannot legitimately engage in an increasingly difficult arms race for its questions - it has to have a capped ceiling in terms of clue difficulty.
There were two astounding takeaways when Watson beat Ken and Brad:
1. We've advanced the state of the art in terms of being able to reason and parse complex language to a level the general population wasn't aware of.
2. Beating humans on the buzzer, of all things, feels intrinsically unfair and there's no graceful way to account for that.
Watson's ability to interpret, correctly, what most of the Jeopardy! "answers" were looking for was a sea change. It really was as big as Deep Blue, the crossing of a frontier we collectively thought would likely be much further in the future, and it was here now: complex clues, difficult to reason out context, and the ability to search its corpus of knowledge and assign a confidence value and then produce accurate results... it was mind blowing.
It's funny because, in a pinch, ChatGPT will answer essentially every Jeopardy! clue correctly every single time now, owing to the specific nature of generative transformers and the advent of attention, specifically. But Watson was pre "attention is all you need!" It was blood, sweat, and tears, and markov chains, and all kinds of craziness. It was a huge swing, and the fact that it actually worked was remarkable.
But back to the buzzer: most people sitting at home probably don't realize that the buzzer is locked out from buzzing in until the host (then Alex, now Ken) completes the read of the clue. At that point, an off-stage producer presses a switch to unlock the buzzers. So prospective players spend their weeks before their tape date spamming a retractable pen in front of the TV trying to nail the timing from the cadence of the host's voice, and hoping they can get it right. Because getting it wrong penalizes you: if you attempt to buzz in before the lock is lifted, your buzzer is individually locked for another 250ms or so. Another producer has a screen that monitors the buzzer activity so they can see if players are habitually buzzing in too early and getting locked, or to monitor for malfunctions, and it's wild to watch it light up like a Christmas tree in the 100 or so last milliseconds of the clue.
This is why you see players spam the buzzer: if they're locked out, the best thing they can do is keep attempting to get back in until the lock lifts. This is also why on rare occasions, you'll see all three players spamming and no one actually getting buzzed in: everyone locked.
So how do you translate this to a computer? The answer was basically to create a makeshift "finger" that would activate the buzzer, but would consistently buzz in at exactly 28ms after the lock cleared... every single time (so long as its confidence was high enough).
I think it's fair to say that there's nothing, at _all_, remarkable about a computer beating a human on a buzzer. It's a signal loop: wait for lock to clear, receive signal lock has cleared, activate buzzer 28ms later. Every device with a stopwatch or timer or clock is more accurate than a human attempting the same timing exercise. It's a solved problem, and not an interesting one. And it disproportionately took the wind out of the sails from the human competitors, who are both the most world-class, best-on-the-planet Jeopardy! players, who were losing by milliseconds to that signal loop.
It's still remarkable Watson was able to successfully answer so many clues correctly and end up winning, but the buzzer aspect was an (intractable) bummer.
I'm really curious how this kind of writing will be perceived in the coming years. It's "good" long form writing but a lot of that also feels like journalistic padding for the sake of the form, as if you asked an LLM to "write in the style of a long form journalist."
Easy example:
"...dismissed as “disappointing,” “unfair,” and—my favorite—“a gimmick”..."
Is it really important for me to know Clare McNear's favourite? A couple of years ago I wouldn't have thought anything of it but now I'm so sickened constantly by LLMs adding so much needless cruft to everything they write I'm reading this thinking "just tell me the story, for the love of humanity!"
I’m seeing hyperreality in real time. You complain that an article of long form journalism appears to be the sort of thing that might be produced by asking an LLM to create it?
Doesn’t that mean it is true to form? Can you not imagine that some people like reading stories written by others?
Yes, that's exactly what it means. Like I say, a few years ago I wouldn't have thought anything of it, but now it's behind painfully clear how formulaic that kind of writing is.
> Can you not imagine that some people like reading stories written by others?
This is quite uncharitable and a straw man. I love reading stories written by others. I'm complaining very specifically about the over reliance on tropes of the form.
I also wasn't leading with the complaint, I was leading with the curiosity I feel about how writing will develop as a result of what I'm seeing here, as I'm sure I'm not the only one who suddenly feels different about human made writing. That's not too say I don't lament it, either, but it is what it is.
From the article:
> The Jeopardy! match with Watson was essentially rigged due to the computer’s superhuman reaction time.
From many anecdotes I've heard from previous Jeopardy contestants, buzzer technique is more than half the battle. That's part of the reason Ken Jennings was able to go on winning as long as he did - he continuously refined and developed better intuition for when to buzz.
Many contestants (a large portion of whom likely participated in college Trivia Bowls and were seasoned trivia buffs) could answer the vast majority of Jeopardy clues with relative ease.
The reason for this is that Jeopardy, in order to have household appeal, cannot legitimately engage in an increasingly difficult arms race for its questions - it has to have a capped ceiling in terms of clue difficulty.
https://archive.is/p2hG5
There were two astounding takeaways when Watson beat Ken and Brad:
1. We've advanced the state of the art in terms of being able to reason and parse complex language to a level the general population wasn't aware of.
2. Beating humans on the buzzer, of all things, feels intrinsically unfair and there's no graceful way to account for that.
Watson's ability to interpret, correctly, what most of the Jeopardy! "answers" were looking for was a sea change. It really was as big as Deep Blue, the crossing of a frontier we collectively thought would likely be much further in the future, and it was here now: complex clues, difficult to reason out context, and the ability to search its corpus of knowledge and assign a confidence value and then produce accurate results... it was mind blowing.
It's funny because, in a pinch, ChatGPT will answer essentially every Jeopardy! clue correctly every single time now, owing to the specific nature of generative transformers and the advent of attention, specifically. But Watson was pre "attention is all you need!" It was blood, sweat, and tears, and markov chains, and all kinds of craziness. It was a huge swing, and the fact that it actually worked was remarkable.
But back to the buzzer: most people sitting at home probably don't realize that the buzzer is locked out from buzzing in until the host (then Alex, now Ken) completes the read of the clue. At that point, an off-stage producer presses a switch to unlock the buzzers. So prospective players spend their weeks before their tape date spamming a retractable pen in front of the TV trying to nail the timing from the cadence of the host's voice, and hoping they can get it right. Because getting it wrong penalizes you: if you attempt to buzz in before the lock is lifted, your buzzer is individually locked for another 250ms or so. Another producer has a screen that monitors the buzzer activity so they can see if players are habitually buzzing in too early and getting locked, or to monitor for malfunctions, and it's wild to watch it light up like a Christmas tree in the 100 or so last milliseconds of the clue.
This is why you see players spam the buzzer: if they're locked out, the best thing they can do is keep attempting to get back in until the lock lifts. This is also why on rare occasions, you'll see all three players spamming and no one actually getting buzzed in: everyone locked.
So how do you translate this to a computer? The answer was basically to create a makeshift "finger" that would activate the buzzer, but would consistently buzz in at exactly 28ms after the lock cleared... every single time (so long as its confidence was high enough).
I think it's fair to say that there's nothing, at _all_, remarkable about a computer beating a human on a buzzer. It's a signal loop: wait for lock to clear, receive signal lock has cleared, activate buzzer 28ms later. Every device with a stopwatch or timer or clock is more accurate than a human attempting the same timing exercise. It's a solved problem, and not an interesting one. And it disproportionately took the wind out of the sails from the human competitors, who are both the most world-class, best-on-the-planet Jeopardy! players, who were losing by milliseconds to that signal loop.
It's still remarkable Watson was able to successfully answer so many clues correctly and end up winning, but the buzzer aspect was an (intractable) bummer.
ChatGPT would have to just always buzz since generating text is too slow for it to come up with an answer before buzzing.
Watson was given the whole prompt (as text) right when Trebek started reading it out aloud, so it had a full 6-7 seconds to choose an answer.
ChatGPT is more than fast enough to get an answer under the same constraints.
I'm really curious how this kind of writing will be perceived in the coming years. It's "good" long form writing but a lot of that also feels like journalistic padding for the sake of the form, as if you asked an LLM to "write in the style of a long form journalist."
Easy example:
"...dismissed as “disappointing,” “unfair,” and—my favorite—“a gimmick”..."
Is it really important for me to know Clare McNear's favourite? A couple of years ago I wouldn't have thought anything of it but now I'm so sickened constantly by LLMs adding so much needless cruft to everything they write I'm reading this thinking "just tell me the story, for the love of humanity!"
I’m seeing hyperreality in real time. You complain that an article of long form journalism appears to be the sort of thing that might be produced by asking an LLM to create it?
Doesn’t that mean it is true to form? Can you not imagine that some people like reading stories written by others?
> Doesn’t that mean it is true to form?
Yes, that's exactly what it means. Like I say, a few years ago I wouldn't have thought anything of it, but now it's behind painfully clear how formulaic that kind of writing is.
> Can you not imagine that some people like reading stories written by others?
This is quite uncharitable and a straw man. I love reading stories written by others. I'm complaining very specifically about the over reliance on tropes of the form.
I also wasn't leading with the complaint, I was leading with the curiosity I feel about how writing will develop as a result of what I'm seeing here, as I'm sure I'm not the only one who suddenly feels different about human made writing. That's not too say I don't lament it, either, but it is what it is.
[flagged]