The vibes when I joined AI in ~2008:
— Andrej Karpathy (@karpathy) March 23, 2023
- workshops w 50 ppl musing on whether deep learning will ever work
- papers w cute toy problems
- fun poster sessions
- this experiment I ran in MATLAB
- high-level panels on paths to AI
- neuroscience guest lectures
Today is *not* the same.
GPT is a new kind of computer architecture that runs on text. Yes it can talk to us, but also to much of our existing software infrastructure. First via apps on top of APIs, now inside ChatGPT via plugins.
— Andrej Karpathy (@karpathy) March 23, 2023
What a time right now...https://t.co/HjeUCv3XE7
https://www.youtube.com/@AndrejKarpathy
https://twitter.com/karpathy
https://karpathy.ai
https://karpathy.medium.com
http://karpathy.github.io
https://github.com/karpathy
https://karpathy.ai/tweets.html
Software 2.0 Neural networks are not just another classifier, they represent the beginning of a fundamental shift in how we develop software. They are Software 2.0......... The “classical stack” of Software 1.0 is what we’re all familiar with — it is written in languages such as Python, C++, etc. It consists of explicit instructions to the computer written by a programmer. By writing each line of code, the programmer identifies a specific point in program space with some desirable behavior. .......... In contrast, Software 2.0 is written in much more abstract, human unfriendly language, such as the weights of a neural network. No human is involved in writing this code because there are a lot of weights (typical networks might have millions), and coding directly in weights is kind of hard (I tried). .........
Software (1.0) is eating the world, and now AI (Software 2.0) is eating software.
.My parents were visiting me once and as I was leaving for work I saw my mom sitting on the couch in the living room just looking forward. I’m like “mom what are you doing?”, “sitting”, she shrugged. Like not reading, listening, planning, or even meditating. Mind blown.
— Andrej Karpathy (@karpathy) January 3, 2019
1 hour and 5 diagrams later I optimized 100 lines of code that ran in 13 seconds to 20 lines of heavily vectorized code that runs in 0.02 seconds, and this might just be the best day of my life, so far.
— Andrej Karpathy (@karpathy) April 18, 2018
After 7pm?
— Andrej Karpathy (@karpathy) April 18, 2018
Coworker on RL research: "We were supposed to make AI do all the work and we play games but we do all the work and the AI is playing games!"
— Andrej Karpathy (@karpathy) October 7, 2016
Jeff Dean: "I like your ConvNets in Javascript". Me: "Thank you. I like your map reduce."
— Andrej Karpathy (@karpathy) December 11, 2014
Still?
— Paramendra Kumar Bhagat (@paramendra) March 25, 2023
While playing around with hooking up GPT-4 to the Internet, I asked it about myself… and had an absolute WTF moment before realizing that I wrote a very special secret message to Bing when Sydney came out and then forgot all about it. Indirect prompt injection is gonna be WILD pic.twitter.com/5Rh1RdMdcV
— Arvind Narayanan (@random_walker) March 18, 2023
Best ChatGPT prompt so far 😂 https://t.co/MNWOAJ8XXZ
— Andrej Karpathy (@karpathy) December 2, 2022
Neural Networks: Zero to Hero
No comments:
Post a Comment