Netizen: Andrej Karpathy

Friday, March 24, 2023

Andrej Karpathy

The vibes when I joined AI in ~2008:
- workshops w 50 ppl musing on whether deep learning will ever work
- papers w cute toy problems
- fun poster sessions
- this experiment I ran in MATLAB
- high-level panels on paths to AI
- neuroscience guest lectures
Today is *not* the same.
— Andrej Karpathy (@karpathy) March 23, 2023

GPT is a new kind of computer architecture that runs on text. Yes it can talk to us, but also to much of our existing software infrastructure. First via apps on top of APIs, now inside ChatGPT via plugins.
What a time right now...https://t.co/HjeUCv3XE7
— Andrej Karpathy (@karpathy) March 23, 2023

https://www.youtube.com/@AndrejKarpathy
https://twitter.com/karpathy
https://karpathy.ai
https://karpathy.medium.com
http://karpathy.github.io
https://github.com/karpathy
https://karpathy.ai/tweets.html

Software 2.0 Neural networks are not just another classifier, they represent the beginning of a fundamental shift in how we develop software. They are Software 2.0......... The “classical stack” of Software 1.0 is what we’re all familiar with — it is written in languages such as Python, C++, etc. It consists of explicit instructions to the computer written by a programmer. By writing each line of code, the programmer identifies a specific point in program space with some desirable behavior. .......... In contrast, Software 2.0 is written in much more abstract, human unfriendly language, such as the weights of a neural network. No human is involved in writing this code because there are a lot of weights (typical networks might have millions), and coding directly in weights is kind of hard (I tried). .........
Software (1.0) is eating the world, and now AI (Software 2.0) is eating software.
.

My parents were visiting me once and as I was leaving for work I saw my mom sitting on the couch in the living room just looking forward. I’m like “mom what are you doing?”, “sitting”, she shrugged. Like not reading, listening, planning, or even meditating. Mind blown.
— Andrej Karpathy (@karpathy) January 3, 2019

1 hour and 5 diagrams later I optimized 100 lines of code that ran in 13 seconds to 20 lines of heavily vectorized code that runs in 0.02 seconds, and this might just be the best day of my life, so far.
— Andrej Karpathy (@karpathy) April 18, 2018

After 7pm?
— Andrej Karpathy (@karpathy) April 18, 2018

Coworker on RL research: "We were supposed to make AI do all the work and we play games but we do all the work and the AI is playing games!"
— Andrej Karpathy (@karpathy) October 7, 2016

Jeff Dean: "I like your ConvNets in Javascript". Me: "Thank you. I like your map reduce."
— Andrej Karpathy (@karpathy) December 11, 2014

Still?
— Paramendra Kumar Bhagat (@paramendra) March 25, 2023

While playing around with hooking up GPT-4 to the Internet, I asked it about myself… and had an absolute WTF moment before realizing that I wrote a very special secret message to Bing when Sydney came out and then forgot all about it. Indirect prompt injection is gonna be WILD pic.twitter.com/5Rh1RdMdcV
— Arvind Narayanan (@random_walker) March 18, 2023

Best ChatGPT prompt so far 😂 https://t.co/MNWOAJ8XXZ
— Andrej Karpathy (@karpathy) December 2, 2022

Neural Networks: Zero to Hero

Pages

Friday, March 24, 2023

Andrej Karpathy

Software (1.0) is eating the world, and now AI (Software 2.0) is eating software.

No comments: