-
Can I get a job studying this?
-
Are there any plans to go through this in a video format similar to NN Zero to Hero?
-
Thanks for llm.c. Could you add more comments to the code to explain the details for AI beginners?
-
My generated text during training:

<|endoftext|>I was so upright that I would have never heard you had any talk. I have heard you sometimes datts you, Eats could or should be choked, crows and <|endoftext|>Second Servingman:

Will we get a prompt to ask GPT-2 questions and get answers?
-
Hi karpathy,
-
If anyone wants to play around with this on an Amazon AWS cloud machine, here's a newbie-friendly article that shows how to set that up.
-
I am trying this on an x86 Linux system and running into a weird issue. I see this: On debugging, I found that mean_loss is sometimes NaN, but making the backward pass conditional on a NaN check (using != ) still ends up with this problem. My feeling is that this has something to do with the loader, but I'm not 100% sure. Vendor ID: GenuineIntel
-
Thank you for the llm.c project. One way to learn how llm.c works is by rewriting the code. This is why I rewrote llm.c into a simple command line utility that runs GPT-2 inference. The link is: https://github.com/jtscm/iim.c/
-
Still got some work to do, but I've got an initial implementation in Go: https://github.com/joshcarp/llm.go
-
Thanks for your great project. Could you please share how you debug and profile the code? That will be helpful. Thanks again.
-
I would like to see an implementation where you start with a small transformer and teach it the ABCs, then grow it with more layers and feed it more complex information until it gets big, so that you can then teach it new tasks with only a small set of data.
-
Can someone recommend a book or course where I can start my journey of understanding LLMs from scratch?
-
The current build on master fails due to C++17 features. I'm not sure whether it's my system or whether others face the same issue. Setting nvcc to C++17 fixes it for me.
-
🔥 llm.c 🔥
Turning on the discussions feature as a place for people to ask, share and engage, without having to create Issues.