mirror of https://github.com/HackTricks-wiki/hacktricks.git
synced 2025-10-10 18:36:50 +00:00

commit 21219807a8 ("d")
parent c4a1fd3836

@@ -298,4 +298,3 @@ During the backward pass:
- **Accuracy:** Provides exact derivatives up to machine precision.
- **Ease of Use:** Eliminates manual computation of derivatives.
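
The two bullets above come from a section on automatic differentiation. As a minimal sketch of the point, assuming PyTorch's autograd (which the surrounding pages use), the backward pass recovers the exact analytic derivative:

```python
import torch

# Autograd records operations on tensors with requires_grad=True and,
# during the backward pass, applies the chain rule to produce exact
# derivatives (up to floating-point precision), with no manual calculus.
x = torch.tensor(2.0, requires_grad=True)
y = x**3 + 2 * x           # y = x^3 + 2x

y.backward()               # backward pass: computes dy/dx

print(x.grad)              # tensor(14.) matches 3*x^2 + 2 = 14 at x = 2
```
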
@@ -96,4 +96,3 @@ print(token_ids[:50])
- [https://www.manning.com/books/build-a-large-language-model-from-scratch](https://www.manning.com/books/build-a-large-language-model-from-scratch)
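
The `print(token_ids[:50])` context line comes from the tokenization step. A hedged sketch of how such ids are produced, assuming the GPT-2 BPE encoding from `tiktoken` (the sample text here is illustrative):

```python
import tiktoken

# Encode raw text into BPE token ids with the GPT-2 vocabulary; the
# original page prints only the first 50 ids of a longer text.
tokenizer = tiktoken.get_encoding("gpt2")

raw_text = "Hello, do you like tea? In the sunlit terraces of someunknownPlace."
token_ids = tokenizer.encode(raw_text, allowed_special={"<|endoftext|>"})

print(token_ids[:50])  # e.g. [15496, 11, 466, 345, 588, 8887, 30, ...]
```
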
@@ -238,4 +238,3 @@ tensor([[ 367, 2885, 1464, 1807],
- [https://www.manning.com/books/build-a-large-language-model-from-scratch](https://www.manning.com/books/build-a-large-language-model-from-scratch)
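
The `tensor([[ 367, 2885, 1464, 1807], ...` context is dataloader output: batches of token-id windows paired with targets shifted one position right. A small sketch of that sliding-window batching, using stand-in ids instead of real BPE output:

```python
import torch

# Build (input, target) pairs: each input row is max_length token ids,
# and the target row is the same window shifted one position forward.
token_ids = list(range(100))      # stand-in for real BPE token ids
max_length, stride = 4, 4

inputs, targets = [], []
for i in range(0, len(token_ids) - max_length, stride):
    inputs.append(token_ids[i : i + max_length])
    targets.append(token_ids[i + 1 : i + max_length + 1])

x = torch.tensor(inputs)
y = torch.tensor(targets)
print(x.shape, y.shape)  # torch.Size([24, 4]) torch.Size([24, 4])
```
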
@@ -216,4 +216,3 @@ print(input_embeddings.shape) # torch.Size([8, 4, 256])
- [https://www.manning.com/books/build-a-large-language-model-from-scratch](https://www.manning.com/books/build-a-large-language-model-from-scratch)
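
The `torch.Size([8, 4, 256])` shape is the sum of token and positional embeddings. A sketch reproducing it, with the batch size and dimensions taken from the printed shape and the vocabulary size assumed to be GPT-2's 50257:

```python
import torch

# A batch of 8 sequences of 4 token ids, an embedding dimension of 256,
# and absolute positional embeddings added on top of the token embeddings.
vocab_size, context_length, emb_dim = 50257, 4, 256

token_embedding = torch.nn.Embedding(vocab_size, emb_dim)
pos_embedding = torch.nn.Embedding(context_length, emb_dim)

batch = torch.randint(0, vocab_size, (8, 4))        # 8 sequences, 4 tokens
tok = token_embedding(batch)                        # [8, 4, 256]
pos = pos_embedding(torch.arange(context_length))   # [4, 256], broadcast

input_embeddings = tok + pos
print(input_embeddings.shape)                       # torch.Size([8, 4, 256])
```
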
@@ -429,4 +429,3 @@ For another compact and efficient implementation you could use the [`torch.nn.Mu
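
As a hedged usage sketch of the `torch.nn.MultiheadAttention` module the hunk refers to (the hyperparameters here are illustrative, not the page's):

```python
import torch

# Self-attention via nn.MultiheadAttention (batch_first=True means inputs
# are [batch, seq, emb]); a boolean causal mask hides future positions,
# mirroring the masked attention typically built by hand first.
emb_dim, num_heads, seq_len = 256, 4, 6
mha = torch.nn.MultiheadAttention(emb_dim, num_heads, batch_first=True)

x = torch.randn(2, seq_len, emb_dim)                # [batch, seq, emb]
causal = torch.triu(torch.ones(seq_len, seq_len, dtype=torch.bool), diagonal=1)

out, weights = mha(x, x, x, attn_mask=causal)       # True entries are masked
print(out.shape)                                    # torch.Size([2, 6, 256])
```
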
@@ -699,4 +699,3 @@ print("Output length:", len(out[0]))
- [https://www.manning.com/books/build-a-large-language-model-from-scratch](https://www.manning.com/books/build-a-large-language-model-from-scratch)
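
`len(out[0])` counts the tokens in the generated sequence. A sketch of the greedy decoding loop behind such a check, with a stub standing in for the trained model: each step appends exactly one token id, so the output length is the prompt length plus `max_new_tokens`.

```python
import torch

def generate(model, idx, max_new_tokens, context_size):
    # `model` is any module returning logits of shape [batch, seq, vocab].
    for _ in range(max_new_tokens):
        idx_cond = idx[:, -context_size:]           # crop to context window
        with torch.no_grad():
            logits = model(idx_cond)
        next_id = logits[:, -1, :].argmax(dim=-1, keepdim=True)  # greedy pick
        idx = torch.cat([idx, next_id], dim=1)      # append one token
    return idx

model = lambda x: torch.randn(x.shape[0], x.shape[1], 50257)  # stub logits
out = generate(model, torch.tensor([[15496, 11]]), max_new_tokens=6, context_size=1024)
print("Output length:", len(out[0]))                # 2 prompt tokens + 6 = 8
```
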
@@ -970,4 +970,3 @@ There are 2 quick scripts to load the GPT2 weights locally. For both you can clone t
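
As one hedged way to get the public GPT-2 weights locally (an assumption: the page's own two scripts may fetch the original checkpoints differently), via Hugging Face `transformers`:

```python
# Download and cache the pretrained GPT-2 small weights, then sample a
# short continuation as a smoke test.
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")   # downloads and caches weights

ids = tokenizer("Every effort moves you", return_tensors="pt")
out = model.generate(**ids, max_new_tokens=10)
print(tokenizer.decode(out[0]))
```
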
@@ -62,4 +62,3 @@ def replace_linear_with_lora(model, rank, alpha):
- [https://www.manning.com/books/build-a-large-language-model-from-scratch](https://www.manning.com/books/build-a-large-language-model-from-scratch)
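
The `replace_linear_with_lora(model, rank, alpha)` signature suggests a recursive swap of `nn.Linear` layers for LoRA-wrapped ones. A sketch consistent with that signature (the `alpha / rank` scaling is one common convention, not necessarily the page's exact implementation):

```python
import torch

class LoRALayer(torch.nn.Module):
    # Trainable low-rank update: (alpha/rank) * x @ A @ B, with B zero-init
    # so the wrapped layer starts out behaving exactly like the original.
    def __init__(self, in_dim, out_dim, rank, alpha):
        super().__init__()
        self.A = torch.nn.Parameter(torch.randn(in_dim, rank) * 0.01)
        self.B = torch.nn.Parameter(torch.zeros(rank, out_dim))
        self.alpha, self.rank = alpha, rank

    def forward(self, x):
        return (self.alpha / self.rank) * (x @ self.A @ self.B)

class LinearWithLoRA(torch.nn.Module):
    # Wraps a frozen nn.Linear and adds the trainable low-rank path.
    def __init__(self, linear, rank, alpha):
        super().__init__()
        self.linear = linear
        self.lora = LoRALayer(linear.in_features, linear.out_features, rank, alpha)

    def forward(self, x):
        return self.linear(x) + self.lora(x)

def replace_linear_with_lora(model, rank, alpha):
    # Recursively swap every nn.Linear for a LoRA-wrapped version.
    for name, module in model.named_children():
        if isinstance(module, torch.nn.Linear):
            setattr(model, name, LinearWithLoRA(module, rank, alpha))
        else:
            replace_linear_with_lora(module, rank, alpha)

# Example: wrap all Linear layers of a small model with rank-8 LoRA.
net = torch.nn.Sequential(torch.nn.Linear(16, 32), torch.nn.ReLU(), torch.nn.Linear(32, 4))
replace_linear_with_lora(net, rank=8, alpha=16)
print(net)
```
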
@@ -115,4 +115,3 @@ You can find all the code to fine-tune GPT2 to be a spam classifier in [https://
- [https://www.manning.com/books/build-a-large-language-model-from-scratch](https://www.manning.com/books/build-a-large-language-model-from-scratch)
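
A minimal sketch of the classification fine-tuning idea behind that reference: freeze the pretrained body and train a small 2-class (spam / not spam) head on the last token. The tiny `Sequential` below is a stand-in for the real GPT-2, and `emb_dim=768` assumes the GPT-2 small configuration:

```python
import torch

emb_dim, vocab_size, num_classes = 768, 50257, 2

backbone = torch.nn.Sequential(                 # stand-in for pretrained GPT-2
    torch.nn.Embedding(vocab_size, emb_dim),
    torch.nn.Linear(emb_dim, emb_dim),
)
for p in backbone.parameters():
    p.requires_grad = False                     # freeze pretrained weights

out_head = torch.nn.Linear(emb_dim, num_classes)  # new trainable head

ids = torch.randint(0, vocab_size, (4, 16))     # batch of 4 texts, 16 tokens
hidden = backbone(ids)                          # [4, 16, 768]
logits = out_head(hidden[:, -1, :])             # classify from the last token
print(logits.shape)                             # torch.Size([4, 2])
```
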
@@ -105,4 +105,3 @@ You can find an example of the code to perform this fine tuning in [https://gith
- [https://www.manning.com/books/build-a-large-language-model-from-scratch](https://www.manning.com/books/build-a-large-language-model-from-scratch)
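
A hedged sketch of the Alpaca-style prompt formatting commonly used for this kind of instruction fine-tuning (the referenced code may use a different template; `format_input` and the sample entry are illustrative):

```python
# Turn an (instruction, input, output) record into a single training prompt.
def format_input(entry):
    instruction = (
        "Below is an instruction that describes a task. "
        "Write a response that appropriately completes the request."
        f"\n\n### Instruction:\n{entry['instruction']}"
    )
    input_text = f"\n\n### Input:\n{entry['input']}" if entry["input"] else ""
    return instruction + input_text

entry = {
    "instruction": "Rewrite the sentence in passive voice.",
    "input": "The chef cooked the meal.",
    "output": "The meal was cooked by the chef.",
}
prompt = format_input(entry) + f"\n\n### Response:\n{entry['output']}"
print(prompt)
```
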
|
@ -98,4 +98,3 @@ You should start by reading this post for some basic concepts you should know ab
|
||||
{{#endref}}
|
||||
|
||||
|
||||
|
||||
|