Skip to content

Commit e553a41

Browse files
updated
1 parent 5a7a7a7 commit e553a41

36 files changed

+1836
-1113
lines changed
5.78 MB
Loading
5.18 MB
Loading
128 KB
Loading

Images/Rag.png

6.1 MB
Loading

Images/mongodbinternal.png

5.68 MB
Loading

Images/unnamed-(1).png

6.49 MB
Loading

Images/unnamed-(2).png

6.05 MB
Loading

Images/unnamed-(3).png

6.59 MB
Loading

index.xml

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -6,11 +6,11 @@
66
<description>Last 10 notes on 🧠 Second Brain</description>
77
<generator>Quartz -- quartz.jzhao.xyz</generator>
88
<item>
9-
<title>voice agent deployment</title>
10-
<link>https://programmerraja.github.io/notes/2025/Generative-AI/voice-agent-deployment</link>
11-
<guid>https://programmerraja.github.io/notes/2025/Generative-AI/voice-agent-deployment</guid>
12-
<description>This document provides an organized comparison of GPU architectures, deployment platforms, LLMs, and speech models (TTS/STT) relevant for deploying a voice agent ...</description>
13-
<pubDate>Mon, 10 Nov 2025 00:57:37 GMT</pubDate>
9+
<title>How to pick the models</title>
10+
<link>https://programmerraja.github.io/notes/2025/Generative-AI/How-to-pick-the-models</link>
11+
<guid>https://programmerraja.github.io/notes/2025/Generative-AI/How-to-pick-the-models</guid>
12+
<description> Thesis / motivation Picking the newest/biggest LLM is not always optimal. Different models have distinct tradeoffs (code, math, multimodal, deployability, cost, licensing).</description>
13+
<pubDate>Tue, 02 Dec 2025 10:35:47 GMT</pubDate>
1414
</item><item>
1515
<title>question</title>
1616
<link>https://programmerraja.github.io/notes/Microservice/question</link>
@@ -54,11 +54,11 @@
5454
<description>As regular readers of my blog may know, our primary technology stack is the MERN stack MongoDB, Express, React, and Node.js. On the frontend, we use React with TypeScript; on the backend, Node.js with TypeScript, and MongoDB serves as our database.</description>
5555
<pubDate>Tue, 05 Aug 2025 04:53:12 GMT</pubDate>
5656
</item><item>
57-
<title>RAG</title>
58-
<link>https://programmerraja.github.io/notes/2025/Generative-AI/RAG</link>
59-
<guid>https://programmerraja.github.io/notes/2025/Generative-AI/RAG</guid>
60-
<description>RAG RAG stands for Retrieval Augmented Generation, which is a technique to enhance Large Language Models (LLMs) by connecting them to external knowledge bases or datasets ...</description>
61-
<pubDate>Sat, 02 Aug 2025 00:23:48 GMT</pubDate>
57+
<title>Model Quantization</title>
58+
<link>https://programmerraja.github.io/notes/2025/Deep-learning/Model-Quantization</link>
59+
<guid>https://programmerraja.github.io/notes/2025/Deep-learning/Model-Quantization</guid>
60+
<description>Model Compression The process of making a model smaller is called model compression, and the process to make it do inference faster is called inference optimization ...</description>
61+
<pubDate>Wed, 16 Jul 2025 02:56:34 GMT</pubDate>
6262
</item><item>
6363
<title>Context Engineering</title>
6464
<link>https://programmerraja.github.io/notes/2025/Generative-AI/Context-Engineering-and-Memory-in-LLM</link>

notes/2024/Cyptography.html

Lines changed: 104 additions & 0 deletions
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)