Convert a Regular LLM Model into a Full-Fledged DeepSeek R1-Like Reasoning Model
By now, we have all heard about the innovative approach of the open-source model DeepSeek-R1 in training reasoning models that achieve…
Feb 12
Intuitive Thinker — Introducing Guided Mental Models to Enhance Small LLM Reasoning
At a high level, Large Language Models (LLMs) function as statistical models that predict the most likely next word(s) in a sequence based…
Sep 27, 2024
Vision Language Models for Radiologists: Detecting Regions of Interest in Brain CT Images
Recent advancements in Vision Language Models have significantly propelled the realization of Smart Visual Assistants for radiologist image…
Sep 7, 2024
Agentic Workflow Implementations
When building AI applications involving multiple tasks implemented by different components (chains or pipelines), complexities can quickly…
Aug 2, 2024
Automatic Knowledge RAG with R2R
This is part two of Demystify Knowledge RAG frameworks. In our previous writing, we explored Knowledge RAG concepts, Microsoft GraphRAG…
Jul 16, 2024
Multi-GPU Training for Llama 3.2 using DeepSpeed and Redundancy Optimizer (ZeRO)
For inference tasks, it’s preferable to load the entire model onto one GPU, containing all necessary parameters, to carry out these tasks…
Oct 1, 2024
PaliGemma: Receipt & Invoice JSON v2
In previous work, I created multiple experimental Large Language Model (LLM) architectures to convert receipt images into JSON or XML…
Sep 7, 2024
The Rise of CPU Inference: Powerful and Affordable LLMs
GPUs are the default choice for LLM inference due to their speed advantage, but I believe we’re approaching a wall in terms of cost and…
Aug 10, 2024
Invest in Strategies to Save LLM Cost
As we all know today, not all large language models (LLMs) are created equal; some models are more resource-intensive than others. Factors such…
Aug 6, 2024
Building High Performance Computing Clusters (GPU) for AI at Scale
The use of GPU-accelerated machine learning workloads. Multiple GPUs and multiple nodes enable distributed training at scale. Supercomputers rely…
Jul 25, 2024
Improving Software Reliability and Security through Memory-Safe Languages
Memory safety issues refer to vulnerabilities that arise from incorrect or unexpected behavior when dealing with dynamic memory management…
Jul 23, 2024
NASA Bidara prompt engineering techniques
ChatGPT and similar LLMs that use natural language as an interface work well for many use cases. However, to achieve a desired…
Jul 22, 2024
Apply Built-in Knowledge Graph Algorithms in Neo4J + Memgraph
Once we have data points saved in a Knowledge Graph, we are ready to explore the use of graph algorithms to uncover hidden patterns…
Jul 20, 2024
Deep Dive: Transforming Text into Knowledge Graphs with LLM
A benefit of knowledge graphs is that they put data in context via linking and semantic metadata, which express relationships between data. Knowledge…
Jul 18, 2024
Demystify Knowledge RAG frameworks
In this article, let’s dive into knowledge graph concepts and how they solve the problem of finding a needle in a haystack. GraphRAG…
Jul 15, 2024
LLM Model Merging
In this article, I will share my views on innovative model blending techniques and algorithms, and discuss the results of my…
Mar 14, 2024
State of “Function Calling” in LLM
Last year, OpenAI introduced a new function-calling capability in the Chat Completions API, which you can find here. With this feature, you…
Mar 8, 2024
Polylith Architecture
I have recently begun experimenting with Poly Architecture for the development of Python projects and have found it to be an excellent…
Mar 7, 2024
Creating a tiny vector store from scratch
I sometimes find that constructing a small version of something from scratch can be a helpful way to learn new concepts. In this article…
Mar 7, 2024
Part-3: Pavai.Workspace (productivity focus AI workspace)
You may have heard that creating an organized workspace can help minimize distractions and improve focus on the task at hand. In this…
Mar 6, 2024