PinnedIntuitive Thinker — Introducing Guided Mental Models to Enhance Small LLM ReasoningAt a high level, Large Language Models (LLMs) function as statistical models that predict the most likely next word(s) in a sequence based…Sep 27, 2024Sep 27, 2024
PinnedVision Language Models for Radiologists: Detecting Regions of Interest in Brain CT ImagesRecent advancements in Vision Language Models have significantly propelled the realization of Smart Visual Assistants for radiologist image…Sep 7, 2024Sep 7, 2024
PinnedAgentic Workflow ImplementationsWhen building AI applications involving multiple tasks implemented by different components (chains or pipelines), complexities can quickly…Aug 2, 2024Aug 2, 2024
PinnedAutomatic Knowledge RAG with R2RThis is part two of Demystify Knowledge RAG frameworks. In our previous writing, we explored Knowledge RAG concepts, Microsoft GraphRAG…Jul 16, 20244Jul 16, 20244
PinnedBuilding High Performance Computing Clusters (GPU) for AI at ScaleThe use of GPU accelerated machine learning workloads. Multi GPUs and Multi-Nodes enable distributed training at scale. Supercomputers rely…Jul 25, 2024Jul 25, 2024
Multi-GPU Training for Llama 3.2 using DeepSpeed and Redundancy Optimizer (ZeRO)For inference tasks, it’s preferable to load entire model onto one GPU, containing all necessary parameters, to carry out these tasks…Oct 1, 20241Oct 1, 20241
PaliGemma: Receipt & Invoice JSON v2In previous work, I created multiple experimental Large Language Model (LLM) architectures to convert receipt images into JSON or XML…Sep 7, 2024Sep 7, 2024
The Rise of CPU Inference: Powerful and Affordable LLMsGPUs are the default choice for LLM inference due to their speed advantage, but I believe we’re approaching a wall in terms of cost and…Aug 10, 2024Aug 10, 2024
Invest in Strategies to Save LLM CostAs we all know today, not all language models (LLMs) are created equal; some models are more resource-intensive than others. Factors such…Aug 6, 2024Aug 6, 2024
Improving Software Reliability and Security through Memory-Safe LanguagesMemory safety issues refer to vulnerabilities that arise from incorrect or unexpected behavior when dealing with dynamic memory management…Jul 23, 2024Jul 23, 2024