PinnedAgentic-Powered Desktop AutomationMany enterprise business processes can be fully automated from beginning to end. However, there are still workflows that require human…Feb 27Feb 27
PinnedConvert a Regular LLM Model into a Full-Fledged DeepSeek R1-Like Reasoning ModelBy now, we all hear about the innovative approach of the open-source model DeepSeek-R1 in training reasoning models that achieve…Feb 12Feb 12
PinnedAgentic Workflow ImplementationsWhen building AI applications involving multiple tasks implemented by different components (chains or pipelines), complexities can quickly…Aug 2, 2024Aug 2, 2024
PinnedAutomatic Knowledge RAG with R2RThis is part two of Demystify Knowledge RAG frameworks. In our previous writing, we explored Knowledge RAG concepts, Microsoft GraphRAG…Jul 16, 20244Jul 16, 20244
Multi-GPU Training for Llama 3.2 using DeepSpeed and Redundancy Optimizer (ZeRO)For inference tasks, it’s preferable to load entire model onto one GPU, containing all necessary parameters, to carry out these tasks…Oct 1, 20241Oct 1, 20241
Intuitive Thinker — Introducing Guided Mental Models to Enhance Small LLM ReasoningAt a high level, Large Language Models (LLMs) function as statistical models that predict the most likely next word(s) in a sequence based…Sep 27, 2024Sep 27, 2024
PaliGemma: Receipt & Invoice JSON v2In previous work, I created multiple experimental Large Language Model (LLM) architectures to convert receipt images into JSON or XML…Sep 7, 2024Sep 7, 2024
Vision Language Models for Radiologists: Detecting Regions of Interest in Brain CT ImagesRecent advancements in Vision Language Models have significantly propelled the realization of Smart Visual Assistants for radiologist image…Sep 7, 2024Sep 7, 2024
The Rise of CPU Inference: Powerful and Affordable LLMsGPUs are the default choice for LLM inference due to their speed advantage, but I believe we’re approaching a wall in terms of cost and…Aug 10, 2024Aug 10, 2024