LangChain Archives - gettectonic.com

15May

Designing AI Agents the Right Way

Designing AI agents effectively involves a structured approach, starting with defining clear objectives and aligning them with business needs. It also requires careful data collection and preparation, selecting the right machine learning models, and crafting a robust architecture. Finally, building in feedback loops and prioritizing continuous monitoring and improvement are crucial for success. Here’s a more detailed breakdown: 1. Define Objectives and Purpose: 2. Data Collection and Preparation: 3. Choose the Right Models and Tools: 4. Design the Agent Architecture: 5. Training and Refinement: 6. Testing and Validation: 7. Deployment, Monitoring, and Iteration: 8. Key Considerations: By following these principles, you can design AI agents that are not only effective but also robust, scalable, and aligned with your business objectives. Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

May 15, 2025in AI Tools, Data0 Comments

20Apr

Architecture for Enterprise-Grade Agentic AI Systems

LangGraph: The Architecture for Enterprise-Grade Agentic AI Systems Modern enterprises need AI that doesn’t just answer questions—but thinks, plans, and acts autonomously. LangGraph provides the framework to build these next-generation agentic systems capable of: ✅ Multi-step reasoning across complex workflows✅ Dynamic decision-making with real-time tool selection✅ Stateful execution that maintains context across operations✅ Seamless integration with enterprise knowledge bases and APIs 1. LangGraph’s Graph-Based Architecture At its core, LangGraph models AI workflows as Directed Acyclic Graphs (DAGs): This structure enables:✔ Conditional branching (different paths based on data)✔ Parallel processing where possible✔ Guaranteed completion (no infinite loops) Example Use Case:A customer service agent that: 2. Multi-Hop Knowledge Retrieval Enterprise queries often require connecting information across multiple sources. LangGraph treats this as a graph traversal problem: python Copy # Neo4j integration for structured knowledge from langchain.graphs import Neo4jGraph graph = Neo4jGraph(url=”bolt://localhost:7687″, username=”neo4j”, password=”password”) query = “”” MATCH (doc:Document)-[:REFERENCES]->(policy:Policy) WHERE policy.name = ‘GDPR’ RETURN doc.title, doc.url “”” results = graph.query(query) # → Feeds into LangGraph nodes Hybrid Approach: 3. Building Autonomous Agents LangGraph + LangChain agents create systems that: python Copy from langchain.agents import initialize_agent, Tool from langchain.chat_models import ChatOpenAI # Define tools search_tool = Tool( name=”ProductSearch”, func=search_product_db, description=”Searches internal product catalog” ) # Initialize agent agent = initialize_agent( tools=[search_tool], llm=ChatOpenAI(model=”gpt-4″), agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION ) # Execute response = agent.run(“Find compatible accessories for Model X-42”) 4. Full Implementation Example Enterprise Document Processing System: python Copy from langgraph.graph import StateGraph from langchain.embeddings import OpenAIEmbeddings from langchain.vectorstores import Pinecone # 1. Define shared state class DocProcessingState(BaseModel): query: str retrieved_docs: list = [] analysis: str = “” actions: list = [] # 2. Create nodes def retrieve(state): vectorstore = Pinecone.from_existing_index(“docs”, OpenAIEmbeddings()) state.retrieved_docs = vectorstore.similarity_search(state.query) return state def analyze(state): # LLM analysis of documents state.analysis = llm(f”Summarize key points from: {state.retrieved_docs}”) return state # 3. Build workflow workflow = StateGraph(DocProcessingState) workflow.add_node(“retrieve”, retrieve) workflow.add_node(“analyze”, analyze) workflow.add_edge(“retrieve”, “analyze”) workflow.add_edge(“analyze”, END) # 4. Execute agent = workflow.compile() result = agent.invoke({“query”: “2025 compliance changes”}) Why This Matters for Enterprises The Future:LangGraph enables AI systems that don’t just assist workers—but autonomously execute complete business processes while adhering to organizational rules and structures. “This isn’t chatbot AI—it’s digital workforce AI.” Next Steps: Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

April 20, 2025in Agentic AI, Data, Enterprise

03Mar

Building Intelligent Order Management Workflows

Mastering LangGraph: Building Intelligent Order Management Workflows Introduction In this comprehensive guide, we will explore LangGraph—a robust library designed for orchestrating complex, multi-step workflows with Large Language Models (LLMs). We will apply it to a practical e-commerce use case: determining whether to place or cancel an order based on a user’s query. By the end of this tutorial, you will understand how to: We will walk through each step in detail, making it accessible to beginners and useful for those seeking to develop dynamic, intelligent workflows using LLMs. A dataset link is also provided for hands-on experimentation. Table of Contents 1. What Is LangGraph? LangGraph is a library that brings a graph-based approach to LangChain workflows. Traditional pipelines follow a linear progression, but real-world tasks often involve branching logic, loops (e.g., retrying failed steps), or human intervention. Key Features: 2. The Problem Statement: Order Management The workflow needs to handle two types of user queries: Since these operations require decision-making, we will use LangGraph to implement a structured, conditional workflow: 3. Environment Setup and Imports Explanation of Key Imports: 4. Data Loading and State Definition Load Inventory and Customer Data Define the Workflow State 5. Creating Tools and Integrating LLMs Define the Order Cancellation Tool Initialize LLM and Bind Tools 6. Defining Workflow Nodes Query Categorization Check Inventory Compute Shipping Costs Process Payment 7. Constructing the Workflow Graph 8. Visualizing and Testing the Workflow Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

March 3, 2025in Data, Salesforce

03Feb

Pydantic AI

The evaluation of agentic applications is most effective when integrated into the development process, rather than being an afterthought. For this to succeed, developers must be able to mock both internal and external dependencies of the agent being built. PydanticAI introduces a groundbreaking framework that supports dependency injection from the start, enabling developers to build agentic applications with an evaluation-driven approach. An architectural parallel can be drawn to the historic Krakow Cloth Hall, a structure refined over centuries through evaluation-driven enhancements. Similarly, PydanticAI allows developers to iteratively address challenges during development, ensuring optimal outcomes. Challenges in Developing GenAI Applications Developers of LLM-based applications face recurring challenges, which become significant during production deployment: To address non-determinism, developers must adopt evaluation-driven development, a method akin to test-driven development. This approach focuses on designing software with guardrails, real-time monitoring, and human oversight, accommodating systems that are only x% correct. The Promise of PydanticAI PydanticAI stands out as an agent framework that supports dependency injection, model-agnostic workflows, and evaluation-driven development. Its design is Pythonic and simplifies testing by allowing the injection of mock dependencies. For instance, in contrast to frameworks like Langchain, where dependency injection is cumbersome, PydanticAI streamlines this process, making the workflows more readable and efficient. Building an Evaluation-Driven Application with PydanticAI Example Use Case: Evaluating Mountain Data By employing tools like Wikipedia as a data source, the agent can fetch accurate mountain heights during production. For testing, developers can inject mocked responses, ensuring predictable outputs and faster development cycles. Advancing Agentic Applications with PydanticAI PydanticAI provides the building blocks for creating scalable, evaluation-driven GenAI applications. Its support for dependency injection, structured outputs, and model-agnostic workflows addresses core challenges, empowering developers to create robust and adaptive LLM-powered systems. This paradigm shift ensures that evaluation is seamlessly embedded into the development lifecycle, paving the way for more reliable and efficient agentic applications. Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

February 3, 2025in Salesforce

14Jan

On Premise Gen AI

In 2025, enterprises transitioning generative AI (GenAI) into production after years of experimentation are increasingly considering on-premises deployment as a cost-effective alternative to the cloud. Since OpenAI ignited the AI revolution in late 2022, organizations have tested large language models powering GenAI services on platforms like AWS, Microsoft Azure, and Google Cloud. These experiments demonstrated GenAI’s potential to enhance business operations while exposing the substantial costs of cloud usage. To avoid difficult conversations with CFOs about escalating cloud expenses, CIOs are exploring on-premises AI as a financially viable solution. Advances in software from startups and packaged infrastructure from vendors such as HPE and Dell are making private data centers an attractive option for managing costs. A survey conducted by Menlo Ventures in late 2024 found that 47% of U.S. enterprises with at least 50 employees were developing GenAI solutions in-house. Similarly, Informa TechTarget’s Enterprise Strategy Group reported a rise in enterprises considering on-premises and public cloud equally for new applications—from 37% in 2024 to 45% in 2025. This shift is reflected in hardware sales. HPE reported a 16% revenue increase in AI systems, reaching $1.5 billion in Q4 2024. During the same period, Dell recorded a record .6 billion in AI server orders, with its sales pipeline expanding by over 50% across various customer segments. “Customers are seeking diverse AI-capable server solutions,” noted David Schmidt, senior director of Dell’s PowerEdge server line. While heavily regulated industries have traditionally relied on on-premises systems to ensure data privacy and security, broader adoption is now driven by the need for cost control. Fortune 2000 companies are leading this trend, opting for private infrastructure over the cloud due to more predictable expenses. “It’s not unusual to see cloud bills exceeding 0,000 or even million per month,” said John Annand, an analyst at Info-Tech Research Group. Global manufacturing giant Jabil primarily uses AWS for GenAI development but emphasizes ongoing cost management. “Does moving to the cloud provide a cost advantage? Sometimes it doesn’t,” said CIO May Yap. Jabil employs a continuous cloud financial optimization process to maximize efficiency. On-Premises AI: Technology and Trends Enterprises now have alternatives to cloud infrastructure, including as-a-service solutions like Dell APEX and HPE GreenLake, which offer flexible pay-per-use pricing for AI servers, storage, and networking tailored for private data centers or colocation facilities. “The high cost of cloud drives organizations to seek more predictable expenses,” said Tiffany Osias, vice president of global colocation services at Equinix. Walmart exemplifies in-house AI development, creating tools like a document summarization app for its benefits help desk and an AI assistant for corporate employees. Startups are also enabling enterprises to build AI applications with turnkey solutions. “About 80% of GenAI requirements can now be addressed with push-button solutions from startups,” said Tim Tully, partner at Menlo Ventures. Companies like Ragie (RAG-as-a-service) and Lamatic.ai (GenAI platform-as-a-service) are driving this innovation. Others, like Squid AI, integrate custom AI agents with existing enterprise infrastructure. Open-source frameworks like LangChain further empower on-premises development, offering tools for creating chatbots, virtual assistants, and intelligent search systems. Its extension, LangGraph, adds functionality for building multi-agent workflows. As enterprises develop AI applications internally, consulting services will play a pivotal role. “Companies offering guidance on effective AI tool usage and aligning them with business outcomes will thrive,” Annand said. This evolution in AI deployment highlights the growing importance of balancing technological innovation with financial sustainability. Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

January 14, 2025in Data, Enterprise, Generative AI, Google, Technology

Autonomy, Architecture, and Action

Redefining AI Agents: Autonomy, Architecture, and Action AI agents are reshaping how technology interacts with us and executes tasks. Their mission? To reason, plan, and act independently—following instructions, making autonomous decisions, and completing actions, often without user involvement. These agents adapt to new information, adjust in real time, and pursue their objectives autonomously. This evolution in agentic AI is revolutionizing how goals are accomplished, ushering in a future of semi-autonomous technology. At their foundation, AI agents rely on one or more large language models (LLMs). However, designing agents is far more intricate than building chatbots or generative assistants. While traditional AI applications often depend on user-driven inputs—such as prompt engineering or active supervision—agents operate autonomously. Core Principles of Agentic AI Architectures To enable autonomous functionality, agentic AI systems must incorporate: Essential Infrastructure for AI Agents Building and deploying agentic AI systems requires robust software infrastructure that supports: Agent Development Made Easier with Langflow and Astra DB Langflow simplifies the development of agentic applications with its visual IDE. It integrates with Astra DB, which combines vector and graph capabilities for ultra-low latency data access. This synergy accelerates development by enabling: Transforming Autonomy into Action Agentic AI is fundamentally changing how tasks are executed by empowering systems to act autonomously. By leveraging platforms like Astra DB and Langflow, organizations can simplify agent design and deploy scalable, effective AI applications. Start building the next generation of AI-powered autonomy today. Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

January 11, 2025in Agentic AI, Data, Enterprise, Technology

09Jan

Generative AI Tools

Generative AI Tools: A Comprehensive Overview of Emerging Capabilities The widespread adoption of generative AI services like ChatGPT has sparked immense interest in leveraging these tools for practical enterprise applications. Today, nearly every enterprise app integrates generative AI capabilities to enhance functionality and efficiency. A broad range of AI, data science, and machine learning tools now support generative AI use cases. These tools assist in managing the AI lifecycle, governing data, and addressing security and privacy concerns. While such capabilities also aid in traditional AI development, this discussion focuses on tools specifically designed for generative AI. Not all generative AI relies on large language models (LLMs). Emerging techniques generate images, videos, audio, synthetic data, and translations using methods such as generative adversarial networks (GANs), diffusion models, variational autoencoders, and multimodal approaches. Here is an in-depth look at the top categories of generative AI tools, their capabilities, and notable implementations. It’s worth noting that many leading vendors are expanding their offerings to support multiple categories through acquisitions or integrated platforms. Enterprises may want to explore comprehensive platforms when planning their generative AI strategies. 1. Foundation Models and Services Generative AI tools increasingly simplify the development and responsible use of LLMs, initially pioneered through transformer-based approaches by Google researchers in 2017. 2. Cloud Generative AI Platforms Major cloud providers offer generative AI platforms to streamline development and deployment. These include: 3. Use Case Optimization Tools Foundation models often require optimization for specific tasks. Enterprises use tools such as: 4. Quality Assurance and Hallucination Mitigation Hallucination detection tools address the tendency of generative models to produce inaccurate or misleading information. Leading tools include: 5. Prompt Engineering Tools Prompt engineering tools optimize interactions with LLMs and streamline testing for bias, toxicity, and accuracy. Examples include: 6. Data Aggregation Tools Generative AI tools have evolved to handle larger data contexts efficiently: 7. Agentic and Autonomous AI Tools Developers are creating tools to automate interactions across foundation models and services, paving the way for autonomous AI. Notable examples include: 8. Generative AI Cost Optimization Tools These tools aim to balance performance, accuracy, and cost effectively. Martian’s Model Router is an early example, while traditional cloud cost optimization platforms are expected to expand into this area. Generative AI tools are rapidly transforming enterprise applications, with foundational, cloud-based, and domain-specific solutions leading the way. By addressing challenges like accuracy, hallucination, and cost, these tools unlock new potential across industries and use cases, enabling enterprises to stay ahead in the AI-driven landscape. Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

January 9, 2025in AI Tools, Data, Enterprise, Generative AI, Google, Snowflake

Evaluating RAG With Needle in Haystack Test

31Oct

Agentic RAG

Agentic RAG: The Next Evolution of AI-Powered Knowledge Retrieval From RAG to Agentic RAG: A Paradigm Shift in AI Applications While Retrieval-Augmented Generation (RAG) dominated AI advancements in 2023, agentic workflows are now driving the next wave of innovation in 2024. By integrating AI agents into RAG pipelines, developers can build more powerful, adaptive, and intelligent LLM-powered applications. This article explores:✔ What is Agentic RAG?✔ How it works (single-agent vs. multi-agent architectures)✔ Implementation methods (function calling vs. agent frameworks)✔ Enterprise adoption & real-world use cases✔ Benefits & limitations Understanding the Foundations: RAG & AI Agents What is Retrieval-Augmented Generation (RAG)? RAG enhances LLMs by retrieving external knowledge before generating responses, reducing hallucinations and improving accuracy. Traditional (Vanilla) RAG Pipeline: Limitations of Vanilla RAG: ❌ Single knowledge source (no dynamic tool integration).❌ One-shot retrieval (no iterative refinement).❌ No reasoning over retrieved data quality. What Are AI Agents? AI agents are autonomous LLM-driven systems with: The ReAct Framework (Reason + Act) What is Agentic RAG? Agentic RAG embeds AI agents into RAG pipelines, enabling:✅ Multi-source retrieval (databases, APIs, web search).✅ Dynamic query refinement (self-correcting searches).✅ Validation of results (quality checks before generation). How Agentic RAG Works Instead of a static retrieval step, an AI agent orchestrates: Agentic RAG Architectures 1. Single-Agent RAG (Router) 2. Multi-Agent RAG (Orchestrated Workflow) Implementing Agentic RAG Option 1: LLMs with Function Calling Example: Function Calling with Ollama python Copy def ollama_generation_with_tools(query, tools_schema): # LLM decides tool use → executes → refines response … Option 2: Agent Frameworks Why Enterprises Are Adopting Agentic RAG Real-World Use Cases 🔹 Replit’s AI Dev Agent – Helps debug & write code.🔹 Microsoft Copilots – Assist users in real-time tasks.🔹 Customer Support Bots – Multi-step query resolution. Benefits ✔ Higher accuracy (validated retrievals).✔ Dynamic tool integration (APIs, web, databases).✔ Autonomous task handling (reducing manual work). Limitations ⚠ Added latency (LLM reasoning steps).⚠ Unpredictability (agents may fail without safeguards).⚠ Complex debugging (multi-agent coordination). Conclusion: The Future of Agentic RAG Agentic RAG represents a leap beyond traditional RAG, enabling:🚀 Smarter, self-correcting retrieval.🤖 Seamless multi-tool workflows.🔍 Enterprise-grade reliability. As frameworks mature, expect AI agents to become the backbone of next-gen LLM applications—transforming industries from customer service to software development. Ready to build your own Agentic RAG system? Explore frameworks like LangChain, CrewAI, or OpenAI’s function calling to get started. Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

October 31, 2024in Data, Enterprise

28Oct

AI Agents and Tabular Data

Have you ever wondered how AI agents understand tabular data, such as that in CSV or Excel files? Or how a file loaded into a platform like ChatGPT can be instantly understood and processed? This insight explores the creation of a custom AI agent capable of achieving these tasks from scratch. AI Agents and Tabular Data. Context Jen, an AI Engineer at AI United, leads a team developing an AI agent within a 30-day timeline. This agent will generate tailored interactive charts based on uploaded data files, enabling users to better visualize and interpret the data. To achieve this, the team needed to ensure the AI agent could analyze the file’s data context and autonomously recommend the most appropriate chart types. The agent development was broken down into four main steps: Here’s a look at how the team developed the AI system to ingest CSV data and aggregate it into an actionable format. Setup The development began by configuring package installations and defining the language model to be used: pythonCopy code%pip install langchain_openai %pip install langchain_core %pip install langchain_community %pip install langchain_experimental from langchain_openai.chat_models import ChatOpenAI openai_key = os.environ.get(“OPENAI_API”) gpt4o = ChatOpenAI(temperature=0.0, model=”gpt-4o”, openai_api_key=openai_key) Step 1: Context Creation Before generating the code to process raw data, the team created context around the dataset to enhance the AI’s response accuracy. Metadata extraction included: For demonstration, a wine reviews dataset was used, and metadata was extracted as follows: pythonCopy codeimport pandas as pd def extract_metadata(df): metadata = { ‘Number of Columns’: df.shape[1], ‘Schema’: df.columns.tolist(), ‘Data Types’: str(df.dtypes), ‘Sample’: df.head(1).to_dict(orient=”records”) } return metadata df = pd.read_csv(“wine_reviews.csv”) metadata = extract_metadata(df) Step 2: Prompt Augmentation To help the AI model interpret the dataset, prompts were augmented with extracted metadata using a template: pythonCopy codeprompt_template = ”’ Assistant is an AI model that suggests charts to visualize data based on the following metadata. SCHEMA: ——– {schema} DATA TYPES: ——– {data_types} SAMPLE: ——– {sample} ”’.format(schema=metadata[“Schema”], data_types=metadata[“Data Types”], sample=metadata[“Sample”]) gpt40.invoke(prompt_template) Step 3: Simple Agent Code Generation & Execution With the prompt augmented, the model was able to suggest suitable charts. For data transformation, an agentic workflow with a Python REPL tool was used, where the AI generated code for aggregating data and then executed it to provide the necessary structure for plotting. A REPL instance was created to pass data into Python functions, enabling the AI to perform aggregation. pythonCopy codefrom langchain_experimental.utilities import PythonREPL repl = PythonREPL() repl.globals[‘df’] = df from langchain_core.tools import tool @tool def python_repl(code: str): try: result = repl.run(code) except BaseException as e: return f”Failed to execute. Error: {repr(e)}” result_str = f”Successfully executed:n“`pythonn{code}n“`nStdout: {result}” return result_str tools = [python_repl] Step 4: Final Data Aggregation and Charting Finally, the AI suggested the Bar Chart type for plotting the top 10 wineries by average points, and the REPL instance executed the code to transform the data for the chart: pythonCopy code# Code to aggregate and convert data into dictionary format top_wineries = df.groupby(‘winery’)[‘points’].mean().sort_values(ascending=False).head(10) top_wineries_dict = top_wineries.to_dict() print(top_wineries_dict) The aggregated data was output as: jsonCopy code{‘Macauley’: 96.0, ‘Heitz’: 95.5, ‘Bodega Carmen Rodríguez’: 95.5, ‘Maurodos’: 95.0, ‘Blue Farm’: 95.0, ‘Numanthia’: 95.0, ‘Château Lagrézette’: 95.0, ‘Patricia Green Cellars’: 95.0, ‘Ponzi’: 95.0, ‘Muga’: 95.0} With this approach, the AI agent was capable of not only understanding the data in the uploaded file but also generating interactive visualizations, making complex datasets more accessible and insightful. Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

October 28, 2024in Agentforce Platform, Artificial Intelligence, Data, Generative AI, Technology

27Oct

AI Assistants Using LangGraph

In the evolving world of AI, retrieval-augmented generation (RAG) systems have become standard for handling straightforward queries and generating contextually relevant responses. However, as demand grows for more sophisticated AI applications, there is a need for systems that move beyond simple retrieval tasks. Enter AI agents—autonomous entities capable of executing complex, multi-step processes, maintaining state across interactions, and dynamically adapting to new information. LangGraph, a powerful extension of the LangChain library, is designed to help developers build these advanced AI agents, enabling stateful, multi-actor applications with cyclic computation capabilities. AI Assistants Using LangGraph. In this insight, we’ll explore how LangGraph revolutionizes AI development and provide a step-by-step guide to building your own AI agent using an example that computes energy savings for solar panels. This example will demonstrate how LangGraph’s unique features enable the creation of intelligent, adaptable, and practical AI systems. What is LangGraph? LangGraph is an advanced library built on top of LangChain, designed to extend Large Language Model (LLM) applications by introducing cyclic computational capabilities. While LangChain allows for the creation of Directed Acyclic Graphs (DAGs) for linear workflows, LangGraph enhances this by enabling the addition of cycles—essential for developing agent-like behaviors. These cycles allow LLMs to continuously loop through processes, making decisions dynamically based on evolving inputs. LangGraph: Nodes, States, and Edges The core of LangGraph lies in its stateful graph structure: LangGraph redefines AI development by managing the graph structure, state, and coordination, allowing for the creation of sophisticated, multi-actor applications. With automatic state management and precise agent coordination, LangGraph facilitates innovative workflows while minimizing technical complexity. Its flexibility enables the development of high-performance applications, and its scalability ensures robust and reliable systems, even at the enterprise level. Step-by-step Guide Now that we understand LangGraph’s capabilities, let’s dive into a practical example. We’ll build an AI agent that calculates potential energy savings for solar panels based on user input. This agent can function as a lead generation tool on a solar panel seller’s website, providing personalized savings estimates based on key data like monthly electricity costs. This example highlights how LangGraph can automate complex tasks and deliver business value. Step 1: Import Necessary Libraries We start by importing the essential Python libraries and modules for the project. pythonCopy codefrom langchain_core.tools import tool from langchain_community.tools.tavily_search import TavilySearchResults from langchain_core.prompts import ChatPromptTemplate from langchain_core.runnables import Runnable from langchain_aws import ChatBedrock import boto3 from typing import Annotated from typing_extensions import TypedDict from langgraph.graph.message import AnyMessage, add_messages from langchain_core.messages import ToolMessage from langchain_core.runnables import RunnableLambda from langgraph.prebuilt import ToolNode Step 2: Define the Tool for Calculating Solar Savings Next, we define a tool to calculate potential energy savings based on the user’s monthly electricity cost. pythonCopy code@tool def compute_savings(monthly_cost: float) -> float: “”” Tool to compute the potential savings when switching to solar energy based on the user’s monthly electricity cost. Args: monthly_cost (float): The user’s current monthly electricity cost. Returns: dict: A dictionary containing: – ‘number_of_panels’: The estimated number of solar panels required. – ‘installation_cost’: The estimated installation cost. – ‘net_savings_10_years’: The net savings over 10 years after installation costs. “”” def calculate_solar_savings(monthly_cost): cost_per_kWh = 0.28 cost_per_watt = 1.50 sunlight_hours_per_day = 3.5 panel_wattage = 350 system_lifetime_years = 10 monthly_consumption_kWh = monthly_cost / cost_per_kWh daily_energy_production = monthly_consumption_kWh / 30 system_size_kW = daily_energy_production / sunlight_hours_per_day number_of_panels = system_size_kW * 1000 / panel_wattage installation_cost = system_size_kW * 1000 * cost_per_watt annual_savings = monthly_cost * 12 total_savings_10_years = annual_savings * system_lifetime_years net_savings = total_savings_10_years – installation_cost return { “number_of_panels”: round(number_of_panels), “installation_cost”: round(installation_cost, 2), “net_savings_10_years”: round(net_savings, 2) } return calculate_solar_savings(monthly_cost) Step 3: Set Up State Management and Error Handling We define utilities to manage state and handle errors during tool execution. pythonCopy codedef handle_tool_error(state) -> dict: error = state.get(“error”) tool_calls = state[“messages”][-1].tool_calls return { “messages”: [ ToolMessage( content=f”Error: {repr(error)}n please fix your mistakes.”, tool_call_id=tc[“id”], ) for tc in tool_calls ] } def create_tool_node_with_fallback(tools: list) -> dict: return ToolNode(tools).with_fallbacks( [RunnableLambda(handle_tool_error)], exception_key=”error” ) Step 4: Define the State and Assistant Class We create the state management class and the assistant responsible for interacting with users. pythonCopy codeclass State(TypedDict): messages: Annotated[list[AnyMessage], add_messages] class Assistant: def __init__(self, runnable: Runnable): self.runnable = runnable def __call__(self, state: State): while True: result = self.runnable.invoke(state) if not result.tool_calls and ( not result.content or isinstance(result.content, list) and not result.content[0].get(“text”) ): messages = state[“messages”] + [(“user”, “Respond with a real output.”)] state = {**state, “messages”: messages} else: break return {“messages”: result} Step 5: Set Up the LLM with AWS Bedrock We configure AWS Bedrock to enable advanced LLM capabilities. pythonCopy codedef get_bedrock_client(region): return boto3.client(“bedrock-runtime”, region_name=region) def create_bedrock_llm(client): return ChatBedrock(model_id=’anthropic.claude-3-sonnet-20240229-v1:0′, client=client, model_kwargs={‘temperature’: 0}, region_name=’us-east-1′) llm = create_bedrock_llm(get_bedrock_client(region=’us-east-1′)) Step 6: Define the Assistant’s Workflow We create a template and bind the tools to the assistant’s workflow. pythonCopy codeprimary_assistant_prompt = ChatPromptTemplate.from_messages( [ ( “system”, ”’You are a helpful customer support assistant for Solar Panels Belgium. Get the following information from the user: – monthly electricity cost Ask for clarification if necessary. ”’, ), (“placeholder”, “{messages}”), ] ) part_1_tools = [compute_savings] part_1_assistant_runnable = primary_assistant_prompt | llm.bind_tools(part_1_tools) Step 7: Build the Graph Structure We define nodes and edges for managing the AI assistant’s conversation flow. pythonCopy codebuilder = StateGraph(State) builder.add_node(“assistant”, Assistant(part_1_assistant_runnable)) builder.add_node(“tools”, create_tool_node_with_fallback(part_1_tools)) builder.add_edge(START, “assistant”) builder.add_conditional_edges(“assistant”, tools_condition) builder.add_edge(“tools”, “assistant”) memory = MemorySaver() graph = builder.compile(checkpointer=memory) Step 8: Running the Assistant The assistant can now be run through its graph structure to interact with users. python import uuidtutorial_questions = [ ‘hey’, ‘can you calculate my energy saving’, “my montly cost is $100, what will I save”]thread_id = str(uuid.uuid4())config = {“configurable”: {“thread_id”: thread_id}}_printed = set()for question in tutorial_questions: events = graph.stream({“messages”: (“user”, question)}, config, stream_mode=”values”) for event in events: _print_event(event, _printed) Conclusion By following these steps, you can create AI Assistants Using LangGraph to calculate solar panel savings based on user input. This tutorial demonstrates how LangGraph empowers developers to create intelligent, adaptable systems capable of handling complex tasks efficiently. Whether your application is in customer support, energy management, or other domains, LangGraph provides the Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched

October 27, 2024in Salesforce

24Oct

AI Agent Workflows

AI Agent Workflows: The Ultimate Guide to Choosing Between LangChain and LangGraph Explore two transformative libraries—LangChain and LangGraph—both created by the same developer, designed to build Agentic AI applications. This guide dives into their foundational components, differences in handling functionality, and how to choose the right tool for your use case. Language Models as the Bridge Modern language models have unlocked revolutionary ways to connect users with AI systems and enable AI-to-AI communication via natural language. Enterprises aiming to harness Agentic AI capabilities often face the pivotal question: “Which tools should we use?” For those eager to begin, this question can become a roadblock. Why LangChain and LangGraph? LangChain and LangGraph are among the leading frameworks for crafting Agentic AI applications. By understanding their core building blocks and approaches to functionality, you’ll gain clarity on how each aligns with your needs. Keep in mind that the rapid evolution of generative AI tools means today’s truths might shift tomorrow. Note: Initially, this guide intended to compare AutoGen, LangChain, and LangGraph. However, AutoGen’s upcoming 0.4 release introduces a foundational redesign. Stay tuned for insights post-launch! Understanding the Basics LangChain LangChain offers two primary methods: Key components include: LangGraph LangGraph is tailored for graph-based workflows, enabling flexibility in non-linear, conditional, or feedback-loop processes. It’s ideal for cases where LangChain’s predefined structure might not suffice. Key components include: Comparing Functionality Tool Calling Conversation History and Memory Retrieval-Augmented Generation (RAG) Parallelism and Error Handling When to Choose LangChain, LangGraph, or Both LangChain Only LangGraph Only Using LangChain + LangGraph Together Final Thoughts Whether you choose LangChain, LangGraph, or a combination, the decision depends on your project’s complexity and specific needs. By understanding their unique capabilities, you can confidently design robust Agentic AI workflows. Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

October 24, 2024in Agentforce Platform, AI Tools, Artificial Intelligence, Data, Enterprise, Generative AI, Salesforce

24Oct

Rise of Agentforce

The Rise of Agentforce: How AI Agents Are Shaping the Future of Work Salesforce wrapped up its annual Dreamforce conference this September, leaving attendees with more than just memories of John Mulaney’s quips. As the swarms of Waymos ferried participants across a cleaner-than-usual San Francisco, it became clear that AI-powered agents—dubbed Agentforce—are poised to transform the workplace. These agents, controlled within Salesforce’s ecosystem, could significantly change how work is done and how customer experiences are delivered. Dreamforce has always been known for its bold predictions about the future, but this year’s vision of AI-based agents felt particularly compelling. These agents represent the next frontier in workplace automation, but as exciting as this future is, some important questions remain. Reality Check on the Agentforce Vision During his keynote, Salesforce CEO Marc Benioff raised an interesting point: “Why would our agents be so low-hallucinogenic?” While the agents have access to vast amounts of data, workflows, and services, they currently function best within Salesforce’s own environment. Benioff even made the claim that Salesforce pioneered prompt engineering—a statement that, for some, might have evoked a scene from Austin Powers, with Dr. Evil humorously taking credit for inventing the question mark. But can Salesforce fully realize its vision for Agentforce? If they succeed, it could be transformative for how work gets done. However, as with many AI-driven innovations, the real question lies in interoperability. The Open vs. Closed Debate As powerful as Salesforce’s ecosystem is, not all business data and workflows live within it. If the future of work involves a network of AI agents working together, how far can a closed ecosystem like Salesforce’s really go? Apple, Microsoft, Amazon, and other tech giants also have their sights set on AI-driven agents, and the race is on to own this massive opportunity. As we’ve seen in previous waves of technology, this raises familiar debates about open versus closed systems. Without a standard for agents to work together across platforms, businesses could find themselves limited. Closed ecosystems may help solve some problems, but to unlock the full potential of AI agents, they must be able to operate seamlessly across different platforms and boundaries. Looking to the Open Web for Inspiration The solution may lie in the same principles that guide the open web. Just as mobile apps often require a web view to enable an array of outcomes, the same might be necessary in the multi-agent landscape. Tools like Slack’s Block Kit framework allow for simple agent interactions, but they aren’t enough for more complex use cases. Take Clockwise Prism, for example—a sophisticated scheduling agent designed to find meeting times when there’s no obvious availability. When integrated with other agents to secure that critical meeting, businesses will need a flexible interface to explore multiple scheduling options. A web view for agents could be the key. The Need for an Open Multi-Agent Standard Benioff repeatedly stressed that businesses don’t want “DIY agents.” Enterprises seek controlled, repeatable workflows that deliver consistent value—but they also don’t want to be siloed. This is why the future requires an open standard for agents to collaborate across ecosystems and platforms. Imagine initiating a set of work agents from within an Atlassian Jira ticket that’s connected to a Salesforce customer case—or vice versa. For agents to seamlessly interact regardless of the system they originate from, a standard is needed. This would allow businesses to deploy agents in a way that’s consistent, integrated, and scalable. User Experience and Human-in-the-Loop: Crucial Elements for AI Agents A significant insight from the integration of LangChain with Assistant-UI highlighted a crucial factor: user experience (UX). Whether it’s streaming, generative interfaces, or human-in-the-loop functionality, the UX of AI agents is critical. While agents need to respond quickly and efficiently, businesses must have the ability to involve humans in decision-making when necessary. This principle of human-in-the-loop is key to the agent’s scheduling process. While automation is the goal, involving the user at crucial points—such as confirming scheduling options—ensures that the agent remains reliable and adaptable. Any future standard must prioritize this capability, allowing for user involvement where necessary, while also enabling full automation when confidence levels are high. Generative or Native UI? The discussion about user interfaces for agents often leads to a debate between generative UI and native UI. The latter may be the better approach. A native UI, controlled by the responding service or agent, ensures the interface is tailored to the context and specifics of the agent’s task. Whether this UI is rendered using AI or not is an implementation detail that can vary depending on the service. What matters is that the UI feels native to the agent’s task, making the user experience seamless and intuitive. What’s Next? The Push for an Open Multi-Agent Future As we look ahead to the multi-agent future, the need for an open standard is more pressing than ever. At Clockwise, we’ve drafted something we’re calling the Open Multi-Agent Protocol (OMAP), which we hope will foster collaboration and innovation in this space. The future of work is rapidly approaching, where new roles—like Agent Orchestrators—will emerge, enabling people to leverage AI agents in unprecedented ways. While Salesforce’s vision for Agentforce is ambitious, the key to unlocking its full potential lies in creating a standard that allows agents to work together, across platforms, and beyond the boundaries of closed ecosystems. With the right approach, we can create a future where AI agents transform work in ways we’re only beginning to imagine. Like1 Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data

October 24, 2024in Data, Enterprise, Salesforce, Technology

30Aug

Tectonic Guide to Rag Part 2

The first notable change in the field of language models is the significant expansion of context window sizes and a reduction in token costs. For instance, Anthropic’s largest model, Claude, has a context window exceeding 200,000 tokens, while recent reports indicate that Gemini’s context window can reach up to 10 million tokens. Under such circumstances, Retrieval-Augmented Generation (RAG) may no longer be necessary for many tasks, as all required data can be accommodated within the expanded context window. Several financial and analytical projects have already demonstrated that tasks can be solved without needing a vector database as intermediate storage. This trend of reducing token costs and increasing context window sizes is likely to continue, potentially decreasing the need for external mechanisms in LLMs, although they are still relevant for the time being. If the context window remains insufficient, methods for summarization and context compression have been introduced. LangChain, for example, offers a class called ConversationSummaryMemory to address this challenge. pythonCopy codellm = OpenAI(temperature=0) conversation_with_summary = ConversationChain( llm=llm, memory=ConversationSummaryMemory(llm=OpenAI()), verbose=True ) conversation_with_summary.predict(input=”Hi, what’s up?”) Knowledge Graphs As the volume of data continues to grow, navigating through it efficiently becomes increasingly critical. In certain cases, understanding the structure and attributes of data is essential for effective use. For example, if the data source is a company’s wiki, an LLM might not recognize a phone number unless the structure or metadata indicates that it’s the company’s contact information. Humans can infer meaning from conventions, such as the subdirectory “Company Information,” but standard RAG may miss such connections. This challenge can be addressed by Knowledge Graphs, also known as Knowledge Maps, which provide both raw data and metadata that illustrates how different entities are interconnected. This method is referred to as Graph Retrieval-Augmented Generation (GraphRAG). Graphs are excellent for representing and managing structured, interconnected information. Unlike vector databases, they excel at capturing complex relationships and attributes among diverse data types. Creating a Knowledge Graph The process of creating a knowledge graph typically involves collecting and structuring data, which requires expertise in both the subject matter and graph modeling. However, LLMs can automate a significant portion of this process by analyzing textual data, identifying entities, and recognizing their relationships, which can then be represented in a graph structure. In many cases, an ensemble of vector databases and knowledge graphs can improve accuracy, as discussed previously. For example, search functionality might combine keyword search through a regular database (e.g., Elasticsearch) and graph-based queries. LangChain can also assist in extracting structured data from entities, as demonstrated in this code example: pythonCopy codedocuments = parse_and_load_data_from_wiki_including_metadata() graph_store = NebulaGraphStore( space_name=”Company Wiki”, tags=[“entity”] ) storage_context = StorageContext.from_defaults(graph_store=graph_store) index = KnowledgeGraphIndex.from_documents( documents, max_triplets_per_chunk=2, space_name=space_name, tags=[“entity”] ) query_engine = index.as_query_engine() response = query_engine.query(“Tell me more about our Company”) Here, searching is conducted based on attributes and related entities, instead of similar vectors. If set up correctly, metadata from the company’s wiki, such as its phone number, would be accessible through the graph. Access Control One challenge with this system is that data access may not be uniform. For instance, in a wiki, access could depend on roles and permissions. Similar issues exist in vector databases, leading to the need for access management mechanisms such as Role-Based Access Control (RBAC), Attribute-Based Access Control (ABAC), and Relationship-Based Access Control (ReBAC). These access control methods function by evaluating paths between users and resources within graphs, such as in systems like Active Directory. To ensure the integrity of data during the ingestion phase, metadata related to permissions must be preserved in both the knowledge graph and vector database. Some commercial vector databases already have this functionality built in. Ingestion and Parsing Data needs to be ingested into both graphs and vector databases, but for graphs, formatting is especially critical since it reflects the data’s structure and serves as metadata. One particular challenge is handling complex formats like PDFs, which can contain diverse elements like tables, images, and text. Extracting structured data from such formats can be difficult, and while frameworks like LLama Parse exist, they are not always foolproof. In some cases, Optical Character Recognition (OCR) may be more effective than parsing. Enhancing Answer Quality Several new approaches are emerging to improve the quality of LLM-generated answers: While these advancements in knowledge graphs, access control, and retrieval mechanisms are promising, challenges remain, particularly around data formatting and parsing. However, these methods continue to evolve, enhancing LLM capabilities and efficiency. Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

August 30, 2024in Data

02Aug

Tectonic Guide to RAG

Guide to RAG (Retrieval-Augmented Generation) Retrieval-Augmented Generation (RAG) has become increasingly popular, and while it’s not yet as common as seeing it on a toaster oven manual, it is expected to grow in use. Despite its rising popularity, comprehensive guides that address all its nuances—such as relevance assessment and hallucination prevention—are still scarce. Drawing from practical experience, this insight offers an in-depth overview of RAG. Why is RAG Important? Large Language Models (LLMs) like ChatGPT can be employed for a wide range of tasks, from crafting horoscopes to more business-centric applications. However, there’s a notable challenge: most LLMs, including ChatGPT, do not inherently understand the specific rules, documents, or processes that companies rely on. There are two ways to address this gap: How RAG Works RAG consists of two primary components: While the system is straightforward, the effectiveness of the output heavily depends on the quality of the documents retrieved and how well the Retriever performs. Corporate documents are often unstructured, conflicting, or context-dependent, making the process challenging. Search Optimization in RAG To enhance RAG’s performance, optimization techniques are used across various stages of information retrieval and processing: Python and LangChain Implementation Example Below is a simple implementation of RAG using Python and LangChain: pythonCopy codeimport os import wget from langchain.vectorstores import Qdrant from langchain.embeddings import OpenAIEmbeddings from langchain import OpenAI from langchain_community.document_loaders import BSHTMLLoader from langchain.chains import RetrievalQA # Download ‘War and Peace’ by Tolstoy wget.download(“http://az.lib.ru/t/tolstoj_lew_nikolaewich/text_0073.shtml”) # Load text from html loader = BSHTMLLoader(“text_0073.shtml”, open_encoding=’ISO-8859-1′) war_and_peace = loader.load() # Initialize Vector Database embeddings = OpenAIEmbeddings() doc_store = Qdrant.from_documents( war_and_peace, embeddings, location=”:memory:”, collection_name=”docs”, ) llm = OpenAI() # Ask questions while True: question = input(‘Your question: ‘) qa = RetrievalQA.from_chain_type( llm=llm, chain_type=”stuff”, retriever=doc_store.as_retriever(), return_source_documents=False, ) result = qa(question) print(f”Answer: {result}”) Considerations for Effective RAG Ranking Techniques in RAG Dynamic Learning with RELP An advanced technique within RAG is Retrieval-Augmented Language Model-based Prediction (RELP). In this method, information retrieved from vector storage is used to generate example answers, which the LLM can then use to dynamically learn and respond. This allows for adaptive learning without the need for expensive retraining. Guide to RAG RAG offers a powerful alternative to retraining large language models, allowing businesses to leverage their proprietary knowledge for practical applications. While setting up and optimizing RAG systems involves navigating various complexities, including document structure, query processing, and ranking, the results are highly effective for most business use cases. Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

August 2, 2024in Data

02Jul

Best Practices for RAG LLM

Best Practices for Building a RAG LLM Chatbot In today’s competitive landscape, a robust Retrieval-Augmented Generation (RAG) Large Language Model (LLM) chatbot can provide a significant edge. However, implementing such a system involves multiple steps and challenges. Below is a guide to help you navigate this journey effectively. 1. Define Clear Objectives Start by identifying your goals: A clear purpose will drive your design and implementation decisions. 2. Prepare Your Data Your chatbot’s success relies heavily on the quality and structure of your data. Assess Your Data Format: Enrich Your Data: 3. Select the Right Platform Choose a platform based on your goals and data type. Common architectures include: Standard RAG with Vector Databases: Relational Databases: Hybrid Text Search: Graph Databases: 4. Fine-Tune the Model While RAG often enriches data without altering the base model, fine-tuning may be essential in specific scenarios, such as: You can fine-tune open-source models directly or use OpenAI’s APIs for custom training. OpenAI also offers domain-specific GPTs, which can save development time. 5. Master Prompt Engineering Crafting precise prompts is critical to guide LLM responses. Prompts generally include: Example for Elasticsearch: javascriptCopy codeYour task is to create a valid Elasticsearch DSL query. Given the mapping: “`{mapping}“`, translate the query: “`{query}“` into JSON. – Use only fields from the mapping. – Ensure case-insensitivity and support fuzzy matches. – Compress the JSON output, removing spaces. For advanced prompt strategies, see Modern Advances in Prompt Engineering. 6. Iterative Testing Testing machine learning models is inherently complex. Key strategies: 7. Optimize the Front-End Collaborate with UI/UX teams to integrate LLM capabilities seamlessly. Consider: 8. Avoid Common Pitfalls Looking Ahead The rise of LLMs has transformed how we think about search and chatbot applications. While we’re in the early stages, the possibilities are immense. Businesses that adapt and strategically integrate these technologies will unlock innovative opportunities and thrive in the evolving landscape. Like Related Posts Salesforce OEM AppExchange Expanding its reach beyond CRM, Salesforce.com has launched a new service called AppExchange OEM Edition, aimed at non-CRM service providers. Read more The Salesforce Story In Marc Benioff’s own words How did salesforce.com grow from a start up in a rented apartment into the world’s Read more Salesforce Jigsaw Salesforce.com, a prominent figure in cloud computing, has finalized a deal to acquire Jigsaw, a wiki-style business contact database, for Read more Service Cloud with AI-Driven Intelligence Salesforce Enhances Service Cloud with AI-Driven Intelligence Engine Data science and analytics are rapidly becoming standard features in enterprise applications, Read more

July 2, 2024in Data

LangChain

Designing AI Agents the Right Way

Building Intelligent Order Management Workflows

Pydantic AI

Autonomy, Architecture, and Action

Agentic RAG

AI Agents and Tabular Data

AI Assistants Using LangGraph

AI Agent Workflows

Rise of Agentforce

Tectonic Guide to Rag Part 2

Tectonic Guide to RAG

Best Practices for RAG LLM

Recent Posts

Salesforce’s Enterprise General Intelligence

How Agentic AI is Redefining Customer Service

Data-Driven Decision-Making in the Age of AI

Salesforce Achieves FedRAMP High Authorization for Agentforce

A Strategic Approach to Governing Enterprise AI Systems

Contact Us

Be in touch today — and start your business on a path to success.

Category

Archives