My Agent Development: Why I Focus on Tool Selection

📖 10 min read•1,892 words•Updated May 16, 2026

Hey everyone, Leo here from agntdev.com! Today, I want to talk about something that’s been rattling around my brain for a while now, something that often gets overlooked when we’re deep in the weeds of prompt engineering or orchestrator logic: the often-underestimated, sometimes infuriating, but always crucial role of tool selection and integration in agent development.

We’re all chasing that dream of autonomous agents, right? The ones that can truly understand a goal, break it down, and execute. But let’s be honest, a large language model (LLM), no matter how brilliant, is just a fancy text generator without the ability to *do* things. It needs hands and feet in the real world, and those are its tools.

I’ve seen so many projects stall, or worse, produce hilariously bad results, because the tools an agent has access to aren’t quite right for the job. It’s like giving a master chef a dull butter knife and expecting a Michelin-star meal. You might get something edible, but it won’t be pretty, and it definitely won’t be efficient.

Today, I want to dive into how we pick the right tools, how we integrate them effectively, and how to avoid some common pitfalls I’ve stumbled into myself. This isn’t about writing the perfect prompt for a specific tool; it’s about the strategic decision-making around the tools themselves.

Beyond the Obvious: Why Tool Selection Matters More Than You Think

When I first started playing with agents, my approach to tools was pretty basic: “Does it do the thing I need? Great, slap it in!” If I needed to fetch data, I’d give it a generic web search. If I needed to write a file, I’d give it a generic file writer. Simple, right?

The problem is, “generic” often means “inefficient,” “prone to error,” or “requiring excessive prompting.” I remember working on an agent designed to research market trends for small businesses. My initial setup involved giving it a general Google search tool and a file writer. The idea was it would search, summarize, and save reports.

What actually happened was a lot of aimless searching, trying to parse entire web pages for specific data points, and then attempting to synthesize huge chunks of text. It was slow, expensive (API calls add up!), and the output was… fluffy. It would spend ages trying to figure out how to extract a specific quarterly revenue number from a financial news article when a direct API call to a financial data provider would have been instantaneous and accurate.

That was my “aha!” moment. The agent wasn’t failing because its reasoning was flawed; it was failing because its hands were tied. It was trying to use a hammer for a screw.

The “Right Tool for the Job” Isn’t Just a Saying

Think about a human expert. A data analyst doesn’t just “Google” everything. They use SQL databases, specific BI tools, Python scripts with Pandas, and maybe a financial API. Each tool is chosen for its precision, efficiency, and suitability for a particular task.

Our agents need the same consideration. When you’re designing an agent’s capabilities, ask yourself:

What exact data does it need? Is it a simple fact, structured data, or a general overview?
What actions does it need to perform? Is it just reading, or does it need to modify external systems?
What’s the desired output format? Raw text, JSON, a formatted report?
What are the constraints? Cost, speed, accuracy, data privacy?

These questions guide you away from the generic and towards specialized tools. Instead of a general web search, maybe it needs a specific API to a financial data provider. Instead of a generic file writer, maybe it needs a tool that writes structured JSON to a specific cloud storage bucket.

Practical Tool Integration: Beyond the Function Signature

Let’s talk brass tacks. Integrating a tool usually means wrapping some functionality in a way the LLM can understand and call. Most frameworks (LangChain, LlamaIndex, custom orchestrators) handle the boilerplate of exposing a tool’s name and description. But there’s more to it.

1. Precise Tool Descriptions: The Agent’s Manual

This is where many of us get lazy. A tool description like “search the internet” is vague. An LLM might interpret that as “go to Google and type in whatever I feel like.” A better description for a *specific* search tool might be:


Tool Name: `market_trend_search`
Description: "Searches for recent market trend reports and analyses from reputable financial news outlets and industry research firms. Use this tool when you need to understand current market sentiment, growth areas, or emerging technologies within a specific industry. Provide a concise, specific search query focusing on market trends, e.g., 'AI in healthcare market trends 2026'."
Parameters:
 query (string, required): The specific market trend to search for.

Notice the details: “reputable financial news outlets,” “industry research firms,” “current market sentiment,” and even an example query. This guides the LLM to use the tool appropriately and to formulate better inputs.

I remember an agent I built for summarizing news articles. My initial `news_fetcher` tool description was just “Fetches news articles.” The agent, bless its heart, would sometimes try to fetch articles about historical events or even fictional stories if my main prompt was broad. Adding “Fetches *recent* news articles from *major news outlets* related to *current events*” drastically improved its focus.

2. Input Validation and Sanitization: Protecting Your Tools

LLMs are powerful, but they can be… creative with inputs. If your tool expects an integer for a `quantity` parameter, and the LLM decides to pass “five units,” your tool will likely crash. Always, always, validate inputs within your tool’s wrapper.


# Example Python Tool (simplified)
def get_stock_price(symbol: str, date: Optional[str] = None) -> float:
 """
 Fetches the closing stock price for a given ticker symbol on a specific date.
 
 Args:
 symbol (str): The stock ticker symbol (e.g., "AAPL").
 date (str, optional): The date in YYYY-MM-DD format. Defaults to today.
 
 Returns:
 float: The closing price.
 """
 if not isinstance(symbol, str) or not symbol.isalpha() or len(symbol) > 5:
 raise ValueError("Invalid stock symbol provided.")
 
 # Basic date validation
 if date:
 try:
 datetime.strptime(date, '%Y-%m-%d')
 except ValueError:
 raise ValueError("Invalid date format. Use YYYY-MM-DD.")
 
 # ... actual API call logic ...
 return 175.50 # Placeholder for actual price

This isn’t just about preventing errors; it’s about giving the agent clear feedback. If your validation raises a `ValueError` with a helpful message (“Invalid date format. Use YYYY-MM-DD.”), a well-tuned agent can learn from that and correct its next attempt.

3. Output Formatting and Summarization: Making Sense for the LLM

Just as important as inputs is what the tool returns. Often, a raw API response is a massive JSON blob. Feeding that entire blob back to the LLM is inefficient, expensive, and can easily exceed context windows. Your tool should process its raw output into something concise and relevant.

Let’s say your `financial_data_api` tool fetches a huge report. Instead of returning the whole thing, your tool wrapper could extract just the key figures the agent usually needs (e.g., “Q1 2026 Revenue: $X billion, Net Profit: $Y billion, YOY Growth: Z%”).


# Example Tool Output Processing
def fetch_quarterly_report(company_ticker: str, quarter: str) -> str:
 """
 Fetches and summarizes key financial data for a company's specific quarter.
 
 Args:
 company_ticker (str): The stock ticker symbol.
 quarter (str): The quarter (e.g., "Q1 2026").
 
 Returns:
 str: A summarized string of key financial metrics.
 """
 # ... hypothetical API call to get a large JSON report ...
 raw_data = {
 "company": company_ticker,
 "quarter": quarter,
 "revenue": {"value": 12.5, "unit": "billion USD", "yoy_growth": 0.15},
 "net_profit": {"value": 2.1, "unit": "billion USD", "yoy_growth": 0.22},
 "eps": {"value": 1.50, "unit": "USD", "yoy_growth": 0.18},
 "full_report_link": "http://example.com/report.pdf",
 # ... many other fields ...
 }

 summary = (
 f"For {raw_data['company']} {raw_data['quarter']}:\n"
 f"- Revenue: ${raw_data['revenue']['value']} {raw_data['revenue']['unit']} "
 f"(YoY Growth: {raw_data['revenue']['yoy_growth']:.0%})\n"
 f"- Net Profit: ${raw_data['net_profit']['value']} {raw_data['net_profit']['unit']} "
 f"(YoY Growth: {raw_data['net_profit']['yoy_growth']:.0%})\n"
 f"- EPS: ${raw_data['eps']['value']} {raw_data['eps']['unit']}\n"
 f"Full report available at: {raw_data['full_report_link']}"
 )
 return summary

This not only saves tokens but also gives the LLM pre-digested information, making its reasoning easier and more reliable. It’s like giving your agent a highlighted summary instead of the entire textbook.

Advanced Considerations: When to Build, When to Buy, When to Adapt

Building Custom Tools

Sometimes, existing APIs or libraries just don’t cut it. This is where you roll up your sleeves and build a custom tool. My advice here: keep them focused. Don’t try to build a single “Swiss Army knife” tool that does 10 different things. An agent will struggle to understand when to use which part of it.

Instead, create several small, single-purpose tools. For instance, instead of one `data_manipulator` tool, have `csv_reader`, `json_parser`, and `dataframe_aggregator`. This gives the LLM clear, distinct options.

Adapting Existing Tools (or APIs)

Often, you’ll be working with existing APIs. The key here is to create thin wrappers that translate the LLM’s intent into the API’s expected format, and then translate the API’s response into something digestible for the LLM. This is where the `get_stock_price` and `fetch_quarterly_report` examples above come in. You’re not reinventing the API; you’re just making it agent-friendly.

The Tool-Agent Feedback Loop

Finally, remember that tool selection and integration isn’t a one-and-done job. It’s an iterative process. As you test your agents, pay close attention to:

Tool usage frequency: Is it using the right tools often enough?
Tool selection errors: Is it trying to use a tool for the wrong purpose? (This points to bad descriptions or missing validation.)
Tool execution errors: Are tools failing? (Often due to bad inputs from the LLM, or the tool itself being flaky.)
Agent performance after tool output: Is the agent making good decisions based on the tool’s output, or is it getting confused? (This might mean the output needs further summarization or reformatting.)

My own journey developing agents has been a constant cycle of refining tool descriptions, adding more specific tools, breaking down complex tools into simpler ones, and enhancing output processing. It’s not glamorous work, but it’s where the rubber meets the road for truly capable agents.

Actionable Takeaways

If you’re building agents, here’s what I want you to remember from today:

Be specific with tool selection: Don’t just dump generic tools into your agent’s arsenal. Think about the *exact* capabilities it needs and find or build tools that provide those precisely.
Craft detailed and guiding tool descriptions: Your tool descriptions are the agent’s manual. They should be clear, concise, and provide examples of how and when to use the tool.
Implement robust input validation: Protect your tools from unexpected inputs from the LLM and provide helpful error messages for the agent to learn from.
Process and summarize tool outputs: Don’t just pass raw API responses back. Filter, summarize, and format outputs to be as useful and token-efficient as possible for the LLM.
Iterate and observe: Tool integration is an ongoing process. Monitor how your agent uses tools and refine them based on real-world performance.

Focusing on these aspects will elevate your agents from interesting prototypes to genuinely effective, reliable automation powerhouses. Give it a shot, and let me know what kinds of specialized tools you’re building!

🕒 Published: May 16, 2026

✍️

Written by Jake Chen

AI technology writer and researcher.

Learn more →