Name	Name	Last commit message	Last commit date
parent directory ..
README.md	README.md
foundry-iq-cookbook.ipynb	foundry-iq-cookbook.ipynb

Episode 2 Cookbook: Building the Data Pipeline with Knowledge Sources

This folder contains the hands-on cookbook for Episode 2 of The Foundry IQ Series.

📋 Prerequisites

Azure Subscription with permissions to create resources and assign roles
Azure CLI installed and configured (Install guide)
Python 3.10+ installed
A region that supports agentic retrieval (default: eastus2)

🚀 Deploy Azure Resources

Note: This deployment is shared across all Foundry IQ episodes. You only need to deploy once — if you've already deployed for another episode, skip this step and reuse your existing resources.

Deploy all required Azure resources with one click — this creates AI Search, Azure OpenAI, AI Services, a Foundry project, an AI Search connection, Azure Blob Storage, model deployments, and RBAC roles:

⚠️ Troubleshooting: Deployment script failed?

Some Azure tenants enforce policies that block key-based access on storage accounts. This can cause the data seeding script to fail while all other resources deploy successfully. If this happens, your Azure resources are fully deployed — only the sample data and knowledge base setup is missing. You can seed the data manually using either of these alternatives:

Run the Episode 1 cookbook: Open the Episode 1 cookbook and run it end-to-end — it indexes the same NASA "Earth at Night" sample data to your AI Search and creates the knowledge source and knowledge base.

Seed via Foundry IQ UI: Create an index in AI Search manually using the NASA Earth at Night dataset, then create a knowledge source and knowledge base pointing to it through the Foundry IQ portal.

In the deployment form:

Create a new resource group (e.g., iq-series-rg) — click Create new under the Resource group field. If you've already created one for a previous episode, select it instead
Enter your User Object ID (see below)
Customize the resource prefix, location, and SKUs

How to get your User Object ID: Open a terminal and run:

az ad signed-in-user show --query id -o tsv

This returns your Microsoft Entra ID unique identifier — paste it into the deployment form. It's needed to assign proper RBAC roles to your account.

After deployment, create a .env file in this folder (2-Foundry-IQ-Building-the-Data-Pipeline-with-Knowledge-Sources/cookbook/.env) with your values from the deployment outputs:

SEARCH_ENDPOINT=https://<your-search-service>.search.windows.net
AOAI_ENDPOINT=https://<your-openai-resource>.openai.azure.com
AOAI_EMBEDDING_MODEL=text-embedding-3-large
AOAI_EMBEDDING_DEPLOYMENT=text-embedding-3-large
AOAI_GPT_MODEL=gpt-4o-mini
AOAI_GPT_DEPLOYMENT=gpt-4o-mini
BLOB_CONNECTION_STRING=<your-blob-connection-string>
BLOB_CONTAINER_NAME=<your-container-name>

Where to find these values: All values are available in the deployment Outputs tab in the Azure portal. Copy searchEndpoint, openAiEndpoint, blobConnectionString, and blobContainerName directly from the outputs.

For CLI deployment and cleanup instructions, see the Infrastructure Guide.

📓 Cookbook Notebook

The Foundry IQ Cookbook walks you through building the data pipeline with Knowledge Sources, step by step:

Understanding Knowledge Source types (indexed vs. remote)
Creating a search index and uploading sample product data
Creating an indexed Knowledge Source (Azure AI Search)
Creating a Blob Storage Knowledge Source (automated ingestion pipeline)
Creating a Web Knowledge Source (real-time public information)
Combining multiple sources in a single Knowledge Base
Querying across sources and inspecting the activity log
Security and governance considerations

Quick Start

Install dependencies: pip install -U azure-search-documents==12.1.0b1 azure-identity python-dotenv
Sign in to Azure: run az login in a terminal
Create a .env file with your endpoint values (see the notebook for details)
Open foundry-iq-cookbook.ipynb in VS Code and run the cells

Running in GitHub Codespaces? The devcontainer already installs all dependencies and the VS Code Jupyter extension automatically. Just open the .ipynb file directly in the VS Code editor — no need to install or launch a standalone Jupyter server. The notebook renders and runs natively inside VS Code.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Episode 2 Cookbook: Building the Data Pipeline with Knowledge Sources

📋 Prerequisites

🚀 Deploy Azure Resources

📓 Cookbook Notebook

Quick Start

Additional Resources

FilesExpand file tree

cookbook

Directory actions

More options

Directory actions

More options

Latest commit

History

cookbook

Folders and files

parent directory

README.md

Episode 2 Cookbook: Building the Data Pipeline with Knowledge Sources

📋 Prerequisites

🚀 Deploy Azure Resources

📓 Cookbook Notebook

Quick Start

Additional Resources