How to Spin Up Models in Azure AI Foundry and Use Keys the Right Way

Setting up your first deployment in Azure AI Foundry can feel like a maze—especially if you just want to spin up the latest GPT foundation model to use in your app or building an agent.

Let’s walkthrough how to spin up language models in Azure AI Foundry.

Azure AI Foundry is Microsoft’s newest platform for discovering, deploying, and managing AI models—whether you’re using OpenAI’s GPT-4, open-source options from Hugging Face, or other model providers like Mistral. It’s pro-code, built for developers, and designed to unify AI model lifecycle management with real governance and observability.

But the interface and workflow is… confusing. So I put together this walkthrough based on a fresh install—so you can get started without tripping over the setup.

What You’ll Learn in This Guide

What Azure AI Foundry is and how it compares to Copilot Studio
The difference between foundation models and embedding models
How to choose the right model(s) for your use case
How to deploy them in Azure AI Foundry
How to get your API key and endpoint URL
How to test your model inside the Foundry UI

Foundation Models vs Embedding Models: Which One Do You Need?

Before we jump in, a quick clarification.

For Retrieval-Augmented Generation (RAG) systems—or most real-world GenAI apps—you’ll need two types of models:

Embedding Model: Converts your content (text, audio, etc) into vector format so it can be stored, searched, and matched semantically. Used for indexing documents or database records.
Foundation Model: Generates responses. Think GPT-4 or Mistral. Used to summarize, chat, reason, or generate code.

Pro tip: Start small. Use lightweight embedding models (like text-embedding-ada-002) and only deploy GPT-4 when you’re sure you need it. Azure AI charges by token usage, so scaling smart matters when setting your token parameters.

Access note: You’ll need an Azure subscription and possibly approval for Azure OpenAI Service access if using proprietary models like GPT-5.

Step 1: Access Azure AI and Start a New Project

Let’s start from square one—spinning up your first Azure AI Foundry project. This step is all about getting logged in, creating a project, and getting to the dashboard where you can start deploying models. Whether you’re using a fresh Azure account or this is your first time in Foundry, I’ll walk you through every screen and explain the decisions you’ll make along the way.

Navigate to the Azure AI Foundry Portal

First, head to the Azure AI Foundry portal. Depending on how your tenant is configured, this might be:

https://ai.azure.com
Or accessed via the Azure AI Studio interface inside the main Azure Portal

You’ll need to log in using your Azure credentials. If you don’t have an Azure account yet, you can create a free one here.

First-Time Setup: Create Your First Project

If it’s your first time in Azure AI Foundry, you’ll likely be prompted right away to create a new project. Look for messaging like:

“No projects found”
“Get started by creating a project”
“Welcome to Azure AI Foundry”

Click the “+ Create new” or “Create project” button to proceed.

If for some reason this screen doesn’t appear automatically, you can manually start a new project by clicking Create > Project from the top nav or sidebar.

Choose the Project Type

At this point, Azure AI Foundry may offer two resource types:

Azure AI Foundry Project
Hub Resource

For most users (especially if you’re just experimenting or trying to deploy your first model), choose Azure AI Foundry Project. The Hub is useful later if you’re managing multiple projects or need shared configuration, security, or networking setups across a team. But it’s not required to get started.

📝 Tip: You can always associate a project with a Hub later if needed.

Fill in the Project Details

You’ll now see a form to configure your new project. Here’s what each field means and how to fill it out:

Project Name: This is a friendly display name. Call it something simple like “My-First-AI-Project”.
Subscription: Choose the Azure subscription to bill this project’s usage against. Most users will only see one subscription here.
Resource Group: You can select an existing resource group or click “Create new” to make one. Example name: “AIFoundryResources”. This is just an organizational container for your project and its assets.
Project Domain Name: This must be globally unique (across all of Azure). It’s used for internal identifiers and endpoints. Try something like “my-aifoundry-demo” or “my-ai-app-1234”. If the name is already taken, you’ll get an error and need to try another.
Region: Pick a region closest to your app’s resources like your database to improve latency. If your database is in West US, spin up your models in West US. Common options include:
- West US
- East US
- West Europe
- Australia East

📝 Note: Not all model families are available in all regions. For example, if you want to deploy GPT-4 Turbo, stick with known regions like East US or West US.

You may also see advanced settings like “Associate with Hub” or “Networking options.” Leave these at their default values unless you know you need something specific.

Create the Project

Once everything looks good, click Create or Review + Create > Create to start provisioning your project.

Behind the scenes, Azure will spin up the necessary infrastructure for you. This includes:

Azure OpenAI resource
Storage
Key Vault
Service connections

This can take anywhere from 30 seconds to 2 minutes.

Project Overview & Dashboard

Once provisioning is complete, you’ll land on your Project Overview page. This is your launchpad inside Foundry.

From here, you can:

Deploy new models
View keys and endpoints
Track usage and billing
Manage team access

Your dashboard may show:

Project Name and ID
Project Domain
Empty model list
Endpoint info for CLI or SDK usage

By the end of this step, you’ve successfully created your first Azure AI Foundry project. You’re now ready to deploy a model, generate API keys, and start building with real LLMs.

Step 2: Search the Model Catalog

Now that you’ve created your first Azure AI Foundry project, it’s time to put it to work. In this step, we’ll explore the Model Catalog to find the AI models you want to deploy—specifically:

A Text Embedding model for converting text into vector representations (great for search, clustering, and retrieval tasks)
A Chat Completion model like GPT-3.5 or GPT-4 for generating conversational responses

You’ll see how to search for these models, view their details, and prep them for deployment.

Open the Model Catalog

Once you’re inside your project workspace, look for “Model Catalog” in the sidebar navigation. This is where Foundry lists all the available models you can deploy into your project.

Click Model Catalog to open it.

Depending on your screen layout, you may see this appear as a tab at the top or a section on the left.

Browse or Search for Models

You’ll now see a searchable list or grid of foundation models provided by Azure. This catalog includes:

Text generation models like GPT-3.5 and GPT-4
Vision models (for images)
Embedding models (used for turning text into vector data)
Fine-tuned or task-specific models (depending on your region)

Let’s start with the Embedding Model.

Search for the Embedding Model

In the search bar at the top of the catalog, type:

embedding

You should see results like:

text-embedding-3-small
Possibly other versions like “text-embedding-ada-002” depending on the release cycle and naming

These models are typically based on OpenAI’s Ada family and offer a fast, efficient way to convert chunks of text into numerical vectors for downstream tasks.

Click into the Model for Details

Click on the model name to open the details panel. This will show you:

Model description and version
Availability (regions where it can be deployed)
Pricing per 1K tokens (useful for estimating costs)
Model offer (typically “Azure AI Services,” which means fully managed)
Possibly info about latency or SLA (optional)

This is where you decide whether this model fits your use case. For our purpose—vectorizing data for semantic search or clustering—this is a great fit.

Repeat for Chat Model (Optional)

Now let’s find the Chat Completion model.

In the search bar, type:

chat

Or search for a specific model like “gpt-3.5-turbo” or “gpt-4”.

Depending on your Azure region and account access, you might see options like:

GPT-3.5 Turbo
GPT-4 Turbo
Possibly multilingual or tuned variants

Again, click into the model name to view:

Description (often “chat-based large language model”)
Pricing (per 1K tokens)
Regions
Offer type (e.g., Azure AI Services)

What You’re Seeing in the Catalog

Each entry in the catalog represents a model version that Azure is offering as a managed service. Key things to notice:

The Offer will usually say “Azure AI Services” — this means you’re not responsible for hosting or maintaining the model.
You’ll see a region list where the model is available. This matters: you can only deploy the model to your project if it’s in the same region as your project.
Some models show token pricing, which is helpful for budget planning.

You don’t need to worry about infrastructure or scaling. These are API-based deployments—Azure handles the rest.

By the end of Step 2, you’ve:

Opened the Model Catalog
Found the Embedding model and Chat model
Reviewed their details
Confirmed they’re available for deployment in your region

Next up, we’ll deploy these models into your project and get your API keys and endpoint—so you can actually call them from your app or script.

Step 3: Deploy Your Embedding Model

You’ve got your project. You’ve found your model. Now let’s actually deploy it.

In Azure AI Foundry, deploying a model means you’re making it available for use in your project—whether you’re calling it via the portal, a notebook, or directly via API. In this step, we’ll walk through deploying the Text Embedding Ada model (or whichever embedding model you picked in Step 2).

Use This Model

After you’ve clicked into the model details page, look for a “Use this model” button.

This kicks off the deployment process. You may also see a small rocket ship icon or a “Deploy” button—it depends slightly on the UI version you’re seeing, but the idea is the same: you’re creating a live deployment of that model inside your project.

Configure the Deployment

Now you’ll fill out a few required fields to finalize the deployment.

Deployment Name

This is a unique name you assign to your model deployment.

Example: text-embed-model-1 or embedding-demo
I used embed-model for mine.
Use only lowercase letters, numbers, and hyphens—no spaces or special characters.

This name becomes part of the internal API identifier, so make it clear and consistent.

Tier / SKU Selection

Depending on your Azure account and the model, you may see options like:

Standard
Global Standard
Provisioned (for high-throughput or SLA-guaranteed workloads)

For most projects, Standard is fine. I went with Standard in my walkthrough.

If you see Global Standard, just know it usually comes with higher throughput and a higher token quota, which might be helpful in production scenarios—but it’s not necessary for testing or light use.

Region / Location

This should match the region you selected when you created the project. If not, you may run into region to region latency issues later.

I used East US 2
Common options include East US, West Europe, or Southeast Asia

Azure should auto-fill the region for you if your project only supports one.

Click Deploy

Once your configuration is set, click Deploy.

Azure will now begin spinning up the backend infrastructure for your model deployment. This typically includes:

Creating a deployment instance for the model
Linking it to your Azure AI Foundry project
Connecting with the underlying Azure OpenAI service (if applicable)

This part may take 30–60 seconds depending on your region and system load.

Deployment Success Confirmation

Once the deployment is complete, you should see one of two things:

A confirmation toast or success message, and/or
A list of active assets or My Assets showing your new deployed model

You can now see that your embedding model is live and available for use.

It’ll typically show:

Deployment name (embed-model)
Status: “Deployed” or “Succeeded”
Type: Embedding
Tier: Standard
Region

✅ At this point, you’ve:

Deployed your first model into Azure AI Foundry
Set its name, tier, and region
Watched Azure provision the backend resources for you

In the next step, we’ll deploy your foundation model.

Step 4: Deploy a model

Now that the embedding model is live in our workspace, let’s bring in the conversational heavy hitter—GPT-3.5.

This model will power your text generation workflows: think chatbots, natural language interfaces, summarization tools, and anything that needs a brain behind a paragraph.

Find GPT-3.5 in the Model Catalog

Head back to the Model Catalog from your Foundry project dashboard.

You can either scroll through the Chat Completion section, or type “gpt-3.5” into the search bar. You’ll likely see a few variations depending on availability in your region and your Azure account:

GPT-4 (standard version)
GPT-4o (is a faster, more cost-effective, and multimodal version of GPT-4)
GPT-3.5 Turbo (a solid fallback if GPT-4 isn’t available)

Click on the version you want. I used the base GPT-3.5 model for my project.