Extending Microsoft Copilot: Visual Intelligence with Midjourney
No video selected
Select a video type in the sidebar.
Earlier this month, I presented at M365 Philly on integrating Copilot and the image AI tool Midjourney.
Midjourney is not a Microsoft product, but is fantastic at producing images based on prompts.
Other Key features
- Text-to-image generation – Converts written prompts into artwork
- Highly artistic style – Known for cinematic, stylized, and aesthetically strong images
- Variations & upscaling – You can refine or enhance generated images
- Prompt-based control – You guide the output through wording, style tags, and parameters
How it works
Midjourney uses advanced diffusion models (similar to DALL·E or Stable Diffusion) trained on vast datasets of images and text to “imagine” new visuals based on your input.
How people use it
- Marketing visuals and campaigns
- Concept art (games, movies, products)
- Social media content
- Mood boards / creative exploration
- Branding and design inspiration
How does it work with Microsoft Copilot?
Firstly, there's no native integration with connectors
- Midjourney → independent AI image tool (runs via Discord + web app)
- Microsoft Copilot → productivity AI embedded in Microsoft 365, Edge, Windows, etc.Microsoft uses its own image model (DALL·E / OpenAI) inside Copilot.. .
Where Copilot is very good, it is building prompts for Midjourney.
The exercise I presented on how Copilot and Midjourney work together is that I asked someone in the audience to mention a movie... and they said - Gone in 60 Seconds. Then put someone in the audience to give me their Linkedin profile and we'll put their face on the poster.
Here are the steps:
In Copilot: Describe this movie poster?
>>
Foreground elements
1. The face (primary focal point)
- The image shows a highly detailed close-up of a human face, occupying roughly 70–80% of the poster.
- The eyes are sharply in focus and positioned near the upper center, making them the main anchor point.
- The facial features are lit with strong, warm lighting, creating pronounced highlights and shadows.
- The overall texture appears grainy and stylized, giving a gritty, cinematic feel.
2. The car (secondary focal point)
- In the lower-right corner, there is a sports car rendered with motion blur, suggesting speed.
- The car is angled diagonally, moving from right to left across the frame.
- Bright reflections and streaks of light emphasize motion and energy.
🎨 Colors and lighting
- The poster uses a warm, high-saturation palette:
- Dominant tones: orange, gold, and amber
- Accents: red and white in the text
- Lighting is intense and directional:
- The face is illuminated from one side, creating contrast and depth
- The car reflects bright highlights, adding a metallic sheen
- The overall effect is fiery and high-energy, reinforcing action and intensity.
>>If this was a midjounrey prompt what would it be?
extreme close-up portrait of a person’s face dominating the frame, sharp detailed eyes in focus, gritty skin texture, warm orange and gold color grading, dramatic side lighting, intense cinematic look, motion-blurred sports car streaking across the lower corner, strong sense of speed and urgency, high contrast highlights, glowing reflections, action movie poster style, bold composition, shallow depth of field, 85mm lens, film grain --ar 2:3 --v 6 --style raw
Put the prompt in Midjourney. This was the output

After prompting in Midjourney with the face in the audience, this was the result
