Grok's composer teardown
Updated June 13, 2026
Grok strips the landing page down to a single bar: + on the left, Fast on the right, mic and voice beside it. Imagine and Private sit top-right. Connectors and SuperGrok promos show up in the same focal band when the product wants your attention.
Calm default

What works
- No starter pills, no tool chips, no mode vocabulary on first load. You can just type.
- Fast shows before the first keystroke. Price-sensitive users see what they are spending on; everyone sees model tier in one control.
- + on the left and model on the right keep attach separate from which brain answers.
What we would push on
- Imagine and Private live top-right, far from the bar. Image gen and privacy modes feel like separate products.
- Mic and waveform sit as plain icons side by side on the right rail. Dictation and voice conversation look the same until you tap.
- Placeholder copy shifts between sessions (How can I help you today versus What do you want to know?). Small inconsistency on a page with almost no other text.
Takeaway
Radical calm with honest model visibility. The bar split works. Voice entry icons and top-right modes need clearer differentiation.
Feature discovery

What works
- Connectors banner teaches app wiring at the point of ask, with Dismiss and Connect so users stay in control.
- App icons in the banner preview what connectors mean before you open settings.
- Feature education sits directly under the bar where eyes already landed.
What we would push on
- SuperGrok countdown pill fights the bar for attention on the same screen as the connectors banner.
- Two growth surfaces at once on an otherwise empty page. A lot of noise for a calm default.
- Orange Claim Offer is the only strong color on the page. It pulls focus from the composer itself.
Takeaway
Good inline feature surfacing for connectors. Keep upsell chrome out of the composer focal band or sequence promos so they do not stack.
Attachments

What works
- The image sits inside the composer pill as a small thumb with x. You see what Grok will read without leaving the bar.
- Attachment does not break the pill shape. The bar still reads as one control.
What we would push on
- Inline preview shows pixels, not filename and type. Harder to scan when you stack multiple files.
- Thumbnail competes with + for space on the left. Dense once you add skills chips too.
Takeaway
Steal compact inline preview for a single image. Add filename chips when users attach more than one thing.
Ready to send

What works
- Send affordance appears as a filled black arrow once there is content to ship. Clear shippable state.
- Image thumb and text coexist in one pill without pushing model controls off the bar.
- Fast stays visible so model choice does not disappear when the bar gets busy.
What we would push on
- Send icon swaps from waveform to arrow when text appears. The right rail changes meaning as you type.
- No explicit filename on the image chip. You trust the thumb, not a label.
Takeaway
Strong ready-to-send state. Keep right-rail iconography consistent so send does not feel like a mode swap.
Skills applied

What works
- Armed skills show as chips inside the bar. Scope is visible before send.
- Same chip grammar as file attachments. One visual system for what will affect the next message.
What we would push on
- Chips say docx and pdf, not the skill name or outcome. Extension labels hide what Grok will actually do.
- Multiple chips crowd the left side of the pill fast once you add an image too.
Takeaway
Chip persistence is right. Rename chips to match the tooltip copy users saw when they picked the skill.
Model picker

What works
- Fast, Expert, and Heavy use outcome labels with model version in subtext. Capabilities without jargon.
- Auto row explains it chooses Fast or Expert. Intelligent default is one tap away.
- Custom instructions at the bottom shows Concise as current state. Settings visible inside the same menu.
What we would push on
- SuperGrok upgrade sits inside the model menu between capability rows and settings. Mixes paywall with model pick.
- Fast is selected by default, not Auto. Power users get transparency; casual users may not discover routing.
- Bar still says Fast while the menu exposes four tiers plus upgrade. Naming is consistent but the ladder is long.
Takeaway
Strong tier naming and honest subtext. Pull upgrade into its own row or footer so model choice stays about capability.
Voice dictation

What works
- Listening replaces the right rail with a waveform and checkmark. Clear in-bar dictation mode.
- You stay on the landing page. No full-screen handoff for a short utterance.
What we would push on
- Mic and waveform look the same on the default bar. Users only learn the difference after tapping.
- Waveform control inside a narrow pill can feel cramped on desktop.
Takeaway
Inline dictation works. Differentiate mic versus conversation icons on the default state, not only after activation.
Voice session

What works
- You may start speaking sets expectations before the first word. Human entry to hands-free mode.
- Stop is prominent and unambiguous. Easy exit from a live session.
- Paperclip still available in voice mode. Attach does not disappear when you switch modalities.
What we would push on
- Composer layout changes completely between text and voice. The bar you learned on the landing page is not the bar in session.
- Ara · Assistant persona picker only shows in voice mode. Text users may never see persona controls.
Takeaway
Strong conversation shell with a clear Stop. Bridge text and voice layouts so the bar does not feel like two products.
How it fits together
What works
- Calm pill first, then + for uploads, skills, and connectors.
- Fast on the right rail, separate from attach on the left.
- Chips inside the bar for files and armed skills before send.
What we would push on
- Growth banners and SuperGrok promos stack on an otherwise empty page.
- Skills labeled as file extensions, not outcomes.
- Mic and waveform indistinguishable until activated.
- Text and voice composers use different layouts.
Takeaway
Grok bets on maximum calm and visible model tier. The + tree and in-bar chips carry scope honestly. Growth chrome, skill naming, and voice entry need the same discipline the default bar already has.
Steal this
- Fast visible in the bar before the first keystroke
- Connectors banner with Dismiss and Connect at the point of ask
- Inline image thumb inside the composer pill
- Skill tooltips beside flyout rows
- Custom instructions state shown inside the model menu
Skip this
- Stacking upsell pill and feature banner on the same empty landing
- Skill chips labeled docx instead of the job they perform
- Mic and waveform icons that look identical on default
- SuperGrok upgrade row inside the model capability list
Original gallery pages: Tool Switching in Composer · Multimodal Input · Model Selection · Persona Switcher

