Should I use aspect-ratio or width/height attributes?

Use width/height when intrinsic size is known to prevent CLS; aspect-ratio is a good fallback when dimensions are unknown at build time.

Is AVIF worth it over WebP?

Often, yes. AVIF can deliver smaller files at similar quality, especially for photos. Provide both formats with <picture> for broad coverage and measure on your actual assets.

How many breakpoints do I need in srcset?

Three to four well-chosen widths (e.g., 640, 800, 1200, 1600) plus an accurate sizes string is typically sufficient.

Building a Responsive Image Grid for Modern Websites

By Mason Goulding · Aug 7, 2025 Updated Sep 6, 2025

A developer-first walk-through of building fast, accessible, responsive image grids that are safe from Cumulative Layout Shift (CLS). Using CSS Grid (with a lightweight Flexbox fallback), modern media formats, and build-time automation is the exact playbook I apply on client builds where appropriate.

No frameworks required — no yak-shaving — and no mystery performance regressions later. Take the time to get the fundamentals down to save developer hours and keep projects scalable long-term.

Why Grid First (and How to Avoid CLS Issues)

CSS Grid gives you reliable, gap-controlled layouts that adapt to any viewport without JavaScript, in other words, it is essential to any modern build. The real win: the grid does not fight you as content changes, it instead adapts to content additions in predictable ways.

The common performance failure condition I see in audits is layout shift from image assets rendering without reserved space. That shift is not just a minor annoyance — it is visible, measurable, and taxing to user attention. If you want stable pages, you have to set aside space for the pixels before they arrive.

The simplest, most robust fix is to ship intrinsic dimensions: set the width and height HTML attributes to the image’s natural size. In modern development, browsers now compute the aspect ratio from those numbers and lay out the page accordingly; if you do not know the intrinsic size, set a deliberate aspect-ratio in CSS for the slot and use object-fit: cover to keep crops tight and set as the baseline for Core Web Vitals sanity.

Performance: serve next-gen formats (avif/webp), describe your layout with srcset/sizes, and lazy-load non-critical images. See: web.dev on CLS and MDN on responsive images.
Accessibility: write descriptive, purposeful alt text; use <figure>/<figcaption> when the image adds context.
Maintainability: lean utilities or Tailwind classes so you don’t refactor a stylesheet every time the grid changes.

Semantic Markup Scaffold

Start with semantic containers so screen readers and search engines understand the gallery. Each tile is a <figure> with an optional caption. This also makes it trivial to link the whole card later without weird nesting or duplicated labels.

<section aria-labelledby="gallery-title" class="gallery">
  <h2 id="gallery-title" class="sr-only">Project Gallery</h2>

  <figure class="gallery__item">
    <picture>
      <source srcset="/img/hero-1200.avif 1200w, /img/hero-800.avif 800w" type="image/avif">
      <source srcset="/img/hero-1200.webp 1200w, /img/hero-800.webp 800w" type="image/webp">
      <img
        src="/img/hero-800.jpg"
        srcset="/img/hero-800.jpg 800w, /img/hero-1200.jpg 1200w"
        sizes="(min-width: 1024px) 33vw, (min-width: 640px) 50vw, 100vw"
        width="1200" height="800"
        alt="Redesign of a marketing homepage with bold headline"
        loading="lazy" decoding="async" class="gallery__img">
    </picture>
    <figcaption class="gallery__caption">Homepage redesign</figcaption>
  </figure>

  <!-- Repeat items... -->
</section>

Explicit width/height preserves aspect ratio and prevents CLS. The sizes attribute must reflect the actual rendered width; guesswork here forces the browser to pick the wrong source and waste bytes. For a deeper dive on how browsers select candidates, read MDN: <picture>.

Option A: Pure CSS Grid (Auto-Fit Columns)

This pattern creates fluid columns that adjust as space allows. It reads well, it’s stable, and it survives design tweaks without having to rip out layout code. If your product or case study grid is constantly changing count and aspect ratios then this is the workhorse layout.

/* gallery.css */
.gallery{
  --gap: 1rem;
  display: grid;
  grid-template-columns: repeat(auto-fit, minmax(220px, 1fr));
  gap: var(--gap);
  align-items: start;
}
.gallery__item{
  display: grid; /* allows caption to size below image */
  gap: .5rem;
}
.gallery__img{
  width: 100%;
  height: auto;
  aspect-ratio: 3 / 2;          /* safety if width/height omitted */
  object-fit: cover;            /* crops nicely in varying cells */
  border-radius: .5rem;
  background: #f4f6f8;          /* placeholder color */
}
.gallery__caption{
  font-size: .875rem;
  color: #6b7280;               /* gray-500 */
}

Tweak minmax(220px, 1fr) to control the smallest card size. If Grid is new to you, MDN’s mental model pages do a good job of explaining track sizing and auto-placement.

Option B: Flexbox Rows (Simple Fallback)

Flexbox does not align rows like Grid, but it’s a fine fallback if you need broad compatibility or a drop-in utility for a legacy template. The trick is to calculate widths so the gaps remain consistent across wraps, which can be frustrating and require a lot of manual adjustments.

.gallery{
  --gap: 1rem;
  display: flex;
  flex-wrap: wrap;
  gap: var(--gap);
}
.gallery__item{ width: calc(50% - var(--gap)/2); }
@media (min-width: 640px){ .gallery__item{ width: calc(33.333% - var(--gap)*2/3); } }
@media (min-width: 1024px){ .gallery__item{ width: calc(25% - var(--gap)*3/4); } }

Tailwind-First Variant (One-Liners)

If you are using Tailwind, then you likely know that the same layout is only one class away. Go ahead and keep the semantics and the attributes; the classes simply remove boilerplate CSS. For a broader discussion of utility-first development and when to reach for it, see file structure for speed and scale.

<section aria-labelledby="gallery-title"
  class="grid grid-cols-1 sm:grid-cols-2 lg:grid-cols-3 gap-4">
  <figure class="space-y-2">
    <img class="w-full aspect-[3/2] object-cover rounded-xl bg-gray-100"
         src="/img/hero-800.jpg" width="1200" height="800"
         alt="Homepage redesign" loading="lazy" decoding="async">
    <figcaption class="text-sm text-gray-500">Homepage redesign</figcaption>
  </figure>
  <!-- ... -->
</section>

Performance: Formats, Srcset, and Byte Budgets

Images dominate payload on marketing sites, well, most sites to be frank. The fix is not just “use WebP,” or else most sites would perform at far higher ranges than they currently do. The fix is a repeatable process: derive multiple widths, choose realistic sizes, cap bytes per tile, and measure your Largest Contentful Paint (LCP) on mobile.

When you choose to do this, your grid stops being a performance risk where every stylistic and content change is a frustrating experience, and instead becomes a non-event across monitoring and polish efforts.

Formats: prefer avif, then webp, then jpg/png fallback with <picture>. See web.dev on next-gen formats.
Srcset/sizes: ship a few widths that match your layout and write an honest sizes string. MDN’s breakdown of srcset/sizes is the canonical reference: MDN responsive images.
Lazy load: use loading="lazy" for offscreen tiles; eager-load the first one if it is your LCP. See Lighthouse docs on LCP.
Preload the LCP: when the first tile is your hero, consider a <link rel="preload" as="image"> with the exact source the browser will choose.
Automation: generate derivatives at build time and enforce a byte budget (for example, ≤ 150 KB per tile). My builds use a tiny Node script with sharp to keep asset decisions out of human hands.

// scripts/make-images.js (Node + sharp)
import sharp from "sharp"; import fg from "fast-glob"; import { dirname } from "node:path"; import { mkdirSync } from "node:fs";
for (const src of await fg("src/images/*.{jpg,jpeg,png}")){
  const base = src.replace(/^src\/images\//,"").replace(/\.(jpe?g|png)$/i,"");
  const sizes = [640, 800, 1200, 1600];
  for (const w of sizes){
    const out = `dist/images/${base}-${w}.webp`;
    mkdirSync(dirname(out), { recursive: true });
    await sharp(src).resize({ width: w }).webp({ quality: 78 }).toFile(out);
  }
}

For a deeper primer on why these details matter for rankings, read optimizing images for performance.

Container Queries: When the Grid Lives in Different Contexts

Not to go too deep down the rabbit hole, but responsive design based only on viewport width breaks down when the same component appears in a sidebar in one template and full-width in another. Luckily, the solution is as simple as setting container queries which let you adjust columns based on the parent’s inline size. Systems only work at scale if you actively seek to avoid one-offs.

.gallery-wrap{ container-type: inline-size; }
@container (min-width: 40rem){
  .gallery{ grid-template-columns: repeat(3, 1fr); }
}
@container (min-width: 64rem){
  .gallery{ grid-template-columns: repeat(4, 1fr); }
}

The underlying math uses the element’s size, not the viewport. The formal definition of aspect ratio (which browsers use with your width/height attributes) lives in the CSS Sizing spec: W3C: CSS Sizing Level 4.

“Masonry” Layouts Without Heavy JS (Progressive Enhancement)

Masonry is the trade and art of stonework and building.

Traditional coursed masonry stacks stones in neat, perfectly aligned rows — just like a strict CSS Grid with equal row heights.

Random rubble masonry arranges stones of different sizes where they best fit, leaving staggered edges.

That’s exactly what the CSS column-count technique does: items flow down columns at their natural height, creating that “Pinterest-style” staggered look.

The trade-off is the same as in stonework: you lose a perfectly level baseline, but you gain speed and simplicity. No JavaScript layout engine, no extra reflows — just native CSS doing the heavy lifting. For many editorial sites, portfolios, or blog grids, that trade is worth it.

.masonry{ column-count: 1; column-gap: 1rem; }
@media (min-width: 640px){ .masonry{ column-count: 2; } }
@media (min-width: 1024px){ .masonry{ column-count: 3; } }
.masonry figure{ break-inside: avoid; margin: 0 0 1rem; }

Use CSS Grid for “coursed” precision when you need straight rows. Use the column-flow “random rubble” pattern when you want speed, visual rhythm, and less code. If you need a modal viewer on top, keep it progressive: ship plain links first, then layer on a dialog with JavaScript when available. For more on this philosophy, see progressive enhancement in practice and my lightweight lightbox tutorial.

Turn Tiles Into Linkable Cards (Keyboard & Screen Reader Safe)

Suggestion: wrap the media and caption in a single focusable link to avoid nested interactive controls and keeps the card’s name + purpose unambiguous for assistive technologies, which also means using a high-contrast focus treatment that is visible over images.

<a href="/case-studies/homepage/"
   class="group block focus:outline-none focus:ring-2 focus:ring-offset-2 focus:ring-[#d4a856] rounded-lg">
  <figure class="space-y-2">
    <img src="/img/hero-800.jpg" width="1200" height="800"
         alt="Homepage redesign" loading="lazy" decoding="async"
         class="w-full aspect-[3/2] object-cover rounded-lg group-hover:opacity-95 transition">
    <figcaption class="text-sm text-gray-600">Homepage redesign</figcaption>
  </figure>
</a>

If the grid backs a blog or documentation hub, structure your URLs and internal links deliberately. Related reading that reinforces this approach: handling responsive breakpoints and lazy vs. eager loading.

Optional: Tiny, Accessible Lightbox

If a lightbox is a requirement, keep it boring and accessible. The HTML <dialog> element handles focus management well when used correctly. Provide an obvious close control and support the Escape key.

<dialog id="lb" class="backdrop:bg-black/70 p-0 rounded-lg">
  <button class="absolute top-3 right-3 p-2" aria-label="Close" data-close>✕</button>
  <img alt="" id="lb-img" class="max-h-[90vh] max-w-[90vw] object-contain">
  <p id="lb-cap" class="p-3 text-sm text-gray-200"></p>
</dialog>
<script>
  const d = document.getElementById('lb'), img = document.getElementById('lb-img'), cap = document.getElementById('lb-cap');
  document.addEventListener('click', e =>{
    const a = e.target.closest('[data-lightbox]');
    if(!a) return;
    e.preventDefault();
    img.src = a.href; img.alt = a.dataset.alt || '';
    cap.textContent = a.dataset.caption || '';
    d.showModal();
  });
  d.addEventListener('click', e =>{ if(e.target.matches('[data-close], dialog')) d.close(); });
  document.addEventListener('keydown', e =>{ if(e.key==='Escape' && d.open) d.close(); });
</script>

If you prefer a no-JS pattern, link thumbnails to their large versions and let the browser handle navigation. It is not flashy, but it is resilient.

Common Developer Pain Points (and How This Pattern Solves Them)

1) “My grid looks fine, but scores tank on mobile.”

On real devices, the first contentful image in a grid often becomes the LCP element by default. If its source is oversized, compressed poorly, or discovered late — Lighthouse will nose-dive — so use derivative widths, honest sizes, and preload the first visible image when necessary. Track LCP in the field, not just in the lab. Reference: Lighthouse: LCP.

2) “We fixed everything and CLS still spikes on first load.”

Check that every image renders with dimensions on initial HTML and not just after hydration. If a CMS or component strips width/height then the browser cannot reserve space and you will see shift. Also, inspect third-party components above the grid (promo bars, consent banners) as they often cause the shift you blame the gallery for. Reference: web.dev: Cumulative Layout Shift.

3) “Design wants cards to work in three different templates.”

Reach for container queries because the grid decides column count based on the container, not the viewport, so the same component works full-bleed on a landing page and snug in a two-column blog layout. This prevents a cascade of one-off media queries. Spec reference for aspect ratios that underpin the reserved-space model: W3C CSS Sizing.

4) “Engineering time disappears into thumbnail production.”

If it works, automate it — add a build step that crawls a folder, generates derivative widths in avif/webp, writes predictable filenames, and logs any asset that exceeds a byte budget. Keep humans out of repetitive export chores by investing in a systems view. For a broader build-systems view, see file structure for speed and scale.

5) “Accessibility reviews keep flagging our galleries.”

Most issues are basic: non-descriptive alt text, missing captions where context is needed, focus rings that disappear on dark imagery, and cards with nested links — so solve these at the pattern level once and for all. MDN’s guidance on <picture> and responsive images covers authoring details well: MDN picture, MDN responsive images.

QA Checklist (Ship With Confidence)

— No layout shifts: every image has width/height or an explicit aspect-ratio.
— srcset/sizes reflect real rendered widths at each breakpoint.
— Alt text is descriptive; captions appear when context adds meaning.
— Keyboard flow reaches every card; focus ring is visible against imagery; Escape closes lightbox.
— Byte budget enforced; typical tile ≤ 150 KB; mobile LCP ≤ 2.5 s on mid-tier hardware.
— Non-critical images are lazy-loaded; the first visible tile is eager-loaded if it is LCP. See: web.dev on LCP.