In part 3 of our System Design series, we’re tackling caching and load balancing — the unsung heroes of performance. Without them, systems crumble under scale.
We’ll cover:
- Caching – App/DB/CDN; write-through/write-back, TTLs
- Cache Invalidation – TTLs, versioning, stampede protection
- Load Balancing – L4/L7, round-robin, least-connections, hashing
1. Caching
TL;DR: Caching is your first lever for scale. Use it everywhere, but know the trade-offs.
- App cache: In-memory (Redis, Memcached). Ultra-fast but volatile.
- DB cache: Query or object cache to offload hot queries.
- CDN cache: Push static assets near users.
Strategies:
- Write-through: Write to cache + DB simultaneously (safe, consistent, slower writes)
- Write-back: Write to cache first, sync to DB later (fast, risky if cache crashes)
- TTL (Time To Live): Expire stale data automatically
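The two write strategies above can be sketched in a few lines. This is a minimal illustration using plain dicts to stand in for a cache (e.g., Redis) and a database; the `dirty` set and `flush` helper are hypothetical names, not a real client API.

```python
# Minimal sketch of write-through vs. write-back.
cache: dict = {}
db: dict = {}
dirty: set = set()  # keys written to cache but not yet persisted to the DB

def write_through(key, value):
    """Write to DB and cache together: consistent, but writes pay DB latency."""
    db[key] = value
    cache[key] = value

def write_back(key, value):
    """Write to cache only; DB syncs later. Fast, but anything still in
    `dirty` is lost if the cache crashes before flush() runs."""
    cache[key] = value
    dirty.add(key)

def flush():
    """Periodically sync dirty keys from cache to DB."""
    for key in list(dirty):
        db[key] = cache[key]
        dirty.discard(key)

write_through("a", 1)
write_back("b", 2)  # "b" is not yet in the DB
flush()             # now it is
```

The trade-off is visible in the window between `write_back` and `flush`: reads from cache are fast and fresh, but the DB lags behind.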
👉 Example: A news homepage caches top stories for 30s — thousands of requests saved.
👉 Interview tie-in: “How would you scale a read-heavy service?” — caching is the first answer.
2. Cache Invalidation
TL;DR: The hardest part of caching isn’t caching — it’s invalidation.
- TTL: Safe default, but may serve stale data.
- Versioning: Change cache key when data updates (e.g., `user:v2:123`)
- Stampede protection: Use locking or request coalescing so multiple clients don’t hammer the DB when cache expires.
👉 Example: If 1M users refresh when a cache expires, that’s a cache stampede. Use jittered TTLs or async refresh.
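Both stampede defenses mentioned above can be combined in one read path: jitter the TTL so keys set together don't expire together, and hold a lock around the recompute so only one caller hits the DB. A sketch, assuming a single-process cache; in production this would be a distributed lock or a library feature:

```python
import random
import threading

lock = threading.Lock()
cache: dict = {}  # key -> (value, expires_at)

def jittered_ttl(base: float, jitter: float = 0.2) -> float:
    """Spread expirations +/-20% so hot keys don't all expire at once."""
    return base * (1 + random.uniform(-jitter, jitter))

def get_or_load(key, now: float, loader):
    entry = cache.get(key)
    if entry and entry[1] > now:
        return entry[0]
    # Coalesce: only one caller recomputes; the rest wait and reuse it.
    with lock:
        entry = cache.get(key)  # re-check after acquiring the lock
        if entry and entry[1] > now:
            return entry[0]
        value = loader()
        cache[key] = (value, now + jittered_ttl(30))
        return value
```

With this shape, 1M concurrent refreshes on an expired key produce one DB query, not 1M.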
👉 Interview tie-in: They’ll ask “What’s the hardest part about caching?” — answer: invalidation and consistency.
3. Load Balancing
TL;DR: Load balancers spread requests across servers and hide failures.
- L4 (Transport): Balances based on IP/port. Simple, fast.
- L7 (Application): Smarter — routes based on headers, cookies, paths.
Algorithms:
- Round Robin: Even distribution
- Least Connections: Send to the server with fewest active requests
- Hashing: Sticky sessions (e.g., same user → same server)
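The first two algorithms above fit in a few lines each. A minimal sketch over an in-memory server pool (server names and the `dispatch`/`finish` helpers are illustrative):

```python
from itertools import count

servers = {"s1": 0, "s2": 0, "s3": 0}  # server -> active connection count
_rr = count()

def round_robin() -> str:
    """Cycle through servers in order, ignoring load."""
    names = sorted(servers)
    return names[next(_rr) % len(names)]

def least_connections() -> str:
    """Pick the server with the fewest in-flight requests."""
    return min(servers, key=servers.get)

def dispatch() -> str:
    server = least_connections()
    servers[server] += 1  # request starts
    return server

def finish(server: str):
    servers[server] -= 1  # request completes
```

Round robin is fine when requests are uniform; least connections adapts when some requests (or servers) are slower than others.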
👉 Example: An e-commerce app uses an L7 LB to route /images → CDN and /checkout → the payment cluster.
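The core of that L7 routing decision is a prefix match on the request path. A sketch with illustrative pool names (any real LB would also consider headers, host, and health checks):

```python
# Sketch of L7 path-based routing: first matching prefix wins.
ROUTES = [
    ("/images", "cdn-pool"),
    ("/checkout", "payment-cluster"),
]
DEFAULT_POOL = "web-pool"

def route(path: str) -> str:
    for prefix, pool in ROUTES:
        if path.startswith(prefix):
            return pool
    return DEFAULT_POOL

print(route("/images/logo.png"))   # cdn-pool
print(route("/checkout/confirm"))  # payment-cluster
```

An L4 balancer can't do this: it sees only IPs and ports, never the path.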
👉 Interview tie-in: “How do you handle uneven traffic across servers?” — least-connections or weighted load balancing.
✅ Takeaways
- Cache where it hurts most: hot queries, static assets, read-heavy endpoints
- Invalidation is the real challenge; plan strategies upfront
- Load balancing is critical for fairness, resilience, and routing logic
💡 Practice Question:
“Design the caching strategy for a Twitter timeline. How would you avoid cache stampede during trending events?”