Jul 3Saturday, July 4, 2026 · all days
1.The bottleneck might be the air in the room(blog.mikebowler.ca)
411 points by gslin 6 hours ago | 244 comments | permalink
tl;dr: CO2 levels in closed meeting rooms routinely climb past 1,000-2,000 ppm within an hour, and research from Lawrence Berkeley and Harvard shows measurable declines in decision-making and strategic thinking at those concentrations. The impairment is invisible from inside—people just feel foggy and blame the meeting length. Before blaming teams for poor performance or disengagement, check the air: a cheap CO2 monitor and an open window may fix what looks like a people problem.
HN Discussion:
  • Awareness via integrated CO2 monitors in devices would solve the problem
  • Skepticism about the actual science behind CO2 productivity claims
  • Counter-evidence from submarines shows no ill effects at high CO2 levels
  • Sharing practical hardware recommendations for measuring indoor CO2
  • ~Correlation concern: CO2 may proxy for other pollutants, not be causal itself
2.Performance per dollar is getting faster and cheaper(wafer.ai)
269 points by latchkey 14 hours ago | 100 comments | permalink
tl;dr: Wafer benchmarked GLM-5.2 on AMD's MI355X (roughly 2.75x cheaper than NVIDIA's B300) and hit 213 tok/s single-stream and 2626 tok/s/node aggregate throughput—about 80% of B200 performance at less than half the cost. Getting there required MXFP4 quantization via AMD Quark, switching to sglang, and fixing two ROCm bugs blocking speculative decode plus tuning the MoE kernel selection, but notably no custom kernels. The takeaway: NVIDIA's CUDA moat is increasingly about day-0 model support rather than fundamental software superiority.
HN Discussion:
  • FP4 quantization degrades model quality, undermining the performance claims
  • Headlines should disclose quantization since benchmarks aren't comparing full-fat models
  • Requesting performance-per-watt metrics to better evaluate AMD's competitiveness
  • The aggregate throughput number is misleading versus real single-stream throughput
  • ~NVIDIA's next-gen Rubin will leapfrog Blackwell on inference, limiting AMD's window
3.Leanstral 1.5: Proof abundance for all(mistral.ai)
270 points by programLyrique 14 hours ago | 79 comments | permalink
tl;dr: Leanstral 1.5 is an Apache-2.0 licensed Lean 4 proof model (119B total/6B active params) that saturates miniF2F, solves 587/672 PutnamBench problems at ~$4 each (vs. ~$300 for Seed-Prover), and sets new SOTA on FATE-H/X. Trained via mid-training, SFT, and RL with CISPO across multiturn and code-agent environments, it also demonstrates real-world utility by finding 5 previously unknown bugs across 57 repositories and proving O(log n) complexity for AVL trees. Weights and a free API are available on Hugging Face.
HN Discussion:
  • Mistral's niche of high-quality specialized small models is valuable despite not competing with frontier models
  • The highlighted bug-finding example is unconvincing since testing/fuzzing would easily catch that boundary case
  • Benchmark comparisons use outdated competitor models, making the SOTA claims less impressive
  • Questions about practical accessibility for non-Lean users and choice of Lean 4 over Isabelle/TLA+
  • Enthusiasm for Lean and the release, with interest in integrating it into existing tools
4.Giant trees have no trouble pumping water to top branches: new research(news.exeter.ac.uk)
217 points by hhs 14 hours ago | 102 comments | permalink
tl;dr: New research on Dipterocarp trees in Malaysian Borneo (ranging 7-71 meters tall) contradicts the long-held theory that height impairs water transport and increases drought vulnerability. The tallest trees compensate through wider water-carrying vessels near the ground and leaves adapted to withstand greater water stress, showing no height-related growth loss during the 2023-2024 El Niño drought. The findings challenge current climate models that predict tall trees face higher drought mortality risk—significant because the tallest 1% of trees store over half of forests' above-ground carbon.
HN Discussion:
  • Plants are adaptable and this finding is unsurprising given their malleability
  • Findings contradict prior research and don't explain why no trees exceed 130m
  • ~Tall trees rely on other mechanisms like fog absorption, not just root pumping
  • Alternative theories like structured water may explain water transport in tall trees
  • The conclusion is obvious and merely restates what should already be known
5.SearXNG: A free internet metasearch engine(github.com)
231 points by theanonymousone 16 hours ago | 63 comments | permalink
tl;dr: Summary not available.
HN Discussion:
  • Long-time satisfied users endorse SearXNG as their daily search engine despite minor tradeoffs
  • ~Original creator has moved on due to metasearch concept limitations, promoting an alternative project
  • ~Reliability issues with scraping backends like Google and DuckDuckGo cause rate limiting and captchas
  • SearXNG is valuable as a search tool for local AI models and coding agents via MCP integrations
  • Appreciated as a privacy-respecting way to degoogle and avoid unwanted AI features
6.Jamesob's guide to running SOTA LLMs locally(github.com)
362 points by livestyle 21 hours ago | 163 comments | permalink
tl;dr: A detailed hardware/software guide for running SOTA LLMs locally, ranging from a $2k dual-RTX-3090 setup (running Qwen3.6-27B and Whisper STT) to a $40k rig with 4× RTX PRO 6000s (384GB VRAM) running GLM-5.2-594B at near-Opus quality. The build uses a last-gen EPYC/DDR4 base with c-payne PCIe Gen4 switches for GPU peer-to-peer communication, and covers finicky details like BIOS bifurcation, ACS disabling, IOMMU quirks, and power-limiting to run $46k of GPUs on a 110V circuit.
HN Discussion:
  • Total cost is understated; real build closer to $50-55K with quantization caveats
  • Local LLMs are wildly uneconomical compared to cloud subscriptions like Claude/Codex
  • The 'almost-Opus' quality claim is misleading given aggressive pruning and quantization
  • ~Cheaper alternatives like M5 MacBooks or single GPUs offer better value for most users
  • ~Mid-range unified-memory options (96-128GB) are a better compromise than the extremes presented
7.Costco is the anti-Amazon(phenomenalworld.org)
432 points by bookofjoe 21 hours ago | 399 comments | permalink
tl;dr: Costco's business model—limited SKUs (~4,000 vs. Walmart's 130,000), in-person bulk shopping, and minimal distribution infrastructure—is more logistically efficient and socially beneficial than Amazon's infinite-assortment, home-delivery approach, resulting in lower overhead, higher wages, and just 6% employee turnover. The author argues this constraint-based model is more generalizable than Amazon's, which relies on complex last-mile delivery that isn't universally scalable. The piece concludes that NYC Mayor Mamdani's proposed public grocery stores should adopt Costco's playbook: low SKU count, high volume, centralized distribution at sufficient scale, and a signature loss-leader item.
HN Discussion:
  • Costco's approach of avoiding the last-mile problem exemplifies wise engineering by sidestepping complexity
  • Costco represents American excellence and is admirable as a national achievement
  • The article overlooks that Costco offers delivery via Instacart, undermining the strict in-person contrast
  • Costco caters to affluent suburbanites with big vehicles and storage, limiting its generalizability
  • ~Costco isn't really competing with Amazon since they sell different categories, weakening the comparison
8.Factories are just rooms(interconnected.org)
255 points by arbesman 21 hours ago | 107 comments | permalink
tl;dr: The author visited his kid's school to talk about manufacturing his AI clock, sharing photos from a Shenzhen factory, prototypes, and CAD sketches to demystify how physical products get made. He deliberately avoided the typical "awe-inspiring" factory footage because it makes manufacturing feel distant and untouchable—instead he wanted 7-year-olds to see that "factories are just rooms" and that they could become the designers, engineers, and makers behind everyday objects. He encourages others to do the same at their local schools.
HN Discussion:
  • Demystifying making preserves the curiosity and 'you can do that' mindset in kids
  • Personal experience in manufacturing/small factories confirms the accessibility and joy of making things
  • The 'factory is just a room' framing oversimplifies large industrial complexes and complex plants
  • Everyday analogs like kitchens are factories too, reinforcing accessibility of manufacturing
  • ~Teaching kids the leap from making one to making many is the key valuable lesson
9.Hunting a 16-year-old SQLite WAL bug with TLA+(ubuntu.com)
220 points by peterparker204 4 days ago | 26 comments | permalink
tl;dr: Canonical's dqlite team used TLA+ to model a 16-year-old SQLite WAL checkpoint bug that could corrupt databases via a data race: a second checkpoint could miss that the WAL was reset by a concurrent writer, causing it to incorrectly mark frames as backfilled and later skip transactions. The model quickly reproduced the bug in ~20 states, matching SQLite's own description. Modeling dqlite showed it's unaffected since it acquires the write lock during checkpoints, preventing the race. SQLite's fix adds a salt comparison to detect WAL resets before proceeding.
HN Discussion:
  • Background context on TLA+ and its creator Leslie Lamport
  • Author is available to answer questions about the post
  • Title is misleading since article is really about proving dqlite safe, not hunting the SQLite bug
  • Interest in porting TLA+ concepts to Lean with tactics
  • Speculation about using TLA+ models as verification for LLM-generated code
10.Espionage Against the European Parliament(citizenlab.ca)
394 points by ledoge 16 hours ago | 98 comments | permalink
tl;dr: Citizen Lab found that former MEP Stelios Kouloglou was infected with NSO Group's Pegasus spyware at least twice (October 2022 and March 2023) while serving on the European Parliament's PEGA Committee investigating spyware abuses, likely exposing confidential committee deliberations. Researchers did not attribute the attack to a specific government but noted infrastructure overlap with a prior Pegasus campaign targeting Russian and Belarusian exiled journalists in Europe, suggesting an operator licensed across multiple EU jurisdictions. This marks the first known case of a sitting PEGA Committee member being hacked during the committee's work.
HN Discussion:
  • Apple's delayed threat notifications amount to security theatre rather than real protection
  • ~The attack likely came from an EU member state (Greece) rather than being an attack on the EU parliament itself
  • Smartphone architecture is fundamentally insecure and shouldn't be trusted for sensitive work
  • EU parliament should have policy separating work and personal devices
  • EU member states have a pattern of abusing Pegasus with no consequences, making this unsurprising
11.Wordgard: In-browser rich-text editor from the creator of ProseMirror(wordgard.net)
303 points by indy 1 day ago | 100 comments | permalink
tl;dr: Summary not available
HN Discussion:
  • ~Users want clarity on why Wordgard exists and note lack of upgrade path from ProseMirror
  • Website's artwork and design are beautiful and stand out
  • Editor fails basic mobile input tests like iPhone autocorrect and swipe backspace
  • Excitement and validation seeing an evolution of ProseMirror concepts
  • ~Wishes for built-in features like typed JSON schemas and @-mentions that ProseMirror lacked
12.Half-Baked Product(weli.dev)
1287 points by weli 1 day ago | 386 comments | permalink
tl;dr: Summary not available
HN Discussion:
  • Founders motivated by wealth rather than domain expertise cause startup failures
  • Satirical parody reinforcing the article's critique of half-baked startups
  • The fundamental problem is disconnect between specialists who ignore other domains
  • ~The story is timeless and could benefit from the salesperson's perspective
  • Personal recognition of the described dysfunctions from own career experience
13.Valve open-source the Steam Machine e-ink screen so you can make your own(gamingonlinux.com)
566 points by ahlCVA 23 hours ago | 106 comments | permalink
tl;dr: Valve has open-sourced the design files for the Steam Machine's e-ink front display (dubbed "Inkterface") on their GitLab under an MIT license, since they won't be manufacturing it themselves. Builders will need parts like an Adafruit ESP32 Feather, an eInk breakout board, a 5.83" monochrome panel, and some screws and magnets to assemble one. Third parties like JSAUX have already indicated plans to sell pre-built versions.
HN Discussion:
  • Praise for Valve's open-source approach and hope more companies follow suit
  • Sharing technical details or related DIY projects inspired by the release
  • Interest in adapting the concept to other hardware like Framework Desktop
  • Questioning the practical usefulness of an e-ink faceplate versus an active display
  • Curiosity about Valve's business model and the economics of their openness
14.60% Fable cost cut by converting code to images and having the model OCR it(github.com)
283 points by dimitropoulos 20 hours ago | 92 comments | permalink
tl;dr: pxpipe is a local proxy that rewrites bulky Claude Code request context (system prompts, tool docs, older history) into PNG images, exploiting the fact that dense text packs ~3.1 chars per image-token vs ~1 char per text-token, cutting end-to-end costs ~59-70% on Fable 5. The tradeoff is lossy verbatim recall — exact strings like hashes and IDs can silently confabulate, so it's gated to token-dense content and specific models (Fable 5, GPT 5.6), with recent turns and byte-exact data kept as text. SWE-bench results show near-parity with 60-65% token reduction.
HN Discussion:
  • ~This is a pricing loophole that providers will likely close once they optimize token accounting
  • Tried this approach before and found it increased completion tokens making it more expensive overall
  • This is wasteful pricing arbitrage that promotes digital junk and exploits a pricing failure
  • Using images for agent context is genuinely effective and leads to better workflows
  • Related prior work and techniques exist (DeepSeek paper, other tools) validating the approach
15.Protect your right to run local AI(righttointelligence.org)
522 points by thoughtpeddler 1 day ago | 185 comments | permalink
tl;dr: Summary not available
HN Discussion:
  • Restrictions on local AI are unlikely given massive OEM industry investment in local LLM hardware
  • Agrees local AI is important to preserve against fragile, invasive cloud-dependent alternatives
  • ~Real threat is hardware access via Nvidia prioritizing datacenters, not software laws
  • Article makes claims about restrictive laws without providing evidence or specifics
  • Regulation will happen sneakily via 'certification' requirements rather than outright bans, validating concerns
16.Holes(xkcd.com)
224 points by caminanteblanco 18 hours ago | 39 comments | permalink
tl;dr: Summary not available
HN Discussion:
  • Sharing supplementary links and references related to holes mentioned in the article
  • Highlighting fascinating facts about Lake Baikal's depth and sediment
  • Expressing awe or terror at specific holes like Mponeng Gold Mine and crystal caves
  • ~Pointing out a misspelling (Derinkuyu) despite the article's evident effort
  • Asking clarifying questions about article content (the 'oops' markers)
17.Android Developer Verification: Threat masquerading as protection(f-droid.org)
1693 points by drewfax 2 days ago | 727 comments | permalink
tl;dr: F-Droid argues that Google's upcoming Android Developer Verification (ADV) program—rolling out September 30 in Brazil, Indonesia, Singapore, and Thailand—is effectively a pre-installed gatekeeping mechanism that will block apps from developers not centrally registered with Google, requiring ID verification and agreement to vague terms where "malware" is undefined. They warn this threatens sideloading, alternative app stores like F-Droid, and could be weaponized against legitimate software like ad-blockers. The fate of already-installed F-Droid apps and user data after activation remains unclear.
HN Discussion:
  • Users should switch to alternative Linux-based mobile OSes to escape Google's control
  • Article's tone is childish and undermines F-Droid's credibility with unserious rhetoric
  • Device ownership means users should decide what to install, not Google
  • Anonymous software installation is essential for freedom and resisting authoritarianism
  • Regulators like the EU should intervene against this monopolistic overreach
18.CarPlay Is Additive(caseyliss.com)
561 points by sprawl_ 1 day ago | 698 comments | permalink
tl;dr: Rivian's CSO defended the company's refusal to support CarPlay by claiming it "takes over every single pixel," but the author counters that standard CarPlay doesn't take over the whole screen (only optional CarPlay Ultra does), and that CarPlay is entirely optional for drivers anyway. The author argues CarPlay is purely additive—if Rivian's native UI is great, customers won't use it—and says he won't buy a Rivian until they add support, despite otherwise loving the vehicles.
HN Discussion:
  • CarPlay provides valuable cross-vehicle consistency and personalization that native systems can't match
  • CarPlay is a must-have feature and dealbreaker when purchasing a new vehicle
  • Rivian's native infotainment is good enough that CarPlay isn't missed in practice
  • CarPlay is actually inferior to well-designed native systems like Tesla's for key tasks
  • Modern infotainment systems broadly are becoming dangerous attention-grabs, regardless of CarPlay
19.Postgres transactions are a distributed systems superpower(dbos.dev)
217 points by KraftyOne 1 day ago | 92 comments | permalink
tl;dr: Co-locating workflow state and application data in the same Postgres database lets you commit both in a single transaction, eliminating the idempotency and atomicity problems that plague durable workflow engines. This means transactional steps get exactly-once semantics without bookkeeping tables, and you can replace the transactional outbox pattern by enqueuing workflows via a UDF in the same transaction as your app updates. The post is from DBOS, which builds a Postgres-backed durable execution framework around this idea.
HN Discussion:
  • Personal validation from experience that DB+queue atomicity is impossible without this approach
  • Confirmation from practitioners who built similar in-database queue/pubsub solutions successfully
  • Skepticism that this qualifies as a distributed system rather than centralized services
  • ~Confusion about the UDF/outbox replacement claim since side effects still require idempotency
  • Criticism that centralizing on Postgres creates a single point of failure
20.PeerTube is a free, decentralized and federated video platform(github.com)
663 points by doener 2 days ago | 346 comments | permalink
tl;dr: PeerTube is an open-source, federated video platform from Framasoft that uses ActivityPub to interoperate with the Fediverse (Mastodon, Pleroma, etc.) and WebRTC for peer-to-peer streaming to reduce bandwidth costs. It supports both video-on-demand and live streaming, lets instances cache each other's content for redundancy, and offers creator support via donation links rather than ads or recommendation algorithms. Anyone can self-host an instance, and users can follow channels across instances without needing accounts on each.
HN Discussion:
  • Lack of monetization makes PeerTube impractical for professional content creators
  • Content and audience are lacking, limiting real-world usefulness of the platform
  • Successfully using PeerTube for open source project tutorials shows practical viability
  • ~PeerTube only solves distribution, not discovery, monetization, or hosting problems
  • Poor UX and browsing experience hurts adoption of federated platforms like PeerTube