Jun 28Monday, June 29, 2026 · all days
1.HackerRank open sourced its ATS. My resume scored 90/100. Oh wait 74. No – 88(danunparsed.com)
518 points by sambellll 10 hours ago | 206 comments | permalink
tl;dr: HackerRank's open-sourced ATS (hiring-agent) produces wildly inconsistent resume scores—the same resume scored anywhere from 66 to 99 across 100 runs, meaning a candidate could fail an 85-point cutoff 65% of the time purely by chance. The author traces this to flawed design: subjective categories like "projects" rely on LLM vibe-checks with no proper rubric, while "experience" gives everyone 25/25 regardless of seniority due to a two-line prompt. With 65% of the score weighted on open source and projects, the tool filters by noise rather than quality.
HN Discussion:
  • LLM stochasticity is poorly understood and confirms why resume screening is broken
  • Inconsistent randomness in hiring is absurd and unfair to candidates
  • Weighting open source/projects heavily disadvantages candidates with other life commitments
  • From a recruiter's perspective, the randomness is actually an acceptable filtering rate given application volume
  • The tool is poorly built (tiny model, vibe-coded) which explains its flaws
2.GLM 5.2 beats Claude in our benchmarks(semgrep.dev)
882 points by jms703 18 hours ago | 407 comments | permalink
tl;dr: Semgrep benchmarked open-weight and frontier models on IDOR vulnerability detection and found Zhipu AI's GLM 5.2 scored 39% F1 with just a bare prompt, beating Claude Code (32%) at roughly $0.17 per bug found. Both were beaten by Semgrep's own multimodal pipeline (53-61% F1), suggesting the harness/scaffolding matters more than the underlying model. The authors caution this is a single task on one dataset, but argue GLM 5.2's performance at ~1/6 the cost of frontier models—plus the ability to run locally—makes open weights newly viable for security teams.
HN Discussion:
  • GLM 5.2 is a genuinely capable, cost-effective workhorse for daily coding tasks
  • Open Chinese models are catching up or surpassing US frontier models, especially in specific domains like cybersecurity
  • ~Other open models like DeepSeek may actually outperform GLM 5.2 across broader benchmarks
  • The article's title and conclusions are misleading; one narrow benchmark doesn't generalize and terminology is sloppy
  • ~Coding-focused evaluation ignores broader concerns like model bias and non-programmer use cases
3.Age verification is just a precursor to automated attribution of speech(nonogra.ph)
551 points by arkhiver 8 hours ago | 313 comments | permalink
tl;dr: Age verification laws, marketed as child protection measures, are actually identity attribution systems that link online accounts to real-world identities like SSNs and government IDs. This solves law enforcement's traditionally labor-intensive problem of identifying anonymous speakers, potentially enabling automated enforcement against inconvenient speech once adoption is widespread. The author urges readers to refuse verification, or if unavoidable, use third-party services paid in Monero.
HN Discussion:
  • Public lacks systems thinking to anticipate second-order effects of age verification laws
  • Age verification is part of broader control infrastructure including device attestation and surveillance
  • Identity layer will inevitably expand beyond original purpose via scope creep
  • ~Solution lies in decentralized alternatives rather than worrying about verification itself
  • Users should preemptively prune social media history before retroactive enforcement arrives
4.Historical memory prices 1960-2026(dam.stanford.edu)
322 points by vga1 17 hours ago | 118 comments | permalink
tl;dr: An interactive dataset tracking historical memory prices ($/GB) from 1960 to 2026 across DRAM, NAND flash, and HBM, extending John C. McCallum's classic dataset with monthly updates from Keepa (Amazon retail) and quarterly HBM estimates from Epoch AI. It breaks out DRAM by generation (SDRAM through DDR5), HBM by generation (HBM2e through projected HBM4), and includes a modeled accelerator cost breakdown for Nvidia, AMD, Google TPU, and AWS Trainium. Raw CSV data is downloadable; caveats note that "cheapest retail" often reflects EOL clearance rather than leading-edge pricing.
HN Discussion:
  • ~Pre-1990 per-GB pricing is unrealistic and not inflation-adjusting distorts the historical picture
  • The data refutes memory manufacturers' claims that RAM/storage are no longer commodities
  • The DRAM data is misleading because recent points reflect EOL DDR3 rather than current generations
  • Curiosity about what drives the visible cyclical price patterns in the data
  • AI and crypto demand are driving price volatility and hurting consumers
5.5k menus from the New York Public Library’s Buttolph Collection (1880-1920)(pudding.cool)
375 points by xbryanx 21 hours ago | 96 comments | permalink
tl;dr: Summary not available
HN Discussion:
  • Sharing tangential historical or cultural trivia related to menus and food traditions
  • Appreciation for the collection's historical insights into food trends and dish categories
  • Recommending related resources or curated companion content to explore the menus
  • Fascination with economic and pricing aspects of historical menus
  • Interest in the typography and printing techniques used in the menus
6.I used Claude Code to get a second opinion on my MRI(antoine.fi)
447 points by engmarketer 19 hours ago | 584 comments | permalink
tl;dr: After an orthopedist diagnosed a Grade III partial-thickness subscapularis tendon tear and began aggressive treatment (including shockwave therapy and a homeopathic injection), the author ran their DICOM MRI files through Claude Code (Opus) for a second opinion, which instead found an intact tendon with only mild tendinosis. A follow-up arbitration run by Claude sided with the "no tear" reading at moderate-to-high confidence. The author acknowledges they can't fully trust either verdict but is now skeptical of the clinic's intervention-heavy plan.
HN Discussion:
  • Radiologists warn that current AI models are unreliable at interpreting MRI images and shouldn't be trusted
  • AI offers comfort and accessibility for second opinions despite trust concerns about both AI and doctors
  • Doctors often have financial incentives pushing unnecessary interventions, validating skepticism of aggressive treatment
  • Shoulder abnormalities are extremely common incidental findings, supporting skepticism of the original tear diagnosis
  • ~Medical diagnosis is inherently non-deterministic and expecting consistent readings misunderstands medicine
7.Show HN: Zanagrams(zanagrams.com)
298 points by pompomsheep 20 hours ago | 73 comments | permalink
tl;dr: Summary not available
HN Discussion:
  • Praise for the game's minimal UI, animations, and overall design quality
  • ~Confusion about the game's name and mechanics initially made it hard to understand
  • ~Suggestions for improvements like timed mode, undo, hints, or showing solutions
  • Criticism of specific design choices like bonus words concept and word list omissions
  • Game is a reimplementation of existing puzzle Ribbit from Puzzmo
8.The KIDS Act would require age checks to get online(eff.org)
504 points by bilsbie 23 hours ago | 403 comments | permalink
tl;dr: Congress is fast-tracking the KIDS Act, a bundle that combines a revised KOSA with bills like SAFE BOTS and SCREEN, imposing liability whenever platforms "knew or should have known" a user is a minor—effectively forcing age verification (via ID or facial scans) for all users despite disclaimers to the contrary. The package also pressures platforms to moderate broad categories of lawful speech (addiction, gambling, fraud discussions) and contains encryption carve-outs with loopholes that could undermine private and ephemeral messaging.
HN Discussion:
  • International coordination suggests special interest lobbying behind global internet lockdown push
  • Citizens should contact representatives to oppose the bill
  • Research doesn't actually support claims that social media harms kids' mental health
  • Mandatory ID disclosure online contradicts longstanding privacy advice
  • The bill's scope may be narrower than claimed, excluding many sites like HN
9.Professor denounces mass AI fraud on an exam at Brown(english.elpais.com)
421 points by geox 19 hours ago | 559 comments | permalink
tl;dr: Brown University economics professor Roberto Serrano caught at least 50 students cheating on a take-home midterm using ChatGPT, after the class averaged 96/100 but dropped to 48/100 on an in-person final—with 22 of 27 no-shows having previously scored perfect 100s. Serrano criticized Brown's administration for responding with silence and is abandoning take-home exams, while urging broader debate on AI-enabled academic fraud. The incident reflects a wider trend: Princeton recently ended its 133-year-old unproctored honor code system in response to AI cheating.
HN Discussion:
  • In-person, handwritten exams are now necessary; AI era may strengthen degree signaling value
  • Universities must redesign courses adversarially with paper exams and 1-on-1 interviews to combat AI cheating
  • Honest students are forced to cheat to stay competitive when peers use AI on curved grading
  • The professor is being a luddite; he should just switch to in-person exams instead of complaining
  • ~Take-home closed-book exams are inherently flawed; AI isn't the real problem here
10.Librepods: AirPods liberated(github.com)
405 points by rbanffy 17 hours ago | 138 comments | permalink
tl;dr: LibrePods reverse-engineers Apple's proprietary AirPods protocol to bring exclusive features—like noise control switching, ear detection, accurate battery status, conversational awareness, and head gestures—to Linux and Android. Spoofing a Vendor ID as Apple's unlocks additional capabilities such as accessibility settings and hearing aid customization, while features like Find My, spatial audio, heart rate monitoring, and high-quality two-way audio are planned but likely require root. The project is GPLv3-licensed and warns that librepods.org is an unofficial site falsely claiming affiliation.
HN Discussion:
  • Clarifies AirPods already work as basic Bluetooth; this adds proprietary extras
  • Enthusiasm for the project and hope for similar liberation of other Apple protocols
  • ~Skepticism that Apple will patch these capabilities, discouraging purchase
  • Reluctance to financially support a company hostile to interoperability
  • ~Wishes specific features (multipoint, dual speaker/headset) were enabled by the project
11.A way to exclude sensitive files issue still open for OpenAI Codex(github.com)
210 points by pikseladam 23 hours ago | 133 comments | permalink
tl;dr: A GitHub issue requests that OpenAI Codex add a `.codexignore` mechanism (both repo-level and global) to explicitly prevent the agent from reading or transmitting sensitive files like `.env`, `.pem`, or SSH keys to the model. The requester notes this was previously raised in issue #205, which was closed in favor of a Rust implementation (codex-rs), but no equivalent feature appears to exist there as of August 2025.
HN Discussion:
  • A .codexignore feature is pointless and provides false security; use OS permissions or containers instead
  • Sensitive files shouldn't be stored in repo folders or plaintext on disk in the first place
  • Sandboxing via containers/bind-mounts is the proper solution, not a blocklist in Codex
  • ~Agent file access should be opt-in rather than opt-out, but handled at a different layer
  • We need better secret-handling infrastructure (proxies, agent-vaults) rather than relying on .env files
12.The curious case of the disappearing Polish S (2015)(aresluna.org)
246 points by colinprince 23 hours ago | 97 comments | permalink
tl;dr: Medium users typing in Polish couldn't enter the letter Ś because the editor blocked Ctrl+S to prevent the browser's save dialog. The bug arose because Polish keyboards use Right Alt+S to type Ś, and Windows internally maps Right Alt to Ctrl+Alt—so the editor's Ctrl+S handler was swallowing the keystroke. The fix was a one-line change: only block Ctrl+S when Alt isn't also pressed.
HN Discussion:
  • Similar Ctrl/Alt shortcut bugs plague many other apps blocking Polish diacritics
  • ~Browsers should expose better APIs for key combinations and the fix is incomplete
  • Medium should autosave instead of relying on Ctrl+S shortcuts entirely
  • Tangential linguistic and cultural observations about Polish language
  • Technical Unicode trivia about Polish letter normalization quirks
13.EU to legislate about Chat Control behind closed doors(patrick-breyer.de)
671 points by NeutralForest 21 hours ago | 386 comments | permalink
tl;dr: Former MEP Patrick Breyer warns of a two-pronged EU push this weekend to revive "Chat Control" mass-scanning of private messages. EP President Metsola is reportedly trying to resurrect the expired Chat Control 1.0 regulation despite Parliament's March rejection, while Monday's trilogue on the permanent CSAR proposal could mandate warrantless scanning, "voluntary" detection as risk mitigation, and age verification that ends anonymous communication. Civil society has relaunched fightchatcontrol.eu to pressure lawmakers.
HN Discussion:
  • Chat Control is ineffective against criminals while creating mass surveillance of ordinary citizens
  • Frustration that rejected proposals keep being resurrected through opaque processes
  • EU bureaucrats are hypocritical, hiding their own communications while surveilling citizens
  • Calls for deeper analysis of who drives these proposals rather than reactive outrage
  • Skepticism that the campaign website conflates reasonable measures with extreme ones
14.Marfa Public Radio Puts You to Sleep(marfapublicradio.org)
404 points by reaperducer 1 day ago | 124 comments | permalink
tl;dr: Marfa Public Radio launched a sleep podcast called "Marfa Public Radio Puts You to Sleep" for its fall membership drive, in which staff read aloud the dull operational documents (FCC compliance, NPR ethics codes, etc.) that keep the 24/7 station running. The goal is both to lull listeners to sleep and to encourage donations at marfapublicradio.org/donate.
HN Discussion:
  • Sharing similar sleep-inducing podcasts and media recommendations like fictional baseball broadcasts and boring book readings
  • Praising Marfa as a town and endorsing the public radio station's appeal
  • Personal sleep tricks and techniques that work better than boring audio content
  • The chosen subject matter (NPR ethics) is actually interesting and wouldn't induce sleep
  • Playful endorsement mimicking falling asleep while engaging with the concept
15.Michigan bill would bar employers from requiring after-hours coms with workers(cbsnews.com)
259 points by cebert 21 hours ago | 215 comments | permalink
tl;dr: Michigan Senate Bill 948, introduced by Sen. Erika Geiss, would prohibit employers from requiring workers to respond to emails, texts, or messages outside of their scheduled hours, with exceptions for contracted on-call pay, employee-set availability windows, and state/federal emergencies. Violations could be reported to the Department of Labor and Economic Opportunity, potentially resulting in fines or overtime pay. The bill has been referred to the Labor Committee.
HN Discussion:
  • Critics in thread show privilege by ignoring that many workers suffer after-hours demands this bill would address
  • Compensation and market dynamics, not legislation, should resolve after-hours work expectations
  • Bill seems reasonably scoped and would help vulnerable workers like restaurant staff without overreach
  • Legislation will make states less attractive to employers and reduce flexibility for willing workers
  • ~Personal experience shows after-hours contact isn't a problem, questioning the need for the bill
16.OpenRA(openra.net)
791 points by tosh 1 day ago | 158 comments | permalink
tl;dr: OpenRA's new playtest-20260222 introduces random map generators for Red Alert, Tiberian Dawn, and Dune 2000, usable in skirmish and multiplayer. Dune 2000 gets new visual effects, Starport bulk purchasing, and a community-led balance overhaul, while the standalone Tiberian Dawn HD mod is now feature-complete with toggleable remastered/classic assets. Other additions include map editor improvements, expansion-building bots, auto-save, new missions, and groundwork for localization.
HN Discussion:
  • OpenRA improves on the original with better balance and features
  • Praise and gratitude for OpenRA developers and the active player base
  • Nostalgic memories of playing the original Red Alert games
  • Open-source engine remakes of classic games are valuable broader trend
  • Publishers should open source older games like EA did with C&C
17.Suspicious Discontinuities (2020)(danluu.com)
273 points by tosh 1 day ago | 100 comments | permalink
tl;dr: Summary not available
HN Discussion:
  • Sharing additional real-world examples of discontinuities reinforcing the article's thesis
  • Personal anecdotes confirming the threshold-gaming behavior described
  • Proposing fixes like eliminating means-testing or phase-outs entirely
  • ~Offering benign explanations (e.g., pacers) for some observed discontinuities
  • Appreciating the visualizations and statistical patterns shown
18.Anonymous GitHub account mass-dropping undisclosed 0-days(github.com)
930 points by binyu 1 day ago | 374 comments | permalink
tl;dr: Summary not available
HN Discussion:
  • Reported vulnerabilities are unimpressive, just simple bugs not real 0-days
  • Term '0-day' is being misused; many are already-disclosed or trivial issues
  • AI-generated security reports tend to be bloated with false positives and non-issues
  • This mass-dropping reflects a transitional hype era for AI-based vulnerability discovery
  • Crowdsourcing LLM-found bugs without vetting is irresponsible and burdens maintainers
19.DSpark: Speculative decoding accelerates LLM inference [pdf](github.com)
789 points by aurenvale 2 days ago | 356 comments | permalink
tl;dr: Summary not available
HN Discussion:
  • DeepSeek is praised for genuine innovation and open publishing compared to American labs
  • Users share positive real-world experience with DeepSeek models and integrations
  • This technique likely explains DeepSeek's ability to offer dramatically lower prices
  • Questions whether this is meaningfully novel versus 2022 speculative decoding work
  • Speculates about future proliferation of specialized small models for speculative decoding
20.AI learns the “dark art” of RFIC design(spectrum.ieee.org)
264 points by Brajeshwar 4 days ago | 173 comments | permalink
tl;dr: Princeton researchers are using reinforcement learning, inverse design, and diffusion models to automate RFIC design—a notoriously artisanal field where chips for 5G, radar, and satellite comms have traditionally been hand-crafted over years. Their AI-generated power amplifiers, which often look like QR codes rather than symmetric layouts, have achieved record bandwidth and efficiency while cutting design time from months to minutes. The main bottleneck now is training data, most of which sits locked behind corporate NDAs, prompting calls for open chip-design datasets akin to ImageNet.
HN Discussion:
  • ~This phenomenon of algorithm-designed uninterpretable circuits is decades old, not novel
  • ~The article overstates novelty; this is similar to genetic algorithms and brute force search
  • Questions robustness of AI-generated designs and whether conventional subblocks carry the weight
  • Conflating LLMs with traditional ML techniques muddies the discussion unfairly
  • Philosophical musing on whether nature's truths may be ugly messes only machines can grasp