GPT‑NL: a sovereign language model for the Netherlands (tno.nl)
244 points by root-parent 22 hours ago | 277 comments
tl;dr: The Netherlands is building GPT-NL, a sovereign Dutch language model trained from scratch to avoid the copyright, privacy, and data provenance issues inherited from existing models, with source code released as open source and weights under a controlled license. The project, backed by €13.5 million in public funding from the Ministry of Economic Affairs, includes a Content Board that gives data providers a say and a revenue share, and it emphasizes transparency, lawful data sourcing, and energy efficiency.
HN Discussion:
  • Building sovereign models from scratch wastes money; better to fine-tune existing open baselines
  • Europe needs its own sovereign, open-source model trained on local languages and renewable energy
  • Supporting national/European AI independence from US and China dominance is valuable
  • Countries should focus on controlling compute infrastructure rather than building their own models
  • Skepticism that €13.5M is enough to build a competitive model while paying fair compensation