5 comments

  • arnitdo 1 hour ago
    The whole website is genuinely unreadable / illegible with poor contrast.
    • zarzavat 49 minutes ago
      This is the third time I've seen a website with this styling (serif, yellow and white on black). What's going on? Is it a template or some AI induced convergence?
      • jbaiter 7 minutes ago
        I think it's Claude, yeah.
      • Zababa 33 minutes ago
        Default output of claude code. Another obvious example is https://trumprx.gov/, with the background beige that's kinda close to the Hacker News one (to my eyes at least)
    • youngtaff 20 minutes ago
      For what ever reason reader mode appears to show different content
      • Jolter 0 minutes ago
        It does for me too, in Safari on iPhone.
    • DesaiAshu 52 minutes ago
      Looks like a font rendering or retina issue
    • yashasolutions 59 minutes ago
      yes, they should increase the font size as a first step.
      • xxs 31 minutes ago
        I can do "ctrl + +" to increase the font, but it's still serif and low contrast, so I have to do "ctrl + A". Or better yet - press "reader view" on firefox.
  • adev_ 12 minutes ago
    > For the past few decades, building a datacenter has been a well-understood, predictable exercise in utility engineering.

    > In modern AI clusters, the network is no longer just infrastructure sitting beneath compute

    It always make me smile when someone is presenting these kind of topologies as "New", "Modern A.I" or anything remotely "Revolutionary".

    The HPC domain and any super computers have been doing RDMA networking centered around "all-to-all" and "all-reduce" operations for at least 3 fucking decades now.

    They are the main reasons supercomputer are almost always constructed around stupidly complex Torus or Dragonfly network topologies

    The only difference now is that it switch from "This niche thing 3 nerds were using for weather simulations" to "this cool thing any hyperscaler NEED to have for A.I"

  • ElFitz 12 minutes ago
    I never considered the implications and impacts on datacenters' architecture and organisation. It’s fascinating.
  • RetroTechie 3 minutes ago
    Datacenters are being built for AI. What happens when you remove the AI workload?

    Don't get me wrong. I don't mind when some tech bros burn billions of venture capital & nothing much (?) comes out of it.

    But those datacenters embody a lot of resources. Raw materials, complex/resource heavy manufacturing processes for IC's, servers, networking gear, etc etc.

    I sure hope that doesn't go to waste when the AI bubble pops. Datacenter stuffed with AI optimized hardware any good for general engineering? Science projects? Weather prediction? Web hosting? ...??

  • AlassaneSakande 2 days ago
    We're two data center networking engineers who've spent years designing and operating data center infrastructure for governments, telcos and banks in West Africa. This piece came out of our work on a new AI architecture based on associative memory rather than transformers. The GPU-free argument here is something we think about the next phase of AI networking. Happy to discuss further about it.

    PS: Taking a look at our manifesto (https://almartis.xyz/) can help with more context.

    • throw23232 41 minutes ago
      Very interesting of course, but stuff like this just needs a demo not a book. Can be super simple, but it needs to be demonstrated somehow.
    • Animats 1 hour ago
      Yes, read that. What these people are talking about seems to replacing training of NNs by something else entirely. The big question is, does that work? At all?

      It's premature to discuss network architecture until that basic question is answered.

      • fc417fc802 33 minutes ago
        I'm maybe 10% of the way in but I find I'm increasingly skeptical. If the basic building block dates back to the 1970s haven't other people tried this by now? If not, isn't the first order of business to throw together a prototype that solves MINST or one of the many other small datasets floating around out there as a proof of concept?

        So unfortunately I'm inclined to assume this is empty conjecture shat out by an LLM. Because who would write something up in this much detail rather than typing `import numpy as ...` and going to town?

        I'll also note that the document has all the usual crank signs. Lots of grand visions, hypotheses, and expounding at an overly high level on how various things work with hardly anything concrete.