Toro: Deploy Applications as Unikernels

(github.com)

84 points | by ignoramous 2 hours ago

11 comments

m132 1 hour ago
Projects like this and Docker make me seriously wonder where software engineering is going. Don't get me wrong, I don't mean to criticize Docker or Toro in parcicular. It's the increasing dependency on such approaches that bothers me.
Docker was conceived to solve the problem of things "working on my machine", and not anywhere else. This was generally caused by the differences in the configuration and versions of dependencies. Its approach was simple: bundle both of these together with the application in unified images, and deploy these images as atomic units.
Somewhere along the lines however, the problem has mutated into "works on my container host". How is that possible? Turns out that with larger modular applications, the configuration and dependencies naturally demand separation. This results in them moving up a layer, in this case creating a network of inter-dependent containers that you now have to put together for the whole thing to start... and we're back to square one, with way more bloat in between.
Now hardware virtualization. I like how AArch64 generalizes this: there are 4 levels of privilege baked into the architecture. Each has control over the lower and can call up the one immediately above to request a service. Simple. Let's narrow our focus to the lowest three: EL0 (classically the user space), EL1 (the kernel), EL2 (the hypervisor). EL0, in most operating systems, isn't capable of doing much on its own; its sole purpose is to do raw computation and request I/O from EL1. EL1, on the other hand, has the powers to directly talk to the hardware.
Everyone is happy, until the complexity of EL1 grows out of control and becomes a huge attack surface, difficult to secure and easy to exploit from EL0. Not good. The naive solution? Go a level above, and create a layer that will constrain EL1, or actually, run multiple, per-application EL1s, and punch some holes through for them to still be able to do the job—create a hypervisor. But then, as those vaguely defined "holes", also called system calls and hyper calls, grow, won't so the attack surface?
Or in other words, with the user space shifting to EL1, will our hypervisor become the operating system, just like docker-compose became a dynamic linker?
[-]
- eyberg 47 minutes ago
  Containers got popular at at time when there were an increasingly number of people that were finding it hard to install software on their system locally - especially if you were, for instance, having to juggle multiple versions of ruby or multiple versions of python and those linked to various major versions of c libraries.
  Unfortunately containers have always had an absolutely horrendous security story and they degrade performance by quite a lot.
  The hypervisor is not going away anytime soon - it is what the entire public cloud is built on.
  While you are correct that containers do add more layers - unikernels go the opposite direction and actively remove those layers. Also, imo the "attack surface" is by far the smallest security benefit - other architectural concepts such as the complete lack of an interactive userland is far more beneficial when you consider what an attacker actually wants to do after landing on your box. (eg: run their software)
  When you deploy to AWS you have two layers of linux - one that AWS runs and one that you run - but you don't really need that second layer and you can have much faster/safer software without it.
  [-]
  - m132 5 minutes ago
    I can understand the public cloud argument; if the cloud provider insists on you delivering an entire operating system to run your workloads, a unikernel indeed slashes the amount of layers you have to care about.
    Suppose you control the entire stack though, from the bare metal up. (Correct me if I'm wrong, but) Toro doesn't seem to run on real hardware, you have to run it atop QEMU or Firecracker. In that case, what difference does it make if your application makes I/O requests through paravirtualized interfaces of the hypervisor or talks directly to the host via system calls? Both ultimately lead to the host OS servicing the request. There isn't any notable difference between the kernel/hypervisor and the user/kernel boundary in modern processors either; most of the time, privilege escalations come from errors in the software running in the privileged modes of the processor.
    Technically, in the former case, besides exploiting the application, a hypothetical attacker will also have to exploit a flaw in QEMU to start processes or gain further privileges on the host, but that's just due to a layer of indirection. You can accomplish this without resorting to hardware virtualization. Once in QEMU, the entire assortment of your host's system calls and services is exposed, just as if you ran your code as a regular user space process.
    This is the level you want to block exec() and other functionality your application doesn't need at, so that neither QEMU nor your code ran directly can perform anything out of their scope. Adding a layer of indirection while still leaving user/kernel, or unikernel/hypervisor junction points unsupervised will only stop unmotivated attackers looking for low-hanging fruit.
  - pjmlp 34 minutes ago
    Linux containers you mean.
    The story is quite different in HP-UX, Aix, Solaris, BSD, Windows, IBM i, z/OS,...
    [-]
    - ripdog 5 minutes ago
      Windows has containers?
  - pixl97 19 minutes ago
    >what an attacker actually wants to do after landing on your box.
    Aren't there ways of overwriting the existing kernel memory/extending it to contain an a new application if an attacker is able to attack the running unikernel?
    What protections are provided by the unikernel to prevent this?
    [-]
    - eyberg 0 minutes ago
      To be clear there are still numerous attacks one might lob at you. For instance you if you are running a node app and the attacker uploads a new js file that they can have the interpreter execute that's still an issue. However, you won't be able to start running random programs like curling down some cryptominer or something - it'd all need to be contained within that code.
      What becomes harder is if you have a binary that forces you to rewrite the program in memory as you suggest. That's where classic page protections come into play such as not writing to rodata, not writing to txt, not exec'ing heap/stack, etc. Just to note that not all unikernel projects have this and even if they do it might be trivial to turn them off. The kernel I'm involved with (Nanos) has other features such as 'exec protection' which prevents that app from exec-mapping anything not already explicitly mapped exec.
      Running arbitrary programs, which is what a lot of exploit payloads try to achieve, is pretty different than having to stuff whatever they want to run inside the payload itself. For example if you look at most malware it's not just one program that gets ran - it's like 30. Droppers exist solely to load third party programs on compromised systems.
    - wmf 11 minutes ago
      If the stack and heap are non-executable and page tables can't be modified then it's hard to inject code. Whether unikernels actually apply this hardening is another matter.
- kitd 50 minutes ago
  This results in them moving up a layer, in this case creating a network of inter-dependent containers that you now have to put together for the whole thing to start... and we're back to square one, with way more bloat in between.
  I think you're over-egging the pudding. In reality, you're unlikely to use more than 2 types of container host (local dev and normal deployment maybe), so I think we've moved way beyond square 1. Config is normally very similar, just expressed differently, and being able to encapsulate dependencies removes a ton of headaches.
mustache_kimono 1 hour ago
Bryan Cantrill, "Unikernels are unfit for production". [0]
[0]: https://www.tritondatacenter.com/blog/unikernels-are-unfit-f...
[-]
- wmf 1 hour ago
  Toro provides a GDB stub so there has been a little progress since that time.
- vigilans 1 hour ago
  There's a man who hasn't tried running qubes-mirage-firewall.
  Unikernels don't work for him; there are many of us who are very thankful for them.
  [-]
  - mustache_kimono 56 minutes ago
    > there are many of us who are very thankful for them.
    Why? Can you explain, in light of the article, and for those of us who may not be familiar with qubes-mirage-firewall, why?
- cmrdporcupine 27 minutes ago
  Cantrill is far smarter and accomplished than me, but this article feels a bit strawman and hit and run?
  I think unikernels potentially have their place, but as he points, they remain mostly untried, so that's fair. We should analyze why that is.
  On performance: I think the kernel makes many general assumptions that some specialized domains may want to short circuit entirely. In particular I am thinking how there's a whole body of research of database buffer pool management basically having elaborate work arounds for kernel virtual memory subsystme page management techniques, and I suspect there's wins there in unikernel world. Same likely goes for inference engines for LLMs.
  The Linux kernel is a general purpose utility optimizing for the entire range of "normal things" people do with their Linux machines. It naturally has to make compromises that might impact individual domains.
  That and startup times, big world of difference.
  Is it going to help people better sling silly old web pages and whatever it is people do with computers conventionally? Yeah, I'd expect not.
  On security, I don't think it's unreasonable or pure "security theatre" to go removing an attack surface entirely by simply not having it if you don't need it (no users, no passwords, no filesystem, whatever). I feel like he was a bit dismissive here? That is also the principle behind capability-passing security to some degree.
  I would hate to see people close the door on a whole world of potentials based on this kind of summary dismissal. I think people should be encouraged to explore this domain, at least in terms of research.
  [-]
  - mustache_kimono 9 minutes ago
    > On performance: ... In particular I am thinking how there's a whole body of research of database buffer pool management
    Why? The solution seems to be, turn off what the kernel does, and, do these things in userspace, not move everything in the kernel? [0] Where are the gains to be had?
    [0]: https://db.cs.cmu.edu/papers/2022/cidr2022-p13-crotty.pdf
    > The Linux kernel is a general purpose utility optimizing for the entire range of "normal things" people do with their Linux machines.
    Yeah, like logging. Perhaps you say: "Oh we just add that logging to the blob we run". Well isn't that now another thing that can take down the system?
    > That and startup times, big world of difference.
    Perhaps in this very narrow instance, this is useful, but what is it useful for? Can't Linux or another OS be optimized for this use case without having to throw the baby out with the bathwater? Can't one snapshot a Firecracker VM and reach similar startup times?
    > On security, I don't think it's unreasonable or pure "security theatre" to go removing an attack surface entirely
    Isn't one problem removing any protection domains? Like between apps and the kernel and between the apps themselves?
lacoolj 46 minutes ago
I use LXD + LXC, wondering if this is worth trying or if the overhead of accessing (network, etc) would be too much to deal with/care about.
Also always a little wary of projects that have bad typos or grammar problems in a README - in particular on one of the headings (thought it's possible these are on purpose?). But that's just me :\
droelf 1 hour ago
I've been using unikraft (https://unikraft.org/) unikernels for a while and the startup times are quite impressive (easily sub-second for our Rust application).
[-]
- ahepp 1 hour ago
  What drove you to choose that over something like containers?
  [-]
  - m00dy 1 hour ago
    shorter cold-boot times.
gnabgib 2 hours ago
(2020) Currently, seems to have been around since 2011 (https://news.ycombinator.com/item?id=3288786) although at a few different domains (torokernel.org, torokernel.io)
raggi 50 minutes ago
I don't want the observability of my applications to be bound by themselves, it's kind of a real pain. I'm all for microvm images without excess dependencies, but coupling the kernel and diagnostic tools to rapidly developing application code can be a real nightmare as soon as the sun stops shining.
richardwhiuk 2 hours ago
I wonder how it compares to https://mirage.io/
[-]
- speed_spread 59 minutes ago
  Isn't Mirage OCaml only?
cmrdporcupine 2 hours ago
It's written in... Pascal...
Neat.
[-]
- giancarlostoro 1 hour ago
  My last name is finally on the front page of HN as a project name, look mah!
  I was not expecting Pascal, thats an interesting choice. One thing I do like is that Freepascal has one of the better ways of making GUIs meanwhile every other language had decided that just letting Javascript build UIs is the way.
- AceJohnny2 24 minutes ago
  yay no C strings!
- lacoolj 48 minutes ago
  Oh holy crap that's actually super cool. One of the first languages I (tried to) learn ... at 13. And failed.
  Now I write Javascript and SQL.
  :)
spacecadet404 2 hours ago
What's the use case for this rather than containers? Separation from the hypervisor kernel?
[-]
- Imustaskforhelp 1 hour ago
  Containers (docker/podman) are still not as secure as virtualization (qemu,kvm,proxmox)
  Plus these might be smaller and might run faster than containers too.
  [-]
  - m00dy 59 minutes ago
    yeah it's a fairy tale.
- ahepp 1 hour ago
  Presumably to avoid the cost of context switches or copying between kernel/user address spaces? Looks to be the opposite of userspace networking like DPDK: kernel space application programming.
- eru 1 hour ago
  It can be much faster, and much smaller surface area for attacks than using a full Linux kernel.
m00dy 58 minutes ago
it is using qemu's network stack, would like to know how performant it is.
itsthecourier 2 hours ago
reminds me of actors, they are sharing messages between kernels with a bus
file sharing is complex too it seems
would be good to see a benchmark or something showing where it shines
[-]
- Imustaskforhelp 1 hour ago
  I think one reason UniKernels can be different are perhaps that they can allow more isolation or run user generated code perhaps inside the Unikernel with proper isolation whereas I don't think actors can do that