What brand/model of pants can you recommend?

sntx@lemm.ee · 2 days ago

24 ^ 6 = 191_102_976

sntx@lemm.ee · 2 days ago

To be honest, I switched to Wayland years ago precisely because of the better perceived input/cursor experience.

Change my mind, but having an average of half a frame input latency is much preferred when in return I gain that the cursor position on the screen actually aligns with all the other content displayed.

Plus, I’m very sensitive to tearing, so whenever it happens I get the impression that there was a huge rendering error.

Well and on the note that the cursor might visibly stutter, sure. But it’s a bit misleading. A game pinning the GPU to 100 % and running on 5 FPS doesn’t mean that your cursor will be rendered with 5 FPS. So far I’ve only noticed cursor lag/stutters in OOM situations, but neither under heavy GPU or CPU load.

sntx@lemm.ee · 5 days ago

Dionaea muscipula

sntx@lemm.ee · 7 days ago

For the gifs, are you talking about the keyboard or lemmy client?

sntx@lemm.ee · 7 days ago

It’s Jerboa with the “Black” Theme selected under look and feel.

sntx@lemm.ee · 7 days ago

I was in a building that was rebuild after a fighter jet crashed into the one before it…

sntx@lemm.ee · 7 days ago

Unexpected keyboard is just the best!

screenshot of the unexpected keyboard while writing this response

sntx@lemm.ee · 8 days ago

Or about half a year if we’re only counting the time during which I’ve been alive.

sntx@lemm.ee · 8 days ago

13.787 ± 0.020 billion years

sntx@lemm.ee · 10 days ago

Why does look like another bot post?

sntx@lemm.ee · 10 days ago

The simlutation terminates.

sntx@lemm.ee · 12 days ago

I’m curious, how do you run the 4x3090s? The FE Cards would be 4x3=12 PCIe slots and 4x16=64 PCIe lanes… Did you nvlink them? What about transient power spikes? Any clock or even VBIOS mods?

sntx@lemm.ee · 12 days ago

I’m also on p2p 2x3090 with 48GB of VRAM. Honestly it’s a nice experience, but still somewhat limiting…

I’m currently running deepseek-r1-distill-llama-70b-awq with the aphrodite engine. Though the same applies for llama-3.3-70b. It works great and is way faster than ollama for example. But my max context is around 22k tokens. More VRAM would allow me more context, even more VRAM would allow for speculative decoding, cuda graphs, …

Maybe I’ll drop down to a 35b model to get more context and a bit of speed. But I don’t really want to justify the possible decrease in answer quality.

sntx@lemm.ee · 4 months ago

NixOS just sits on your face. All the stuff in front of you is awesome. Though you might suffocate at any moment given the options. Oh and sticking your nose too deep into things might get you a broken nose.

sntx@lemm.ee · 4 months ago

Thanks for the writeup! So far I’ve been using ollama, but I’m always open for trying out alternatives. To be honest, it seems I was oblivious to the existence of alternatives.

Your post is suggesting that the same models with the same parameters generate different result when run on different backends?

I can see how the backend would have an influence hanfling concurrent api calls, ram/vram efficiency, supported hardware/drivers and general speed.

But going as far as having different context windows and quality degrading issues is news to me.

sntx@lemm.ee · 4 months ago

Is there an inherent benefit for using NVLINK? Should I specifically try out Aprodite over the other recommendations when having 2x 3090 with NVLINK available?

sntx@lemm.ee · 8 months ago

Please tell ^^

sntx@lemm.ee · 8 months ago

I cleaned my bin.

All that’s left is a symlink: sh -> /nix/store/…

sntx@lemm.ee · 8 months ago

Nushell

sntx@lemm.ee · 8 months ago

This is the same setup I’m running, I can highly recommend it.

sntx@lemm.ee · 1 year ago

What brand/model of pants can you recommend?

sntx@lemm.ee · 1 year ago

What aspects are important for non-centralized communities?