AV2 delivers 30% better compression efficiency than AV1, which already compresses 30% better than HEVC (H.265).
AV2 encoding demands 2-3 times more computational power than AV1, requiring advanced hardware like an RTX 5090 for practical use.
AV2 will officially release by end of 2025, with widespread hardware support expected around 2027 or later.
AV2 introduces advanced features like split-screen delivery, enhanced AR/VR support, and dynamic bitrate switching for adaptive streaming.
88% of AOMedia members plan to implement AV2 within two years, despite infrastructure and hardware compatibility challenges.
If there are any other differences, let me know too. Honestly I'm a bit curious, but it mentions that it requires an RTX 5090.
Wouldn't this be a little bad for the market too? Sure, it compresses 30% more, but not everybody has an RTX 5090.
Are we gonna see multi-codec setups in things like, say, Netflix, where devices that don't support AV2 are sent AV1, but AV2 is preferred when the hardware supports it?
Just in case you missed it, your quote was referring to encoding requirements. Decoding (e.g. Netflix users) will have a different set of requirements. The situation will also improve over time as dedicated hardware encoders and decoders become available.
For the moment, I don't really mind if it requires more GPU power to encode media, since it only needs to happen once. I expect it will still be possible on a weaker card, but it would just take longer.
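To make the multi-codec question above concrete, here's a rough Python sketch of how a streaming service might pick a rendition ladder per client. The rendition names and the capability set are made up for illustration; real services do this through codec strings in DASH/HLS manifests plus client-side capability detection, not this exact function.

```python
# Hypothetical sketch of server-side codec selection for adaptive streaming.
# Rendition names and the client capability set are invented for illustration.

# Renditions the packager has produced, keyed by codec.
AVAILABLE_LADDERS = {
    "av2": ["av2_2160p", "av2_1080p", "av2_720p"],
    "av1": ["av1_2160p", "av1_1080p", "av1_720p"],
    "hevc": ["hevc_1080p", "hevc_720p"],
    "avc": ["avc_1080p", "avc_720p"],
}

# Preference order: newer, more efficient codecs first, older ones as fallback.
CODEC_PREFERENCE = ["av2", "av1", "hevc", "avc"]


def pick_ladder(client_decoders: set[str]) -> list[str]:
    """Return the rendition ladder for the best codec the client can decode."""
    for codec in CODEC_PREFERENCE:
        if codec in client_decoders and codec in AVAILABLE_LADDERS:
            return AVAILABLE_LADDERS[codec]
    raise ValueError("no mutually supported codec")


if __name__ == "__main__":
    # A future device with an AV2 hardware decoder gets the AV2 ladder...
    print(pick_ladder({"av2", "av1", "hevc", "avc"}))
    # ...while today's hardware silently falls back to AV1.
    print(pick_ladder({"av1", "hevc", "avc"}))
```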
I highly recommend it. As a tip, you can quite easily get into a chat-like state by simply using in-context learning. Have a few turns of conversation pre-written and generate from that. It'll continue the conversation (for both parties), so you just stop it when it starts generating on your behalf.
That said, it's useful for so much more beyond that. Outline the premise of a book, then add "what follows is that book\n# Chapter 1:" and watch it rip. Base models are my preferred way of using LLMs by a long margin.
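A minimal sketch of that "outline a book, then let it rip" trick, assuming a local base (non-instruct) checkpoint via Hugging Face transformers. The model name and the premise text are placeholders, not anything from the comment above.

```python
# Minimal document-continuation sketch with a base model (no chat template,
# no system prompt): set up a document and let the model continue it.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL = "meta-llama/Llama-3.1-8B"  # placeholder: any base checkpoint you can run

tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL, device_map="auto")

prompt = (
    "Premise: A lighthouse keeper on a remote island discovers the lamp is\n"
    "receiving messages from a ship that sank forty years ago.\n\n"
    "What follows is that book.\n\n"
    "# Chapter 1\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(
    **inputs,
    max_new_tokens=400,
    do_sample=True,
    temperature=0.8,
)
# Print only the newly generated continuation, not the prompt.
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```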
I've done this out of curiosity with the Llama 3.1 405B base model. I vibe-coded a little chat harness where the system prompt was a few short conversations between "system" and "user", with "user:" as the stop word so I could enter my message. It worked surprisingly well, and I didn't get any sycophancy or clichéd AI responses.
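The comment above doesn't share its harness, but the idea looks roughly like this sketch: pre-written turns as the prompt, generate a completion, and cut it off when the model starts writing the next "user:" turn. The model name and the seed conversation are placeholders (the original used 405B via whatever setup they had).

```python
# Rough sketch of a base-model chat harness with "user:" as the stop word.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL = "meta-llama/Llama-3.1-8B"  # placeholder base checkpoint
STOP = "\nuser:"                   # the model writing this means it's our turn

tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL, device_map="auto")

# A couple of pre-written turns to set the tone (the in-context learning part).
history = (
    "user: What's a good way to learn Rust?\n"
    "system: Work through the official book, then rewrite a small tool you already understand.\n"
    "user: Any pitfalls to watch for?\n"
    "system: Fighting the borrow checker early on; lean on clones at first and optimise later.\n"
)

def reply(history: str, user_msg: str) -> str:
    prompt = history + f"user: {user_msg}\nsystem:"
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=300, do_sample=True, temperature=0.7)
    text = tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)
    # Truncate as soon as the model starts generating on our behalf.
    return text.split(STOP)[0].strip()

while True:
    msg = input("you> ")
    answer = reply(history, msg)
    print("model>", answer)
    history += f"user: {msg}\nsystem: {answer}\n"
```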
For tiny models, the SFT data mixture is unbelievably critical to usability. They are unable to generalize in almost any way. If you don't have multi-turn conversations, they will not be able to do multi-turn conversations. If you have multi-turn conversations that are just chatting, and then single-turn conversations for math, they will be unable to do math in a multi-turn setting. This is much less true for bigger models.
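To illustrate the mixture point (the message format here is a generic chat layout, not any particular dataset): if math only ever appears as single-turn examples, a tiny model won't do math mid-conversation, so you also embed the skill inside multi-turn data.

```python
# Hypothetical SFT mixture fragments. The weak pattern: math only single-turn,
# chat only multi-turn.
single_turn_math = [
    {"messages": [
        {"role": "user", "content": "What is 17 * 24?"},
        {"role": "assistant", "content": "17 * 24 = 408."},
    ]},
]

multi_turn_chat = [
    {"messages": [
        {"role": "user", "content": "Hey, how's it going?"},
        {"role": "assistant", "content": "Pretty well! What are you working on?"},
        {"role": "user", "content": "Just planning a trip."},
        {"role": "assistant", "content": "Nice, where to?"},
    ]},
]

# The fix for a tiny model: the same skill shown in the setting it will be used in.
multi_turn_math = [
    {"messages": [
        {"role": "user", "content": "I'm budgeting a trip."},
        {"role": "assistant", "content": "Happy to help. What are the costs?"},
        {"role": "user", "content": "Flights are 320 and the hotel is 85 a night for 4 nights."},
        {"role": "assistant", "content": "That's 320 + 85 * 4 = 320 + 340 = 660 total."},
    ]},
]

sft_mixture = single_turn_math + multi_turn_chat + multi_turn_math
```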
I'm not convinced that LLM training uses so much energy that it really matters in the big picture. You can train a (terrible) LLM on a laptop [1], and frankly that's less energy-efficient than just training it on a rented cloud GPU.
Most of the innovation happening today is in post-training rather than pre-training, which is good for people concerned with energy use because post-training is relatively cheap (I was able to post-train a ~2B model in less than 6 hours on a rented cluster [2]).
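For a sense of what "relatively cheap" post-training looks like, here is a bare-bones supervised fine-tuning loop with PyTorch and transformers. The model name and the two toy examples are placeholders, not what the linked posts used; a real run would also mask padding tokens in the labels and use far more data.

```python
# Minimal supervised fine-tuning (post-training) sketch for a small causal LM.
import torch
from torch.utils.data import DataLoader
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL = "Qwen/Qwen2.5-0.5B"  # placeholder: any small base checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(MODEL)
model.train()

# Toy SFT corpus: prompt/response pairs concatenated into single sequences.
examples = [
    "user: Summarise photosynthesis in one sentence.\n"
    "assistant: Plants convert light, water, and CO2 into sugars and oxygen.",
    "user: What does HTTP 404 mean?\n"
    "assistant: The server couldn't find the requested resource.",
]

def collate(batch):
    enc = tokenizer(batch, return_tensors="pt", padding=True, truncation=True, max_length=256)
    enc["labels"] = enc["input_ids"].clone()  # a real run would set pad positions to -100
    return enc

loader = DataLoader(examples, batch_size=2, shuffle=True, collate_fn=collate)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)

for epoch in range(3):
    for batch in loader:
        loss = model(**batch).loss  # standard causal LM loss over the sequence
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()
    print(f"epoch {epoch}: loss {loss.item():.3f}")
```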