LLaDA - new MIT-licensed Diffusion LLM

ml-gsai.github.io

LLaDA - new MIT-licensed Diffusion LLM

ml-gsai.github.io

JOMusic@lemmy.ml to

Open Source@lemmy.mlEnglish · 11 months ago

SOCIAL MEDIA TITLE TAG

ml-gsai.github.io

SOCIAL MEDIA DESCRIPTION TAG TAG

Explainer of Diffusion LLMs from Andrej Karpathy: “Most of the LLMs you’ve been seeing are ~clones as far as the core modeling approach goes. They’re all trained “autoregressively”, i.e. predicting tokens from left to right. Diffusion is different - it doesn’t go left to right, but all at once. You start with noise and gradually denoise into a token stream.”

You must log in or # to comment.

Chat

Open Source@lemmy.ml

opensource@lemmy.ml

Create a post

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !opensource@lemmy.ml

All about open source! Feel free to ask questions, and share news, and interesting stuff!

Useful Links

Rules

Posts must be relevant to the open source ideology
No NSFW content
No hate speech, bigotry, etc

Related Communities

Community icon from opensource.org, but we are not affiliated with them.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

806 users / day
2.08K users / week
2.96K users / month
10.7K users / 6 months
1 local subscriber
43.2K subscribers
979 Posts
11.9K Comments
Modlog