ylai@lemmy.ml to AI@lemmy.ml · English · 9 months ago

How Gradient created an open LLM with a million-token context window

venturebeat.com



AI startup Gradient and cloud platform Crusoe teamed up to extend the context window of Meta's Llama 3 models to 1 million tokens.


TechNerdWizard42@lemmy.world · 9 months ago
I believe you'd need roughly 500GB of RAM at minimum to run it at full context length. There is chatter that a 125k-token context used 40GB.

I know I can load the 70B models into my laptop at lower bits but it consumes about 140GB of RAM.
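
For anyone who wants to check those figures, here is a rough back-of-the-envelope sketch of the key-value cache arithmetic. It is not from the article: the layer count, KV head count, and head dimension below are my assumptions for the Llama 3 70B shape (80 layers, 8 grouped-query KV heads, head dimension 128), with 16-bit values throughout, and it ignores activations and runtime overhead.

```python
def kv_cache_bytes(tokens, layers, kv_heads, head_dim, bytes_per_value=2):
    """Estimate KV cache size: one key and one value vector per layer per token."""
    return 2 * layers * kv_heads * head_dim * bytes_per_value * tokens


# Assumed Llama 3 70B shape (not stated in the thread).
LLAMA3_70B = dict(layers=80, kv_heads=8, head_dim=128)

# Assumed 16-bit weights: 70 billion parameters at 2 bytes each.
WEIGHTS_FP16_BYTES = 70_000_000_000 * 2


def to_gb(n_bytes):
    """Convert bytes to decimal gigabytes."""
    return n_bytes / 1e9


cache_125k = kv_cache_bytes(125_000, **LLAMA3_70B)
cache_1m = kv_cache_bytes(1_000_000, **LLAMA3_70B)
total_1m = cache_1m + WEIGHTS_FP16_BYTES

print(f"KV cache at 125k tokens: {to_gb(cache_125k):.1f} GB")   # 41.0 GB
print(f"KV cache at 1M tokens:   {to_gb(cache_1m):.1f} GB")     # 327.7 GB
print(f"Cache + fp16 weights:    {to_gb(total_1m):.1f} GB")     # 467.7 GB
```

Under those assumptions, the 40GB at 125k and the roughly 500GB at full context are both in the right ballpark, and the 140GB figure matches 70B parameters at 16 bits. A quantized cache or quantized weights would shrink these numbers proportionally.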
