ylai@lemmy.ml to AI@lemmy.mlEnglish · vor 2 JahrenNvidia’s ‘Nemotron-4 340B’ model redefines synthetic data generation, rivals GPT-4venturebeat.comexternal-linkmessage-square4linkfedilinkarrow-up120arrow-down11cross-posted to: aicompanions@lemmy.world
arrow-up119arrow-down1external-linkNvidia’s ‘Nemotron-4 340B’ model redefines synthetic data generation, rivals GPT-4venturebeat.comylai@lemmy.ml to AI@lemmy.mlEnglish · vor 2 Jahrenmessage-square4linkfedilinkcross-posted to: aicompanions@lemmy.world
minus-squareFischlinkfedilinkEnglisharrow-up1·vor 2 Jahren340B is fucking huge, holy shit. How big is GPT-4?
minus-squareylai@lemmy.mlOPlinkfedilinkarrow-up2·vor 2 JahrenThe rumor is 1.76 trillion, or 8x220B (mixture of experts) to be specific: https://wandb.ai/byyoung3/ml-news/reports/AI-Expert-Speculates-on-GPT-4-Architecture---Vmlldzo0NzA0Nzg4
340B is fucking huge, holy shit. How big is GPT-4?
The rumor is 1.76 trillion, or 8x220B (mixture of experts) to be specific: https://wandb.ai/byyoung3/ml-news/reports/AI-Expert-Speculates-on-GPT-4-Architecture---Vmlldzo0NzA0Nzg4