Matches in Nanopublications for { <https://w3id.org/np/RAmEQd6Xc3YHZ6uH6_kOfPKx1SDVV-wn6aK8LrvcmNyZE#assertion> ?p ?o ?g. }
Showing items 1 to 17 of 17 with 100 items per page.
- assertion type Question assertion.
- assertion type Claim assertion.
- assertion type Observation assertion.
- assertion comment " Scaling laws don't care about scale of the "train" models? Did anyone else get this? When I predict a scaling law, the scale of the largest model matters, but the num-models for fitting matters much much much more. Initial results, scaling error by #models starting from largest https://twitter.com/LChoshen/status/1803401845626511568/photo/1 Maybe more simply put: You can predict a scaling law with 8 small models, and it would be better than 3 large ones (that costs a lot) Is that something anyone else seen? " assertion.
- assertion wasGeneratedBy activity provenance.
- assertion wasAttributedTo RAoSadUw99CeqDlR2400018nqTzR_38fT86OrTzk16Vts provenance.
- assertion wasAttributedTo 0000-0002-0085-6496 provenance.
- assertion creator RAoSadUw99CeqDlR2400018nqTzR_38fT86OrTzk16Vts assertion.
- assertion wasAssociatedWith LChoshen provenance.
- assertion keywords "AI" assertion.
- assertion keywords "cost" assertion.
- assertion keywords "initialresults" assertion.
- assertion keywords "models" assertion.
- assertion keywords "modelscale" assertion.
- assertion keywords "scalinglaws" assertion.
- assertion keywords "training" assertion.
- assertion linksTo 1803401845626511568 provenance.