A Power Law Functional Form for Neural Scaling Behavior - 42Papers