[Training and inference of large language models using 8-bit floating point][1] Questions