Automatic Tuning of DCA++ for the ARM A64FX Processor