The equation at https://evalai.cloudcv.org/web/challenges/challenge-page/354/evaluation shows how to calculate the “accuracy” of a single generated image, assuming that I_X and I_Y are the image width and height, respectively.

How are these individual scores aggregated into the final score? Are the scores for the 1471 test set images simply summed up?
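
To make the question concrete, here is a minimal sketch of the two aggregation options I can think of. The function names and the per-image scores are made up for illustration; I do not know which of these (if either) the evaluation server actually uses.

```python
from statistics import mean

def aggregate_sum(scores):
    """Option A (assumption): the final score is the plain sum of per-image scores."""
    return sum(scores)

def aggregate_mean(scores):
    """Option B (assumption): the final score is the average per-image score."""
    return mean(scores)

# Hypothetical per-image "accuracy" values, not real challenge results.
example_scores = [0.5, 0.75, 1.0]

print("sum :", aggregate_sum(example_scores))
print("mean:", aggregate_mean(example_scores))
```

With a fixed test set of 1471 images the two options rank submissions identically, but the reported numbers differ by a factor of 1471, so it would help to know which one the leaderboard shows.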

Thanks a lot in advance for the clarification.