Figure 1: Multi-token in, Multi-token out Training and Inference. Note: Please prepare data before training. Data preparation details are in the file vila_u/data ...