Upload a video (e.g., MP4, MOV, or AVI). The file is automatically converted to AVI before inference with the R(2+1)D + Enhanced MoE model.
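
For readers curious what the AVI conversion step might look like, here is a minimal sketch that shells out to ffmpeg. It is not the app's actual code: the function names `build_avi_command` and `convert_to_avi`, the `_converted` output suffix, the `mpeg4` codec, the quality value, and the decision to drop audio are all assumptions made for illustration.

```python
import shutil
import subprocess
from pathlib import Path


def build_avi_command(src: str, dst_dir: str = ".") -> list[str]:
    """Build an ffmpeg command that re-encodes `src` into an AVI file.

    Hypothetical helper: the output name and codec settings are assumptions.
    """
    # A "_converted" suffix keeps an uploaded .avi from being overwritten.
    dst = Path(dst_dir) / f"{Path(src).stem}_converted.avi"
    return [
        "ffmpeg",
        "-y",             # overwrite the output file if it already exists
        "-i", src,        # input video (MP4, MOV, AVI, ...)
        "-c:v", "mpeg4",  # assumed video codec; the app's real choice is unknown
        "-q:v", "5",      # assumed quality setting (lower means higher quality)
        "-an",            # drop audio, assuming the model only reads frames
        str(dst),
    ]


def convert_to_avi(src: str, dst_dir: str = ".") -> Path:
    """Run the conversion and return the AVI path. Needs ffmpeg on PATH."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg not found on PATH")
    cmd = build_avi_command(src, dst_dir)
    # check=True raises CalledProcessError if ffmpeg exits with an error.
    subprocess.run(cmd, check=True, capture_output=True)
    return Path(cmd[-1])
```

Calling `convert_to_avi("uploads/clip.mov", "converted")` would, under these assumptions, write `converted/clip_converted.avi` (the target directory must already exist) and return its path, which could then be passed to whatever loader feeds the model.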