The main task is to find the result of an equation based on a video sequence.
The equation will be indicated by a moving robot.
•Several mathematical operators (multiplication, division, minus, plus, equal) are placed on the table.
•Several handwritten digits (0 to 8) are placed on the table.
•From an initial location somewhere on the table, the robot moves around the table.
Each time the robot passes above an operator or a digit, the symbol located below the robot is added to the equation.
For example, the sequence “2” → “+” → “3” → “=” becomes “2+3=”.
The goal is, given a new video sequence, to retrieve the formula and its associated answer.
To test the pipeline, three different scenarios will be presented:
•SC1: All operators/ digits have vertical orientations.
•SC2: Both operators and digits have random orientations.
•SC3 (Bonus): Both operators and digits have black colors. Orientations are random.
The input of the algorithm is a “.avi” video sequence, recorded at 2 FPS. The output should be a video sequence with
the same frame rate, duration and resolution as the input video. Each frame (e.g., frame at time t) of the output video
should contain the following information, printed on the same frame:
•The current state of the formula at time t.
•The trajectory of the robot from start to time t.
1
Project Overview
IAPR Project –2020
Evan Béal, Maxime Délitroz & Eric Bergkvist