Mmm, iMovie, the program I used for editing the videos, lets me put text or images on screen for a set period of time. There's no need to add the text frame by frame.
For example: I start by watching the video at a higher speed, I cut the unnecessary parts, and then I pick out which segments need text (like opening hands). After that, I just select from what point to what point I want the text to be shown. I add the effects and transitions, and move on.

It does take a lot of time, but it's not an insane amount of work. I wouldn't do it if I didn't enjoy the process, or if I weren't getting such positive feedback from you guys.
