theres a plugin called hardtime.nvim that does almost exactly what you have described. It goes a bit further and actually prevents you from doing certain things if you meet a threshold (like spamming j to go down a bunch if lines instea d of something like 15j to move 15 lines down)
I don’t think it’s that unreasonable to have something called “video podcast” in the scenario where you have an actual podcast, which also happens to have a video recording available on the internet as well. Sometimes I like to watch the video versions of podcasts to see the facial expressions of the speakers. “video podcast” seems like a natural shortening of “video of a podcast”. I think the important part is that the content is first and foremost a podcast, where it is meant to be listened to. As soon as it stops being possible to listen to the podcast as audio only, for example if they start relying on visuals that can only be seen in the video, then it is no longer a podcast.