At HackMIT 2018, three friends and I made a project called Scribr, and we won the largest cash prize at the hackathon. We call it Scribr. Scribr automatically simplifies videos for you. As a user, you upload a video of any length, and Scribr takes your video and returns a shorter version of the video with just the "highlights." It's a simple solution to a straightforward problem: videos are often too long, and it would be nice to have a quick summary of the video when you're short on time.
The obvious question is: how does Scribr decide what parts are the highlights? It decides by the audio. We first run the audio through Rev's text-to-speech system. Rev's system has automatic punctuation, which means it returns not just each spoken word, but also the periods at the ends of sentences. This enables us to split up the video at the sentence level. Then, with a list of sentences in hand, we rank the importance of each sentence using a basic NLP algorithm. Next, we take the sentences with the highest importance rank, and get the timestamps of when those sentences were said in the video. We then cut the video at beginning and ends of each time step. And finally, we return a single video to the user, with all of the important sentences stitched together.
You can find read more of the tech details — and see a demo — on the Scribr project page.
Scribr was created by Cowboy Lynk, Ike Urquhart, Nikhil Punwaney, and myself.
We had some fun.
Learn more about the project here.