A follow-up question about the final score was answered correctly, but Gemini got the name of the scorer of the first touchdown wrong: The AI said it was Johan Dotson. Dotson was shown scoring a touchdown in the highlights with the score at 0-0, but it was ruled out, an example of the nuances that AI doesn't necessarily pick up on.
Gemini did successfully identify when the Kansas City Chiefs got their first points, and even included a timestamp linking directly to the touchdown in the YouTube clip. It also got the name of the scorer right. It seems Gemini is heavily reliant on the commentary for sports clips, which isn't surprising.
Summarize Video Contents
Next, we tried putting Gemini up against a behind-the-scenes featurette for The Grand Budapest Hotel, directed by Wes Anderson. The clip runs to four-and-a-half minutes, and Gemini fired back some replies almost instantly: It identified the name of the film being discussed, and the main beats of the clip's narrative.
However, it's all reliant on the audio (or the transcript) again: there doesn't seem to be any analysis of the actual video contents. The AI couldn't say who the talking heads were in the video, even though their names were shown on screen, and wasn't able to say who the director was (even though this was also mentioned in the video description).
On the plus side, Gemini did do a strong job of summing up the audio of the video. It correctly identified some of the filmmaking challenges that were mentioned throughout, and provided timestamps for them, from searching for a set to represent the Grand Budapest to filling it with extras.
Summarize Interviews
Finally, we tried Google Gemini with an interview: Channel 4 in the UK speaking to Charlie Brooker and Siena Kelly about the latest series of Black Mirror (perhaps appropriate for an article on AI). Gemini proved itself very capable at picking out the talking points and adding timestamps, though of course the whole video is mostly talking.
Again though, there's no context about anything outside of the audio or the transcript. Gemini couldn't say where the interview took place, or how the people were acting, or anything else about the visuals of the video, which is worth bearing in mind if you use it yourself.
For videos where the answers you want are in the audio of a YouTube video, and its associated transcript, Gemini works really well at summarizing and providing accurate answers (provided the commentators mention when a touchdown is ruled out, as well as when one is scored). For any kind of visual information, you're still going to have to watch the video yourself.