In the video below, one creative hacker mashes together a bunch of open-source libraries to create… well, a modern-day equivalent of HAL 9000.
Using OpenKinect as an interface, OpenCV to process the captured images, CMU Sphinx to recognize speech and Festvox to synthesize speech, one hacker was able to create a program that recognizes random real-world objects and communicates with a user in real time. He notes in the video’s description that it’s just a ‘proof of concept’, but one wonders how long it actually took him to cobble it together because the video was uploaded only a few days after OpenKinect’s release.
Proof of concept: “I’m wearing a headset because OpenKinect does not yet support audio input. All of the processing and recognition occurs in real time,” said the creator.