
Words in Your Mouth
The gist of the challenge is this: I want to develop a procedure and program that will allow me to take a fairly standardized interview video clip and corresponding transcript as input, and create a Max-Headroom-esque output of the interviewee saying anything we choose within the obvious limits of the vocabulary set available in the interview. And put it on the web.
Current Barriers & Constraints
The obvious alternative to this goal is to work with what is available, and that is subtitles at the sentence level. I might then be able to n-gram analysis at the sentence level and generate interesting conversation mashups. Even this goal has it’s challenge, which is how to take the start time for a subtitle and accurately estimate the duration of the entire utterance based on the character length of the subtitle. The good news is that this approach would definitely be more forgiving to slight inaccuracies in contrast to the word-level approach.
Before I get to the alternative, I’m trying to milk the original task as far as it will go before lowering the bar slightly. Right now, I’m looking into the possibilities of using Sphinx4‘s aligner audio file transcription process to generate time codes for each word in an audio file. I can just take the audio track from the video I want to transcribe, it will be the same length and should work. I can imagine that this won’t be any better than getting a Google voice transcription of a voicemail. We know how that goes.
Then taking this data, I would ideally be able to get some kind of video chopper process to output a bunch of independent clips. I have no idea what’s available for this process as of now. Please let me know if you have any ideas. The final step if all this works is to make some kind of web app to call a given number of clips and play them in sequence and hopefully the effect is at least funny. I’m thinking of approaching this by using a tornado python script, but I haven’t prototyped this yet so I’m still open to other options for this portion as well.
Hi there. I'm a design & code creative living, working and studying in sunny Brooklyn, NY. I'm currently finishing my thesis project at ITP and looking forward to what comes next.
Keywords: Design, User Experience, Interaction Design, Product Design, Visual Communication, Branding, Processing, Data Visualization, HTML, CSS, Javascript, Python
2010.09 — 2012.05 (expected)
Master of Professional Studies
Interactive Telecommunication Program (ITP)
Tisch School of the Arts, New York University
2010.09 — 2004.05
BA Visual Communications with minor in Art History
The George Washington University
Graduated Cum Laude
National Society of Collegiate Scholars
Spring 2003 semester at Sydney University, AU
2012.01 — present
Interaction Designer & Developer, SumAll, New York, NY
I'm currently working on an amazing data product with an incredible team here in SoHo. Check us out!
2011.06 — 2011.09
UX Designer, Microsoft Bing, Bellevue, WA
Worked with design, editorial, dev and program management teams to scope, design and develop prototypes for soon-to-be-released Bing.com feature during a summer internship. The internship culminated in two presentations of the feature prototypes to senior leadership at Microsoft as well as the Bing design team.
2007.02 — 2010.08
Graphic & Interaction Designer, Empax, Inc., New York, NY
Created a range of environmental, print and interactive materials to promote nonprofit clients and their causes. responsible for designing and presenting brand strategies, identities, print collateral, environmental signage, animation, user experience and interface, content management system setup and third party plug-in and data integration, search engine optimization, user analytics and testing.
2006.12 — 2011.08
Freelance Graphic & Interaction Design Consultant, New York, NY
Worked as a sole proprietor with various clients from retail, music, film, nonprofit, real estate and technology industries to create and improve existing brand and user experiences across many platforms and media.
2004.04 — 2006.01
Graphic Designer, The George Washington University Communication & Creative Services, Washington, DC
Worked with project management and external production vendors to deliver a range of print and interactive material related to university publications and communications initiatives. responsibilities included design and implementation of print collateral, posters, animation, environmental signage, web publication and press checks.
2011.07
Freakonomics (Web),
“What Would it Be Like to Climb 26 Years of Federal Spending?”
2011.04
Flowingdata (Web),
“Physically climb over budget data with Kinect”, by Nathan Yau
2011.02
Logo Lounge 6 (Book),
by Catharine Fishel and Bill Gardner, Rockport Publishers - Gedenk Logo
2010.12
“A Bartender That Pours The Perfect Shot, Every Shot”, by Matt Buchanan
2009.11
Basic Logos (Book),
by Index Book - The 2007 Gotham Awards Logo
2008.10
Print Magazine,
“Dialogue: Martin Kace”, by Steven Heller - The Alliance for Climate Protection Website
2010.12
ITP Winter show 2010, NYC
2011.04
Data Viz Challenge Party, hosted by Eyebeam and Google, NYC
2011.05
ITP Spring Show 2011, NYC
2006.01 — 2006.12
English Teacher, NOVA Japan, Kure-shi, Hiroshima-ken, Japan
Taught and mentored students of all ages and abilities in small to medium-sized classes to improve proficiency in english linguistics and conversation.