
For this week’s Learning Bit by Bit assignment we were asked to program a stop tokenizer that normalizes an input sentence as if it were being passed to a search engine, and to think about what makes a good stop list. We talked a bit about this in class last week, so I thought I’d begin by looking at what a typical search engine’s stop list might look like. I’m not sure how accurate the one I found is, as it seems a little aggressive, but it gives an idea. I’m not sure I would include words like “greetings” or words that specify position like “fifth” or “underneath”, but what do I know. What if you searched for “animals underneath the sea” and you found “animals above the sea”?? What then? The humanity.
I find it particularly hysterical that, after searching tirelessly through support forums for a technical issue, I inevitably resort to searching with aggression: “Just tell me how to get the f*cking jquery plugin to work!” I usually do this to lighten my mood and remind myself of the absurdity of my predicament, but sometimes I mimic the frustration other people have posted to a forum so well that the search turns out to be a good lead. So I find it interesting that this stop list has words like “necessary” but not “f*cking”. Maybe being an emphatic swarthy pirate searcher has its benefits.
To test this out, I brought in this stop list and tried some searches. I plopped the stop list into the class and added some punctuation to the beginning of it. Otherwise the class is literally just the example from the book.
I tried out:
Who's the best pirate?
START END TOKEN
15 21 |pirate|
Not so good, huh? Why would you want qualitative words like “best” in the stop list? I’m not sure. But, if we try:
Just tell me how to get the f*cking jquery plugin to work!
START END TOKEN
28 35 |f*cking|
36 42 |jquery|
43 49 |plugin|
53 57 |work|
57 58 |!|
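For anyone who wants to poke at this without the book's class, here's a minimal Python sketch of the same idea. To be clear, this is not the class I ran above: the regex, the tiny `STOP_WORDS` set, and the function name `stop_tokenize` are all made up for illustration, and the real stop list is far longer. It just shows the general technique of tokenizing, lowercasing, and dropping anything found in a stop set while keeping character offsets.

```python
import re

# Hypothetical, deliberately tiny stop set (an assumption for illustration only;
# the real list is much longer). "?" stands in for the punctuation added to it.
STOP_WORDS = {
    "who's", "the", "best", "just", "tell", "me", "how", "to", "get", "?",
}

# Assumed tokenization rule: runs of letters/digits, optionally joined by an
# internal "*" or "'" (so "f*cking" and "who's" stay whole), or else a single
# punctuation character.
TOKEN_RE = re.compile(r"[A-Za-z0-9]+(?:[*'][A-Za-z0-9]+)*|[^\sA-Za-z0-9]")


def stop_tokenize(text, stop_words=STOP_WORDS):
    """Return (start, end, token) for every lowercased token not in stop_words."""
    kept = []
    for match in TOKEN_RE.finditer(text):
        token = match.group().lower()
        if token not in stop_words:
            kept.append((match.start(), match.end(), token))
    return kept


if __name__ == "__main__":
    query = "Just tell me how to get the f*cking jquery plugin to work!"
    print("START END TOKEN")
    for start, end, token in stop_tokenize(query):
        print(f"{start} {end} |{token}|")
```

With this made-up stop set, the two sentences I tried come out with the same offsets and tokens as in the runs above. Swap in a different set and "best" survives, which is really the whole question of what belongs on the list.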
I’m really not sure what I’ve achieved, if anything…
[image credit: testpattern]
Hi there. I'm a design & code creative living, working and studying in sunny Brooklyn, NY. I'm currently finishing my thesis project at ITP and looking forward to what comes next.
Keywords: Design, User Experience, Interaction Design, Product Design, Visual Communication, Branding, Processing, Data Visualization, HTML, CSS, Javascript, Python
2010.09 — 2012.05 (expected)
Master of Professional Studies
Interactive Telecommunications Program (ITP)
Tisch School of the Arts, New York University
2010.09 — 2004.05
BA Visual Communications with minor in Art History
The George Washington University
Graduated Cum Laude
National Society of Collegiate Scholars
Spring 2003 semester at Sydney University, AU
2012.01 — present
Interaction Designer & Developer, SumAll, New York, NY
I'm currently working on an amazing data product with an incredible team here in SoHo. Check us out!
2011.06 — 2011.09
UX Designer, Microsoft Bing, Bellevue, WA
Worked with design, editorial, dev and program management teams to scope, design and develop prototypes for a soon-to-be-released Bing.com feature during a summer internship. The internship culminated in two presentations of the feature prototypes to senior leadership at Microsoft as well as the Bing design team.
2007.02 — 2010.08
Graphic & Interaction Designer, Empax, Inc., New York, NY
Created a range of environmental, print and interactive materials to promote nonprofit clients and their causes. Responsible for designing and presenting brand strategies, identities, print collateral, environmental signage, animation, user experience and interface, content management system setup and third-party plug-in and data integration, search engine optimization, user analytics and testing.
2006.12 — 2011.08
Freelance Graphic & Interaction Design Consultant, New York, NY
Worked as a sole proprietor with various clients from retail, music, film, nonprofit, real estate and technology industries to create and improve existing brand and user experiences across many platforms and media.
2004.04 — 2006.01
Graphic Designer, The George Washington University Communication & Creative Services, Washington, DC
Worked with project management and external production vendors to deliver a range of print and interactive material related to university publications and communications initiatives. Responsibilities included design and implementation of print collateral, posters, animation, environmental signage, web publication and press checks.
2011.07
Freakonomics (Web),
“What Would it Be Like to Climb 26 Years of Federal Spending?”
2011.04
Flowingdata (Web),
“Physically climb over budget data with Kinect”, by Nathan Yau
2011.02
Logo Lounge 6 (Book),
by Catharine Fishel and Bill Gardner, Rockport Publishers - Gedenk Logo
2010.12
“A Bartender That Pours The Perfect Shot, Every Shot”, by Matt Buchanan
2009.11
Basic Logos (Book),
by Index Book - The 2007 Gotham Awards Logo
2008.10
Print Magazine,
“Dialogue: Martin Kace”, by Steven Heller - The Alliance for Climate Protection Website
2010.12
ITP Winter show 2010, NYC
2011.04
Data Viz Challenge Party, hosted by Eyebeam and Google, NYC
2011.05
ITP Spring Show 2011, NYC
2006.01 — 2006.12
English Teacher, NOVA Japan, Kure-shi, Hiroshima-ken, Japan
Taught and mentored students of all ages and abilities in small to medium-sized classes to improve proficiency in English linguistics and conversation.