<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss'><id>tag:blogger.com,1999:blog-3022028357504155915</id><updated>2009-10-13T14:23:38.248-05:00</updated><title type='text'>A Sketchy Blog</title><subtitle type='html'>A collection of sketch recognition and haptics paper summaries.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><link rel='next' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default?start-index=26&amp;max-results=25'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>84</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-2749244378790071235</id><published>2009-06-11T23:29:00.003-05:00</published><updated>2009-06-12T11:02:12.091-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='user interfaces'/><category scheme='http://www.blogger.com/atom/ns#' term='change blindness'/><category scheme='http://www.blogger.com/atom/ns#' term='inattentional blindness'/><category scheme='http://www.blogger.com/atom/ns#' term='cognitive psychology'/><title type='text'>Change Blindness and Its Implications for Complex Monitoring and Control Systems Design and Operator Training</title><content type='html'>Durlach, P. 2004. Change blindness and its implications for complex monitoring and control systems design and operator training. &lt;i&gt;Hum.-Comput. Interact.&lt;/i&gt; 19, 4 (Dec. 2004), 423-451. DOI= http://dx.doi.org/10.1207/s15327051hci1904_10&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Durlach from The Army Research Institute discussed various aspects of change blindness's affects on important monitoring systems, such as airport traffic control.&lt;br /&gt;&lt;br /&gt;One factor mentioned in the study is that the longer between screen updates (e.g., distractions), the more likely change blindness occurs.  If the screen updates are almost non-existent, the changes are detected in 1-2 flashes.  If the blank screens are ~80 ms, the detections are seen in about 17 alterations.&lt;br /&gt;&lt;br /&gt;Other factors that affect change blindness include: distractions, discriminability (red vs. burgandy, red vs. white), categorization (tank vs. truck), biased serial search (rescanning same areas), amount of information, external attention capture, prior learning from a task (repeated or predictable change), meaningfulness of the change, and the user's expertise in the change area.&lt;br /&gt;&lt;br /&gt;To help eliminate change blindness, Durlach proposes to reduce screen clutter, make any items easily discriminable, and train users on the systems.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;There's no silver bullet to combat change blindness and inattentional blindness, and Durlach recognizes this.  Her suggestions make sense, and she has a great list of pros and cons to accompany them.  As tasks become more complex, there's always a sacrifice with making software that can handle the complexity while minimizing potential user errors.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-2749244378790071235?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/2749244378790071235/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=2749244378790071235' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/2749244378790071235'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/2749244378790071235'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2009/06/change-blindness-and-its-implications.html' title='Change Blindness and Its Implications for Complex Monitoring and Control Systems Design and Operator Training'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-4074883978334489513</id><published>2009-06-11T20:32:00.003-05:00</published><updated>2009-06-12T10:56:22.690-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='children'/><category scheme='http://www.blogger.com/atom/ns#' term='cognitive psychology'/><title type='text'>Beyond Modularity</title><content type='html'>Karmiloff-Smith, A. Beyond Modularity: &lt;span class="bodycopy"&gt;A Developmental Perspective on Cognitive Science.  MIT Press. November, 1992.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Piaget's child development theory&lt;/span&gt;&lt;span class="bodycopy"&gt; describes how children develop when their minds mature with age.  He observed transitions and major events when this occurs, such as when children learn object permanence.&lt;br /&gt;&lt;br /&gt;Karmiloff-Smith presents research challenging the assumption that growth happens in such steps.  Instead, the author shows how many human functions (language, math, physics, drawing) are innate in very young children even before they are verbal.  For instance, infants look longer at images that even humans would consider novel (such as objects that do not obey a perceived grouping, p. 68).&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;This book was really well-written and I enjoyed the break from regular computer science reading to read an almost pure psychology book.  The idea that drawing is innate in humans is reaffirms our lab's claim that sketching is "natural and intuitive".&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-4074883978334489513?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/4074883978334489513/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=4074883978334489513' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/4074883978334489513'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/4074883978334489513'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2009/06/beyond-modularity.html' title='Beyond Modularity'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-4530437892003244369</id><published>2009-06-11T15:08:00.002-05:00</published><updated>2009-06-11T15:36:26.475-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sketching'/><category scheme='http://www.blogger.com/atom/ns#' term='drawing'/><category scheme='http://www.blogger.com/atom/ns#' term='children'/><category scheme='http://www.blogger.com/atom/ns#' term='cognitive psychology'/><title type='text'>Drawing and the Non-verbal Mind</title><content type='html'>&lt;div style="text-align: left;"&gt;&lt;span&gt;&lt;a href="http://www.amazon.com/exec/obidos/search-handle-url/ref=ntt_athr_dp_sr_1?%5Fencoding=UTF8&amp;amp;search-type=ss&amp;amp;index=books&amp;amp;field-author=Chris%20Lange-K%C3%BCttner"&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;/a&gt;&lt;/span&gt;&lt;span style="color: rgb(0, 0, 0);"&gt;Lange-Kuttner, C. and Vitner, A. "Drawing and the Non-Verbal Mind: A Life-Span Perspective." &lt;/span&gt;Cambridge University Press, September 15, 2008.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The editors discussed hundreds of experiments dealing with drawings, most focused on children.&lt;br /&gt;&lt;br /&gt;Some interesting points of note are:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Young children (3-4 yrs) often cannot recognize their own drawings after some time has passed since the original. (p. 55)&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Children often have a "constant depiction strategy", such as drawing everything as a sunburst or as a scribbled dot.  The depiction looks closer to the actual object with age. (p. 64)&lt;br /&gt;&lt;/li&gt;&lt;li&gt;A drawing can be affected by the question and how the child interprets the objects, such as individual objects or in a group.  Grouping of objects can happen more often if the objects are similar: "two circles" vs. "circle and triangle" (pp. 165-173)&lt;br /&gt;&lt;/li&gt;&lt;li&gt;People suffering from diseases such as semantic dementia often forget the distinguishing characteristic of an object they should be drawing after a short period of time. (Rhino -&gt; generic animal, p. 286)&lt;/li&gt;&lt;/ol&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The findings presented are too numerous to list, so I simply mentioned the ones I found most interesting.  Actual child or mental development would be difficult to measure using sketch recognition techniques (the drawings are simply too abstract).  If I ever work with children, items 2 and 3 will probably be helpful with either distinguishing between children or simply with the phrasing of the questions to the children.&lt;br /&gt;&lt;span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-4530437892003244369?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/4530437892003244369/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=4530437892003244369' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/4530437892003244369'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/4530437892003244369'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2009/06/drawing-and-non-verbal-mind.html' title='Drawing and the Non-verbal Mind'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-6188136476377243978</id><published>2009-06-11T14:41:00.004-05:00</published><updated>2009-06-11T15:06:37.427-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='vision'/><category scheme='http://www.blogger.com/atom/ns#' term='orientation specificity'/><category scheme='http://www.blogger.com/atom/ns#' term='visual cortex'/><title type='text'>Brain Mechanisms of Vision</title><content type='html'>&lt;span&gt;Hubel DH, Wiesel TN. Brain Mechanisms of Vision. Scientific American. 1979 Sep; 241(3):150-62&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The brain's primary visual cortex processes images in a modular, distorted way.  The rods and cones in the eyes send messages from the retina to the geniculate cells in the brain, which then relay the message to the visual cortex.  These geniculate cells are in a layer called Layer IV, are unsophisticated, and receive the bulk of the visual input.&lt;br /&gt;&lt;br /&gt;Cells outside of layer IV have "orientation specificity", where a bar of light falling in a certain orientation will activate some cells and have no affect on others.  The response for each cell appears to be around 10-20 degrees, at which point the response is lessened or abolished. &lt;br /&gt;&lt;br /&gt;At the time (1979), there was no evidence to support that the orientation specific cells had anything to do with visual perception.&lt;br /&gt;&lt;br /&gt;As electric signals moved into more complex layers of the visual cortex, some patterns emerged.  Cells close to one another often have the same optimal stimulus orientation.  Changes in orientation happen in small increments, such as 25-50 micrometers between cell groups mapping to a change of 10 degrees with varying direction reversals.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Really interesting information on how the structure and hierarchy of the primary visual cortex.  Although the orientation information did not prove that the brain recognizes shapes using features such as line orientations, other papers citing this one might.  I'll have to find some...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-6188136476377243978?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/6188136476377243978/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=6188136476377243978' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/6188136476377243978'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/6188136476377243978'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2009/06/brain-mechanisms-of-vision.html' title='Brain Mechanisms of Vision'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-5921551193433891602</id><published>2009-06-11T13:40:00.003-05:00</published><updated>2009-06-11T15:07:05.436-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='change blindness'/><category scheme='http://www.blogger.com/atom/ns#' term='inattentional blindness'/><category scheme='http://www.blogger.com/atom/ns#' term='HCI'/><category scheme='http://www.blogger.com/atom/ns#' term='cognitive psychology'/><title type='text'>Unseen and Unaware:  Implications of Recent Research on Failures of Visual Awareness for Human–Computer Interface Design</title><content type='html'>Varakin, D. A., Levin, D. T., and Fidler, R. 2004. Unseen and unaware: implications of recent research on failures of visual awareness for human-computer interface design. &lt;i&gt;Hum.-Comput. Interact.&lt;/i&gt;&lt;span&gt; 19, 4 (Dec. 2004), 389-422. DOI= &lt;a href="http://dx.doi.org/10.1207/s15327051hci1904_9" class="smarterwiki-linkify"&gt;http://dx.doi.org/10.1207/s15327051hci1904_9&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The authors mention some research on inattentional blindness and change blindness and provide anecdotal evidence of their usage in computer interfaces&lt;br /&gt;&lt;/span&gt;&lt;ul&gt;&lt;li&gt;&lt;span&gt;Inattentional Blindness - user is unaware of a change occurring within &lt;span style="font-style: italic;"&gt;the same&lt;/span&gt; field of view&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span&gt;Change Blindess - user is unaware of a change occurring across &lt;span style="font-style: italic;"&gt;multiple&lt;/span&gt; views&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-5921551193433891602?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/5921551193433891602/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=5921551193433891602' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/5921551193433891602'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/5921551193433891602'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2009/06/unseen-and-unaware-implications-of.html' title='Unseen and Unaware:  Implications of Recent Research on Failures of Visual Awareness for Human–Computer Interface Design'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-4494373417873192082</id><published>2009-06-11T12:42:00.011-05:00</published><updated>2009-06-11T15:07:17.840-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='change blindness'/><category scheme='http://www.blogger.com/atom/ns#' term='cognitive psychology'/><title type='text'>Change blindness: past, present, and future</title><content type='html'>&lt;span&gt;Daniel J. Simons, Ronald A. Rensink, Change blindness: past, present, and future, Trends in Cognitive Sciences, Volume 9, Issue 1, January 2005, Pages 16-20, ISSN 1364-6613, DOI: 10.1016/j.tics.2004.11.006. (http://www.sciencedirect.com/science/article/B6VH9-4DXTHVD-2/2/d3451247e53c70b0b390450a275a475a)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;span&gt;The authors provide an overview of change blindness understanding, such as how research has shown that change blindness occurs often during eye movement or when a user's attention wanes.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span&gt;The main contribution of the paper is the idea that change blindness research does not confirm the thought that visual representations of a scene are 'sparse'.  The authors propose four requirements for a change blindness to reaffirm the idea of sparse representations:&lt;/span&gt;&lt;br /&gt;&lt;ol style=""&gt;&lt;li&gt;Evidence must eliminate the possibility that detailed visual representations exist by fade from memory before the representations can be compared with others to perceive changes&lt;/li&gt;&lt;li&gt;Evidence must eliminate the possibility that detailed visual representations exist, but in a different visual processing section (of the brain?) that cannot compare with the currently viewed representation for change detection&lt;/li&gt;&lt;li&gt;Evidence must eliminate the possibility that any stored detailed representation is in a format that cannot be compared with another representation&lt;/li&gt;&lt;li&gt;Evidence must eliminate the possibility that both the stored detailed representation and the viewed representation can be compared, but are not for some reason&lt;/li&gt;&lt;/ol&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span&gt;The paper's final thoughts on how a representation are stored do not concern me.  Instead, this paper has a wide bibliography of change blindness research that should help me to look for related work.&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-4494373417873192082?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/4494373417873192082/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=4494373417873192082' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/4494373417873192082'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/4494373417873192082'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2009/06/change-blindness-past-present-and.html' title='Change blindness: past, present, and future'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-4915181951384094157</id><published>2009-02-22T22:33:00.003-06:00</published><updated>2009-02-22T23:35:41.302-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sketch recognition'/><category scheme='http://www.blogger.com/atom/ns#' term='psychology'/><title type='text'>CogSketch: Open-domain sketch understanding for cognitive science research and for education</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;CogSketch presents a sketch recognition system wrapped in psychological syntax.  Users draw single stroke &lt;span style="font-style: italic;"&gt;glyphs&lt;/span&gt; that can be &lt;span style="font-style: italic;"&gt;containment glyphs &lt;/span&gt;(symbols) or &lt;span style="font-style: italic;"&gt;connection glyphs&lt;/span&gt; (relationships).  Glyphs are recognized through a focused knowledge base of information that can be specified by the user.  Inter-glyph relationships are computed using &lt;a href="http://portal.acm.org/citation.cfm?id=225387"&gt;RCC-8&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;On the interface side, the system contains layers that have modes.&lt;br /&gt;&lt;br /&gt;Lastly, CogSketch has simulations that can be conducted.  The two simulations are analogies (A is to B as C is to ?) and spatial language learning (inside, above, below, etc.).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-4915181951384094157?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/4915181951384094157/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=4915181951384094157' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/4915181951384094157'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/4915181951384094157'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2009/02/cogsketch-open-domain-sketch.html' title='CogSketch: Open-domain sketch understanding for cognitive science research and for education'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-4242026568796731271</id><published>2009-02-22T16:15:00.003-06:00</published><updated>2009-02-22T16:38:52.873-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='graphical passwords'/><title type='text'>Do Background Images Improve “Draw a Secret” Graphical Passwords?</title><content type='html'>Dunphy, P. and Yan, J. 2007. Do background images improve "draw a secret" graphical passwords?. In &lt;i&gt;Proceedings of the 14th ACM Conference on Computer and Communications Security&lt;/i&gt; (Alexandria, Virginia, USA, October 28 - 31, 2007). CCS '07. ACM, New York, NY, 36-47.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Summary&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The authors use DAS passwords in conjunction with background images in order to improve the complexity of the passwords without harming user recall.  A user would typically choose a small portion of an image to draw on, which could increase the complexity of the password if the image itself was complex.&lt;br /&gt;&lt;br /&gt;The paper contains great user studies focusing on the recall of passwords, the complexity of images, what images users chose to draw on, and what recall errors occurred.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;This is another DAS paper, and, like the previously blogged one, shows how much room the graphical password field has to grow.  The studies in this paper were phenomenally thorough, and if we ever start a sketching passwords project this is the paper we should all read.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-4242026568796731271?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/4242026568796731271/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=4242026568796731271' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/4242026568796731271'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/4242026568796731271'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2009/02/do-background-images-improve-draw.html' title='Do Background Images Improve “Draw a Secret” Graphical Passwords?'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-7200129360265405404</id><published>2009-02-22T14:43:00.003-06:00</published><updated>2009-02-22T16:15:01.434-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='graphical passwords'/><title type='text'>Graphical Passwords &amp; Qualitative Spatial Relations</title><content type='html'>Lin, D., Dunphy, P., Olivier, P., and Yan, J. 2007. Graphical passwords &amp;amp; qualitative spatial relations. In &lt;i&gt;Proceedings of the 3rd Symposium on Usable Privacy and Security&lt;/i&gt; (Pittsburgh, Pennsylvania, July 18 - 20, 2007). SOUPS '07, vol. 229. ACM, New York, NY, 161-162.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Summary&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The authors modify the Draw-a-Secret (DAS) scheme where users draw a graphical password in a grid so that the "looking over the shoulder" phenomenon could be reduced.  A DAS password is a simple encoding of a drawn stroke using a grid and directions, such as up, right, up.&lt;br /&gt;&lt;br /&gt;The extended abstract presents a Qualitative Draw-a-Secret (QDAS) scheme that changes DAS by first assigning a number to each grid.  Then, the grid itself varies based on the direction changes of the stroke.  The grid changes based on cell height and width.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Although this extended abstract wasn't too informative, it did give me some thoughts about how we could use sketch recognition techniques to improve upon drawn passwords.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-7200129360265405404?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/7200129360265405404/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=7200129360265405404' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/7200129360265405404'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/7200129360265405404'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2009/02/graphical-passwords-qualitative-spatial.html' title='Graphical Passwords &amp; Qualitative Spatial Relations'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-7459126366619515095</id><published>2009-02-16T22:13:00.007-06:00</published><updated>2009-02-16T22:51:43.268-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sketching'/><category scheme='http://www.blogger.com/atom/ns#' term='psychology'/><category scheme='http://www.blogger.com/atom/ns#' term='learning'/><category scheme='http://www.blogger.com/atom/ns#' term='assist'/><title type='text'>SKIT: A Computer-Assisted Sketch Instruction Tool</title><content type='html'>Greg Coombe and Brian Salomon&lt;br /&gt;Department of Computer Science, University of North Carolina&lt;br /&gt;&lt;br /&gt;This paper discusses a system, SKIT, that assists users in sketching line drawings by using some artist techniques.  The system breaks a full outline sketch into subdrawings that the user can draw at various sizes.  This allows the user to see the model they are drawing as a set of smaller, more geometric objects.  The subdrawings are then merged back together in the end.&lt;br /&gt;&lt;br /&gt;The small user study was more qualitative and showed users improved when using SKIT.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The paper presents a technique or two that might be helpful in starting a user-training program for sketching.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-7459126366619515095?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/7459126366619515095/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=7459126366619515095' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/7459126366619515095'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/7459126366619515095'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2009/02/skit-computer-assisted-sketch.html' title='SKIT: A Computer-Assisted Sketch Instruction Tool'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-4982954658763849582</id><published>2008-04-28T16:58:00.002-05:00</published><updated>2008-04-28T17:05:33.572-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='HMM'/><category scheme='http://www.blogger.com/atom/ns#' term='vision'/><category scheme='http://www.blogger.com/atom/ns#' term='hand gesture'/><title type='text'>Invariant features for 3-D gesture recognition</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Campbell &lt;span style="font-style: italic;"&gt;et al.&lt;/span&gt; use HMMs and a list of features to find a good recognition rate for a set of T'ai Chi  gestures that are performed by users in a swivel chair; a hand gesture's change in polar coordinates provided the highest recognition for the 18 gestures tested.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Performing T'ai Chi in a chair kind of defeats the purpose of T'ai Chi.  That's like trying to study race car drivers by observing people who take the bus.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-4982954658763849582?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/4982954658763849582/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=4982954658763849582' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/4982954658763849582'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/4982954658763849582'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/04/invariant-features-for-3-d-gesture.html' title='Invariant features for 3-D gesture recognition'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-5726030902336992649</id><published>2008-04-23T16:58:00.007-05:00</published><updated>2009-06-11T20:58:24.530-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='user interfaces'/><category scheme='http://www.blogger.com/atom/ns#' term='3D modeling'/><title type='text'>FreeDrawer - A Free-Form Sketching System on the Responsive Workbench</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;span&gt;&lt;br /&gt;&lt;br /&gt;Wesche &lt;span style="font-style: italic;"&gt;et al.&lt;/span&gt; created a 3D sketching tool where the skeleton of a model is created.  A user can draw curves in a virtual space.  A new curve can be drawn anywhere, but additional curves must be merged with the present model.  Altering curves can be done on a local or global scale.  Surfaces can be filled in at closed curve loops.  Surfaces can also be smoothed.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;This paper had some nice pictures, but very little material was actually presented.  How does the computer know where the pen point is?  How does the user interact with the pen?  Range of motion?  Gestures?&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-5726030902336992649?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/5726030902336992649/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=5726030902336992649' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/5726030902336992649'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/5726030902336992649'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/04/freedrawer-free-form-sketchign-system.html' title='FreeDrawer - A Free-Form Sketching System on the Responsive Workbench'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-3588910954660038410</id><published>2008-04-23T16:57:00.012-05:00</published><updated>2008-04-23T22:49:12.280-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='user interfaces'/><category scheme='http://www.blogger.com/atom/ns#' term='vision'/><category scheme='http://www.blogger.com/atom/ns#' term='motion tracking'/><title type='text'>Interacting with human physiology</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;The authors &lt;span&gt;Pavlidis&lt;/span&gt;&lt;span style="font-style: italic;"&gt; et al.&lt;/span&gt; propose a system to monitor humans for stress levels and altered psychological states using high-end infrared cameras.  This system could then be used for a variety of purposes such as stress management of UIs, illness detection, or lie detection.&lt;br /&gt;&lt;br /&gt;The system tracks the user's face through &lt;span style="font-style: italic;"&gt;tandem tracking&lt;/span&gt; to track a small, keys section of the face.  These sections include the nose, forehead, and temporal regions.  The tracker models each region by its center of mass and orientation.  Blood flow is tracked in the face through a perfusion model and directional model.  The model involves a differential equation set to measure the "volumetric metabolic heat" flow in the face.  Other measurements tracked include pulse, heat transfer in areas, and breathing rate.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The ideas behind this system are great, although talking with &lt;span&gt;Pavlidis showed us that there are issues with the current system's usability.  Sweat and minor body temperature fluctuations can alter the system's reliability (since the system is trying to measure minor fluctuations).  &lt;/span&gt;Unfortunately, the cost for one of these high-end cameras is $60k, so we won't be seeing this any time soon.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-3588910954660038410?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/3588910954660038410/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=3588910954660038410' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/3588910954660038410'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/3588910954660038410'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/04/interacting-with-human-physiology.html' title='Interacting with human physiology'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-508184901789343455</id><published>2008-04-23T16:56:00.009-05:00</published><updated>2008-04-24T14:51:07.611-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='user interfaces'/><category scheme='http://www.blogger.com/atom/ns#' term='augmented reality'/><category scheme='http://www.blogger.com/atom/ns#' term='hand tracking'/><category scheme='http://www.blogger.com/atom/ns#' term='grasp'/><category scheme='http://www.blogger.com/atom/ns#' term='3D modeling'/><title type='text'>3D Object Modeling Using Spatial and Pictographic Gestures</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span&gt;Nishino &lt;span style="font-style: italic;"&gt;et al.&lt;/span&gt; designed a 3D object modeling system that uses stereoscopic glasses, CyberGloves, and polhemus trackers.&lt;br /&gt;&lt;br /&gt;The system allows the creation of superellipsoids that can have smooth or squarish parameters.  These primitive shapes can be bent, stretched, twisted, and merged with other shapes.  Hand postures control these actions, such as grasping and pointing.  Virtual hands are displayed on a 200-inch arched screen, along with the object, in stereoscopic mode.  The virtual hands allow the user to easily see where they can touch and modify the 3D model.&lt;br /&gt;&lt;br /&gt;The authors tested the system by having users attempt two types of objects: symmetric and asymmetric.  The symmetric object was a bottle, and the asymmetric object was a teapot. &lt;br /&gt;Creation of the objects took up to 120 minutes.  The size of the stored objects was much less than a competing program, Open Inventor.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;For a paper in 1998, this was a pretty advanced system and seemed to offer some benefits over other systems.  I would have liked to have seen feedback from the users, though, since I'm not sure how hard the system is to use.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-508184901789343455?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/508184901789343455/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=508184901789343455' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/508184901789343455'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/508184901789343455'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/04/3d-object-modeling-using-spatial-and.html' title='3D Object Modeling Using Spatial and Pictographic Gestures'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-1346781579713340320</id><published>2008-04-23T16:56:00.006-05:00</published><updated>2008-04-23T17:28:32.015-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='HMM'/><category scheme='http://www.blogger.com/atom/ns#' term='gesture'/><category scheme='http://www.blogger.com/atom/ns#' term='multimodal'/><category scheme='http://www.blogger.com/atom/ns#' term='speech'/><title type='text'>Toward Natural Gesture/Speech HCI: A Case Study of Weather Narration</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;span&gt;&lt;br /&gt;&lt;br /&gt;Poddar &lt;span style="font-style: italic;"&gt;et al.&lt;/span&gt; use HMMs and speech to recognize complementary keyword-gesture pairs in weather reporting.  The multi-modal system would combine speech and gesture recognition to improve recognition accuracy.&lt;br /&gt;&lt;br /&gt;HMMs work well for separating 20 isolated gestures from weather narrators, but continuous gestures drop the accuracy to between 50-80% for sequences.  Co-occurrence analysis seeks to study the meaning behind keywords occurring with gestures, the frequency behind these occurrences, and the temporal alignment of the gestures and keywords.  A table they presented shows that certain types of gestures (contour, area, and pointing) are more heavily associated with certain keywords ("here", "direction", "location").  The accuracy of recognizing the gestures can improve with both video (gesture) and audio (keywords).&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;This paper does not necessarily add much to solutions, but it was written ten years ago and did show some nice results that combining speech and gesture can improve recognition.  Since the authors did not use a speech recognition system, the errors with that system would also produce interesting results that differ from the given accuracies.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-1346781579713340320?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/1346781579713340320/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=1346781579713340320' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/1346781579713340320'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/1346781579713340320'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/04/toward-natural-gesturespeech-hci-case.html' title='Toward Natural Gesture/Speech HCI: A Case Study of Weather Narration'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-7873943278829945145</id><published>2008-04-17T17:06:00.006-05:00</published><updated>2008-04-23T16:56:07.501-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='vision'/><category scheme='http://www.blogger.com/atom/ns#' term='gesture'/><category scheme='http://www.blogger.com/atom/ns#' term='bayesian networks'/><title type='text'>Discourse Topic and Gestural Form</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Eisenstein &lt;span style="font-style: italic;"&gt;et al.&lt;/span&gt; applied an unsupervised learning technique and a Bayesian network model to study the correlation between gestures and presentation topics.&lt;br /&gt;&lt;br /&gt;Their system looks at "interest points" within a video image, where each interest point is said to have come from a mixture model.  Interest points from a similar model are clustered together to create a codebook.  A hidden variable determines whether the observation gesture codeword is from a topic-specific or a speaker-specific distribution.  The authors use a Bayesian model to learn what distribution each gesture belongs to, based off Gaussians of feature vectors.&lt;br /&gt;&lt;br /&gt;The system was tested with fifteen users giving 33 presentations picked from five topics.  The experiments show that with correct labels, the topic-specific gestures account for 12% of the gestures, whereas corrupting these labels drops the average to 3%.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;This paper is a good start to a longer study on how to incorporate topic-specific gestures into recognition systems.  Finding these gestures can help computers understand what topics might be presented, as well as what speakers are presenting a topic or if a speaker is veering off-topic.  The system can then be used for speech training, presentation classification, or assistance (Clippy).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-7873943278829945145?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/7873943278829945145/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=7873943278829945145' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/7873943278829945145'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/7873943278829945145'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/04/discourse-topic-and-gestural-form.html' title='Discourse Topic and Gestural Form'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-1374407887871868119</id><published>2008-04-14T15:52:00.004-05:00</published><updated>2008-04-17T17:02:34.010-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='adaboost'/><category scheme='http://www.blogger.com/atom/ns#' term='grasp'/><category scheme='http://www.blogger.com/atom/ns#' term='glove'/><title type='text'>Feature selection for grasp recognition from optical markers</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Chang et al. reduced the number of markers needed on a vision-based hand grasp system from 30 to 5 while retaining around a 90% recognition rate.&lt;br /&gt;&lt;br /&gt;Six different grasps are used for classification: cylindrical, spherical, lumbrical, two-finger pinch, tripod, and lateral tripod.   The posterior probabilities for a class &lt;span style="font-style: italic;"&gt;y&lt;span style="font-size:78%;"&gt;k&lt;/span&gt;&lt;/span&gt; are modeled with a softmax function, which divides the exp value of an observation sequence with the class weights, divided by the sum of all exp(weights * obs) values.&lt;br /&gt;&lt;br /&gt;The weight values are determined by maximum conditional likelihood estimation from the training set of observations and classes (&lt;span style="font-style: italic;"&gt;X, Y&lt;/span&gt;).  Gradient descent is used to find the log likelihood with respect to the weights.  Input features are found using a "sequential wrapper algorithm" that examines one feature at a time with respect to a target class.&lt;br /&gt;&lt;br /&gt;Grasp data measured 38 objects being grasped with a full set of 30 markers.  An "optimal", small set of markers was chosen by forward and backward selection.&lt;br /&gt;&lt;br /&gt;The results indicate that the small marker set of 5 markers has between a 92-97% "accuracy retention" rate.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;Reducing the number of sensors using the forward and backward selection is nice, but simply having a few more sensors increases the accuracy to the actual plateau point.  From 10 on there is almost no change in accuracy, but between 5 and 10 sensors the accuracy can jump 5%, or 1/20, which is a huge percentage when taking into account user frustration.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-1374407887871868119?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/1374407887871868119/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=1374407887871868119' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/1374407887871868119'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/1374407887871868119'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/04/feature-selection-for-grasp-recognition.html' title='Feature selection for grasp recognition from optical markers'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-8110036810309405512</id><published>2008-04-14T13:53:00.005-05:00</published><updated>2008-04-14T14:27:25.051-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='neural networks'/><category scheme='http://www.blogger.com/atom/ns#' term='gesture'/><category scheme='http://www.blogger.com/atom/ns#' term='speech'/><category scheme='http://www.blogger.com/atom/ns#' term='hand tracking'/><category scheme='http://www.blogger.com/atom/ns#' term='glove'/><category scheme='http://www.blogger.com/atom/ns#' term='hand gesture'/><title type='text'>Glove-TalkII--A Neural-Network Interface which Maps Gestures to Parallel Formant Speech Synthesizer Controls</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Fels and Hinton created Glove-TalkII, a system designed to synthesize voice using complicated glove and feet controls.&lt;br /&gt;&lt;br /&gt;The artificial vocal track (AVT) is controlled using a CyberGlove, ContactGlove, polhemus sensor, and foot pedal.  The ContactGlove controls 9 stop consonants, such as CH, T, and NG.  The foot pedal controls the volume of the speech.  Hand position corresponds to a vowel sound.  Hand postures map to nonstop consonant phonemes.&lt;br /&gt;&lt;br /&gt;The neural networks used include a vowel/consonant network to determine if the sensors are reading a vowel or consonant, and then separate vowel and consonant networks to distinguish between the phonemes.&lt;br /&gt;&lt;br /&gt;A single user had to undergo 100 hours of training to be able to use the system.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;Impractical.  I'm shocked that they had someone train the system for 100 hours, and the fact that it takes a person that long to train the system should indicate that this is a poor way to synthesize voice.  The person's final voice is even described as "intelligible and somewhat natural-sounding", which is not a good complement.&lt;br /&gt;&lt;br /&gt;Requiring a person to walk around with a one-handed keyboard and type their words is a better solution.  The keyboard wouldn't even have a foot pedal.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-8110036810309405512?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/8110036810309405512/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=8110036810309405512' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/8110036810309405512'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/8110036810309405512'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/04/glove-talkii-neural-network-interface.html' title='Glove-TalkII--A Neural-Network Interface which Maps Gestures to Parallel Formant Speech Synthesizer Controls'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-8968284448029145691</id><published>2008-04-09T14:15:00.006-05:00</published><updated>2008-04-09T14:33:16.653-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='robotics'/><category scheme='http://www.blogger.com/atom/ns#' term='RFID'/><title type='text'>RFID-enabled Target Tracking and Following with a Mobile Robot Using Direction Finding Antennas</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Kim &lt;span style="font-style: italic;"&gt;et al.&lt;/span&gt; use dual-direction antennas to find the direction of arrival for RF signals transmitted from an RFID tag.  The two spiral antennas are perpendicular to each other and their signal strengths are different depending on the angle to the RFID tag.&lt;br /&gt;&lt;br /&gt;Obstacles in front of the antennas/tag increase the error in determining the direction.  The object can still be tracked, though.  In experimental results it worked pretty well.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;It works pretty well for its domain.  Probably less accurate for incredibly small movements (e.g., finger bends).  Seems like every now and then it goes crazy off-track (Figure 8).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-8968284448029145691?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/8968284448029145691/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=8968284448029145691' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/8968284448029145691'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/8968284448029145691'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/04/rfid-enabled-target-tracking-and.html' title='RFID-enabled Target Tracking and Following with a Mobile Robot Using Direction Finding Antennas'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-7910833979943963199</id><published>2008-04-04T11:47:00.003-05:00</published><updated>2008-04-04T12:23:32.170-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='gesture'/><category scheme='http://www.blogger.com/atom/ns#' term='music'/><title type='text'>Gesture Recognition Using an Acceleration Sensor and Its Application to Musical Performance Control</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Sawada and Hasimoto use accelerometer data to extract features of gestures and create a music tempo system.&lt;br /&gt;&lt;br /&gt;The extracting of features is basic:  projections onto certain planes, such as &lt;span style="font-style: italic;"&gt;xy&lt;/span&gt; or &lt;span style="font-style: italic;"&gt;yz&lt;/span&gt;, and the bounding box of the acceleration values.  Changes of acceleration are measured using a fuzzy partition of radial angles.&lt;br /&gt;&lt;br /&gt;The authors recognize or classify gestures using squared error.  The actual gesture recognition is trivial.&lt;br /&gt;&lt;br /&gt;The music tempo program is where the paper is more interesting as the system has to predict where a beat has been hit in real-time.  Systems already existed where a marker is placed on a baton, but the visual processing of these systems usually has a delay of 0.1s (in 1997 computational power).  In the author's system, gestures for up, down, and diagonal swings are used to indicate tempo.  Other gestures can map to other elements of conducting.&lt;br /&gt;&lt;br /&gt;A score is stored in the computer and the user conducts to the score.  Often the computer and human are slightly off, and the two try to balance to each other.  A simple function for balancing the tempo is given.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The system they use isn't a true conducting system since it relies on defined (and trained) gestures, but the ideas behind the tempo system are good and the simple execution and equations are appreciated.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-7910833979943963199?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/7910833979943963199/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=7910833979943963199' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/7910833979943963199'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/7910833979943963199'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/04/gesture-recognition-using-acceleration.html' title='Gesture Recognition Using an Acceleration Sensor and Its Application to Musical Performance Control'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-973930034550000699</id><published>2008-04-02T15:32:00.005-05:00</published><updated>2008-04-04T11:47:25.366-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='vision'/><category scheme='http://www.blogger.com/atom/ns#' term='RFID'/><title type='text'>Activity Recognition using Visual Tracking and RFID</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Krahnstoever &lt;span style="font-style: italic;"&gt;et al.&lt;/span&gt; use RFID tags in conjunction with computer vision tracking to interpret what is happening within a scene.&lt;br /&gt;&lt;br /&gt;A person model tracks a human's movement through their head and hands.  The head is a 3D cartesian coordinate location, and each hand is described in spherical coordinates (r, phi, theta) with respect to the head.  The models for where the head will be, &lt;span style="font-style: italic;"&gt;p&lt;/span&gt;(X^&lt;span style="font-style: italic;"&gt;t&lt;/span&gt;_&lt;span style="font-style: italic;"&gt;h&lt;/span&gt;, X^&lt;span style="font-style: italic;"&gt;t-1&lt;/span&gt;_&lt;span style="font-style: italic;"&gt;h&lt;/span&gt;), and the hands &lt;span style="font-style: italic;"&gt;p&lt;/span&gt;(X^&lt;span style="font-style: italic;"&gt;t&lt;/span&gt;_&lt;span style="font-style: italic;"&gt;q&lt;/span&gt;, X^&lt;span style="font-style: italic;"&gt;t-1&lt;/span&gt;_&lt;span style="font-style: italic;"&gt;h&lt;/span&gt;) had to be learned.  The priors &lt;span style="font-style: italic;"&gt;p&lt;/span&gt;(X_&lt;span style="font-style: italic;"&gt;q&lt;/span&gt; | X_&lt;span style="font-style: italic;"&gt;h&lt;/span&gt;) also had to be learned.  Both hands and head are segmented using skin color.&lt;br /&gt;&lt;br /&gt;Each pixel within a given image frame can belong to either the background or the foreground (body part).  The likelihood for an image given the observations is taken to be the Improved Iterative Scaling (IIS) of the image section and bounding box of a body part, summed over the parts and sections.  I have no idea how IIS works.&lt;br /&gt;&lt;br /&gt;RFID tags provide movement and orientation information in 3D spaces.  The amount of charge the RFID tag receives depends on its angle to the wave source, where a perpendicular angle receives no energy and a parallel angle is the greatest.  The tag then outputs the tag's ID, orientation, and field strength to the signal.&lt;br /&gt;&lt;br /&gt;The authors use the RFID information along with the hand and head positions to interpret what is happening in a scene.  Agents are somehow used to do this.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The RFID information looks like it helps recognize what is happening within a scene, but I would have liked to have seen a comparison between a pure vision system and a system with the RFID.  This could be a bit difficult, but it might help the strength of the paper.&lt;br /&gt;&lt;br /&gt;I also would have liked an actual description of the activity agent system.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-973930034550000699?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/973930034550000699/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=973930034550000699' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/973930034550000699'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/973930034550000699'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/04/activity-recognition-using-visual.html' title='Activity Recognition using Visual Tracking and RFID'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-3305302858179745393</id><published>2008-03-31T13:23:00.003-05:00</published><updated>2008-04-23T16:59:22.773-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='HMM'/><category scheme='http://www.blogger.com/atom/ns#' term='gesture'/><title type='text'>Enabling fast and effortless customisation in accelerometer based gesture interaction</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Mäntyjärvi &lt;span style="font-style: italic;"&gt;et al. &lt;/span&gt;apply discrete HMMs to accelerometer data for gesture recognition.  The authors had a previous study that indicated users prefer defining their own gestures, or they prefer intuitive gestures.&lt;br /&gt;&lt;br /&gt;The authors add noise into the gestures to increase the recognition of user-defined gestures under certain conditions.  This supposedly speeds up the training process since less gestures need to be "drawn".  Adding Gaussian noise versus uniform noise might improve the recognition.  But not really.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;This paper changed courses in the middle and moved from customization to noise addition.  The gesture set they tested on was super easy and can be done with Rubine's recognizer.  I'd like to see some data that users created and the differences between the user-defined gesture and the DVD gestures.&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-3305302858179745393?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/3305302858179745393/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=3305302858179745393' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/3305302858179745393'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/3305302858179745393'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/03/enabling-fast-and-effortless.html' title='Enabling fast and effortless customisation in accelerometer based gesture interaction'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-5713079949568680995</id><published>2008-03-27T12:49:00.002-05:00</published><updated>2008-03-27T13:01:04.956-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='wii remote'/><category scheme='http://www.blogger.com/atom/ns#' term='HMM'/><category scheme='http://www.blogger.com/atom/ns#' term='gesture'/><title type='text'>Gesture Recognition with a Wii Controller</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Schlomer &lt;span style="font-style: italic;"&gt;et al.&lt;/span&gt; showed that the Wii controller is pretty good at recognizing tennis gestures.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;a href="http://www.amazon.com/Nintendo-RVL-RSPE-USA-Wii-Sports/dp/B000TK0G78/ref=sr_1_1?ie=UTF8&amp;amp;s=videogames&amp;amp;qid=1206640715&amp;amp;sr=1-1"&gt;Here's a good evaluation study.&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-5713079949568680995?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/5713079949568680995/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=5713079949568680995' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/5713079949568680995'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/5713079949568680995'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/03/gesture-recognition-with-wii-controller.html' title='Gesture Recognition with a Wii Controller'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-5991796310059840576</id><published>2008-03-27T12:35:00.003-05:00</published><updated>2008-03-27T12:49:53.254-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='user interfaces'/><category scheme='http://www.blogger.com/atom/ns#' term='haptics'/><category scheme='http://www.blogger.com/atom/ns#' term='user study'/><title type='text'>SPIDAR G&amp;G: A Two-Handed Haptic Interface for Bimanual VR Interaction</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Murayama &lt;span style="font-style: italic;"&gt;et al.&lt;/span&gt; presented a two-handed computer control device that allowed the manipulation of on-screen objects.  The system, called SPIDAR G&amp;amp;G, consisted of two balls suspended in two horseshoe apparatus with six strings each.  The user moved these balls with six degrees of freedom, which translated onto a cursor or object on the computer.  The strings also had pull and resisted movement through small motors.  Each ball included a pressure button that detected grip.&lt;br /&gt;&lt;br /&gt;The authors evaluated the system using a pointer and a target object.  The users had to manipulate the pointer and object with both balls in order to accomplish a goal.  Three people tested their system and found that the use of two SPIDAR balls, as opposed to one and a keyboard, allowed the users to manipulate the objects faster.  Also, haptic feedback helped.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Although the system sounds interesting, I have a lot of issues with the evaluation.  The authors used only three people familiar with VR interfaces, which is quite low.  A greater concern is that the system was only tested against another form of itself.  SPIDAR G&amp;amp;G was only compared against SPIDAR G + keyboard, when really SPIDAR G&amp;amp;G should have been compared to a mouse and keyboard interface, or a joystick and mouse, or two joysticks, or a roller ball, or any number of more common peripherals.  As is stands, I have no basis to say that the suspended ball manipulation method is any better than traditional interfaces.  The only definite conclusion is that two balls are better than one, and having the balls touch back is beneficial.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-5991796310059840576?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/5991796310059840576/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=5991796310059840576' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/5991796310059840576'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/5991796310059840576'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/03/spidar-g-two-handed-haptic-interface.html' title='SPIDAR G&amp;G: A Two-Handed Haptic Interface for Bimanual VR Interaction'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3022028357504155915.post-7513651492683413587</id><published>2008-03-23T14:19:00.003-05:00</published><updated>2008-03-23T14:36:27.441-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sign language'/><category scheme='http://www.blogger.com/atom/ns#' term='neural networks'/><category scheme='http://www.blogger.com/atom/ns#' term='vision'/><category scheme='http://www.blogger.com/atom/ns#' term='hand gesture'/><title type='text'>Taiwan sign language (TSL) recognition based on 3D data and neural networks</title><content type='html'>&lt;span style="font-weight: bold;"&gt;Summary:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Lee and Tsai implemented a vision-based hand gesture recognition system to classify 20 hand TSL signs.  The system used hand features based on visual distances, and 8 reflective markers were placed on the hand to assist in these readings.  The features are then sent into a back-propogation neural network (BPNN) that had 15 features as inputs and the 20 gesture probabilities as outputs.&lt;br /&gt;&lt;br /&gt;The features used include the distances between a wrist point and the finger tips, and the distances between each finger pair (spread).&lt;br /&gt;&lt;br /&gt;10 students tested the system and produced 2788 gestures, of which half went to training and the other half to testing.  The authors tested on neural networks with 2 hidden layers varying in size from 25 x 25 to 250 x 250.  The best results were with the BPNN with 250 x 250 hidden nodes, with a testing accuracy of 94.65%.  Two gestures were heavily confused because the only difference was the length of the finger shown (i.e., the fingers were bent in one gesture).&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Discussion:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;This was a pretty decent use of neural nets, and I'm glad that they gave the results at different hidden layers and the recognition rates for each gesture.  In fact, now that I think about it, I'm just glad they gave results.  These are definitely the best results I've seen and quite promising: one of their main issues was a good feature to distinguish between bent fingers and non-bent fingers.&lt;br /&gt;&lt;br /&gt;The differences between 150x150 and 250x250 are statistically insignificant, but they might be more significant when more gestures are added.  I especially like that there is little discrepancy between training and testing sets, which hopefully indicates that their approach works for the general user.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3022028357504155915-7513651492683413587?l=awolin.blogspot.com'/&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://awolin.blogspot.com/feeds/7513651492683413587/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='https://www.blogger.com/comment.g?blogID=3022028357504155915&amp;postID=7513651492683413587' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/7513651492683413587'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3022028357504155915/posts/default/7513651492683413587'/><link rel='alternate' type='text/html' href='http://awolin.blogspot.com/2008/03/taiwan-sign-language-tsl-recognition.html' title='Taiwan sign language (TSL) recognition based on 3D data and neural networks'/><author><name>Aaron Wolin</name><uri>http://www.blogger.com/profile/11326805313302464034</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='04526254782774441681'/></author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>2</thr:total></entry></feed>