<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Shelley disagrees with and dislikes &#8220;Miscellaneous&#8221;</title>
	<atom:link href="http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/</link>
	<description>About David Weinberger's book (May, 2007) and how we're pulling ourselves together now that we've blown ourselves to bits</description>
	<lastBuildDate>Fri, 12 Mar 2010 07:37:30 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: roy belmont</title>
		<link>http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/comment-page-1/#comment-5819</link>
		<dc:creator>roy belmont</dc:creator>
		<pubDate>Sat, 19 May 2007 10:40:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/#comment-5819</guid>
		<description>Brian H thanks that was really groovy. Whether there&#039;s something more imprtant at stake or not I think we should act like there is. because there probably is even if we can&#039;t tell right now.
JohnE it would also imply that said African when he was hearing and comprehending speech before his literacy breakthrough was hearing and comprehending something that was not made of words. Kind of how the day isn&#039;t really divided into 24 equal parts, and never was.</description>
		<content:encoded><![CDATA[<p>Brian H thanks that was really groovy. Whether there&#8217;s something more imprtant at stake or not I think we should act like there is. because there probably is even if we can&#8217;t tell right now.<br />
JohnE it would also imply that said African when he was hearing and comprehending speech before his literacy breakthrough was hearing and comprehending something that was not made of words. Kind of how the day isn&#8217;t really divided into 24 equal parts, and never was.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: johne</title>
		<link>http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/comment-page-1/#comment-5062</link>
		<dc:creator>johne</dc:creator>
		<pubDate>Mon, 14 May 2007 21:31:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/#comment-5062</guid>
		<description>Marshall McLuhan, somewhere, quotes the memoirs of an African who passed an illiterate childhood in a village society, and began school only when he was old enough to reflect on what was happening.  He said, if I remember correctly, that once he realized that the ocean of meaning that flowed toward him when someone opened their mouth could be thought of as divided into individual words, each one expressing an idea, and each usable in a different context, literacy was a snap.

That would imply that the very concept of words, and hence spaces, definitely falls in the realm of metadata.</description>
		<content:encoded><![CDATA[<p>Marshall McLuhan, somewhere, quotes the memoirs of an African who passed an illiterate childhood in a village society, and began school only when he was old enough to reflect on what was happening.  He said, if I remember correctly, that once he realized that the ocean of meaning that flowed toward him when someone opened their mouth could be thought of as divided into individual words, each one expressing an idea, and each usable in a different context, literacy was a snap.</p>
<p>That would imply that the very concept of words, and hence spaces, definitely falls in the realm of metadata.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brian H</title>
		<link>http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/comment-page-1/#comment-4795</link>
		<dc:creator>Brian H</dc:creator>
		<pubDate>Sun, 13 May 2007 16:46:11 +0000</pubDate>
		<guid isPermaLink="false">http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/#comment-4795</guid>
		<description>One, or maybe the primary, benefits of written spacing is speed of absorption/reading.  It is possible to absorb printed text much faster than it can be coherently spoken, especially if you&#039;ve been give a bit of speed-reading training, and the space-format-metadata clues help by reducing the decoding effort.  But there are mysteries about how we order information and data.  For example, in the standard intro/promo session for the Evelyn Wood speed reading course, one is shown a series of flip cards, riffled quickly to show that we pick up on the meaning of a series of words forming a sentence quite quickly.  And then comes the kicker: the stacks of cards are shown slowly, in the order &quot;riffled&quot;, and the last few sentences are in garbled or random order.  But were perceived and read as though properly sequenced.  

The clear implication is that there is a rather larger &quot;comprehension buffer&quot;  in the brain than we are used to considering, and that meaning and structure are derived/comprehended/imposed in a far more active and contextually integrated form than we naively assume.  To continue with the E.W. example, the extension of the point is that with practice and trust in our background sorting and meaning-assignment capacities it is possible to scan pages &quot;slashing&quot; diagonally across chunks of paragraphs, whole paragraphs, or even pages and yet perceive and absorb what is written.  And then the demonstration concludes with a some &quot;speed-reading&quot; of poetry -- which turns out to be reading aloud, with feeling.  There is SO much of the weight of communication borne in poetry by the cadences, rhymes, and patterned reflections of image that it would be almost pointless to try and &quot;grok at  a glance&quot;.  ;) 

So spaces HELP with perception of content.  Metadata organizes semantically.  Perhaps you could expand your argument to encompass capitalization conventions, many of which r txtg yth r gvg up ttlly.  No space at the inn on a tiny cell phone screen for all the convenient clues that foolscap, whether material  or electronic, allows.

Speaking of conventions, agreed-upon spelling and syntax have much to do with comprehension, too; having spent years editing articles written for the Web, I know that many disdain (in error, IMNSHO) to follow them.  Pointing out errors and typos in, e.g. comments sections like this (&quot;well night unto impossible&quot; s/b &quot;well-nigh impossible&quot;; &quot;pauses are genrally there&quot; s/b &quot;pauses are generally there&quot;; &quot;imminent doom. . .&quot; s/b &quot;imminent doom ...&quot;) get one excoriated as a &quot;spelling-and-grammar Nazi!&quot;.  Speed of comprehension, though, has a lot to do with such pickiness; decoding misspells and malaprops takes time and effort and is distracting.  

So are formalisms and formatting metadata?  Is it simply a matter of preference about whether the definition should be stretched to include them?  Or is there something more important at stake?  Maybe there&#039;s a special category of metadata which relates specifically to internal consistency and comprehensibility, but is otherwise &quot;neutral&quot;.</description>
		<content:encoded><![CDATA[<p>One, or maybe the primary, benefits of written spacing is speed of absorption/reading.  It is possible to absorb printed text much faster than it can be coherently spoken, especially if you&#8217;ve been give a bit of speed-reading training, and the space-format-metadata clues help by reducing the decoding effort.  But there are mysteries about how we order information and data.  For example, in the standard intro/promo session for the Evelyn Wood speed reading course, one is shown a series of flip cards, riffled quickly to show that we pick up on the meaning of a series of words forming a sentence quite quickly.  And then comes the kicker: the stacks of cards are shown slowly, in the order &#8220;riffled&#8221;, and the last few sentences are in garbled or random order.  But were perceived and read as though properly sequenced.  </p>
<p>The clear implication is that there is a rather larger &#8220;comprehension buffer&#8221;  in the brain than we are used to considering, and that meaning and structure are derived/comprehended/imposed in a far more active and contextually integrated form than we naively assume.  To continue with the E.W. example, the extension of the point is that with practice and trust in our background sorting and meaning-assignment capacities it is possible to scan pages &#8220;slashing&#8221; diagonally across chunks of paragraphs, whole paragraphs, or even pages and yet perceive and absorb what is written.  And then the demonstration concludes with a some &#8220;speed-reading&#8221; of poetry &#8212; which turns out to be reading aloud, with feeling.  There is SO much of the weight of communication borne in poetry by the cadences, rhymes, and patterned reflections of image that it would be almost pointless to try and &#8220;grok at  a glance&#8221;.  ;) </p>
<p>So spaces HELP with perception of content.  Metadata organizes semantically.  Perhaps you could expand your argument to encompass capitalization conventions, many of which r txtg yth r gvg up ttlly.  No space at the inn on a tiny cell phone screen for all the convenient clues that foolscap, whether material  or electronic, allows.</p>
<p>Speaking of conventions, agreed-upon spelling and syntax have much to do with comprehension, too; having spent years editing articles written for the Web, I know that many disdain (in error, IMNSHO) to follow them.  Pointing out errors and typos in, e.g. comments sections like this (&#8220;well night unto impossible&#8221; s/b &#8220;well-nigh impossible&#8221;; &#8220;pauses are genrally there&#8221; s/b &#8220;pauses are generally there&#8221;; &#8220;imminent doom. . .&#8221; s/b &#8220;imminent doom &#8230;&#8221;) get one excoriated as a &#8220;spelling-and-grammar Nazi!&#8221;.  Speed of comprehension, though, has a lot to do with such pickiness; decoding misspells and malaprops takes time and effort and is distracting.  </p>
<p>So are formalisms and formatting metadata?  Is it simply a matter of preference about whether the definition should be stretched to include them?  Or is there something more important at stake?  Maybe there&#8217;s a special category of metadata which relates specifically to internal consistency and comprehensibility, but is otherwise &#8220;neutral&#8221;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: roy belmont</title>
		<link>http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/comment-page-1/#comment-4087</link>
		<dc:creator>roy belmont</dc:creator>
		<pubDate>Thu, 10 May 2007 00:59:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/#comment-4087</guid>
		<description>The space-between-words has clear lines forming its sides - it&#039;s the lines of the letters in the words themselves that mark it. But above and below are only spaces with indeterminate edges, the lines between the lines, and again out at the margins of the page those line-data spaces become something else again. White space is cohesive, unbroken, all through the text unless a line gets made out to the edges of the page, as it almost never does. We revere the margins in this way.
It&#039;s sort of a concrete abstraction, that.
&lt;i&gt;&quot;They are actual characters, they have their own key&quot;&lt;/i&gt;
Yes. But the &quot;Delete&quot; key, the &quot;COPY&quot; and &quot;PASTE&quot; commands own the whole document in that sense don&#039;t they?
It&#039;s where that space-between-words character meets its upper and lower &quot;other&quot; - the space-between-lines - that seems interesting in this way, as meaning imputed to the space-between-words character disappears...where? Or when? At the parallel lines of the capitals? Top of the &quot;I&quot; is top of the space?
Without being facetious - it&#039;s all there, and on out to the far edge of the page.
Like a Mercator projection, it&#039;s not accurate, but it serves well enough to get the job done. We use the written word to map the spoken, or did until whole specialized demographics became so familiar with it they could be said to have inhabited the maps themselves.
In speech the outside margins of the info are the world around the spoken words - everything that is not-spoken - it&#039;s cylindrical more than linear, yes?
The little gaps and elisions of spoken language take place, when they do, within a larger space they blur seamlessly into as well, though we supply the linear flow, by hearing it. Which is to say in order to have a discussion about the spaces between words we have to pretend those spaces aren&#039;t part and parcel of the spaces-between-lines which are themselves the margin-stuff and the blank foolscap all one.</description>
		<content:encoded><![CDATA[<p>The space-between-words has clear lines forming its sides &#8211; it&#8217;s the lines of the letters in the words themselves that mark it. But above and below are only spaces with indeterminate edges, the lines between the lines, and again out at the margins of the page those line-data spaces become something else again. White space is cohesive, unbroken, all through the text unless a line gets made out to the edges of the page, as it almost never does. We revere the margins in this way.<br />
It&#8217;s sort of a concrete abstraction, that.<br />
<i>&#8220;They are actual characters, they have their own key&#8221;</i><br />
Yes. But the &#8220;Delete&#8221; key, the &#8220;COPY&#8221; and &#8220;PASTE&#8221; commands own the whole document in that sense don&#8217;t they?<br />
It&#8217;s where that space-between-words character meets its upper and lower &#8220;other&#8221; &#8211; the space-between-lines &#8211; that seems interesting in this way, as meaning imputed to the space-between-words character disappears&#8230;where? Or when? At the parallel lines of the capitals? Top of the &#8220;I&#8221; is top of the space?<br />
Without being facetious &#8211; it&#8217;s all there, and on out to the far edge of the page.<br />
Like a Mercator projection, it&#8217;s not accurate, but it serves well enough to get the job done. We use the written word to map the spoken, or did until whole specialized demographics became so familiar with it they could be said to have inhabited the maps themselves.<br />
In speech the outside margins of the info are the world around the spoken words &#8211; everything that is not-spoken &#8211; it&#8217;s cylindrical more than linear, yes?<br />
The little gaps and elisions of spoken language take place, when they do, within a larger space they blur seamlessly into as well, though we supply the linear flow, by hearing it. Which is to say in order to have a discussion about the spaces between words we have to pretend those spaces aren&#8217;t part and parcel of the spaces-between-lines which are themselves the margin-stuff and the blank foolscap all one.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Shelley</title>
		<link>http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/comment-page-1/#comment-3895</link>
		<dc:creator>Shelley</dc:creator>
		<pubDate>Wed, 09 May 2007 00:35:45 +0000</pubDate>
		<guid isPermaLink="false">http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/#comment-3895</guid>
		<description>For anything to be meaningful, there&#039;s must be some value with attaching a formalized concept of &#039;metadata&#039;, same for attaching a value of &#039;data&#039;.

A word by itself has meaning--the dictionary attests to that. But words as data--depends on the context. The title of your book consists of words, and they form data. The metadata is &#039;title&#039; or even &#039;book title&#039;. This may seem an uptight statement to make to some, or overly simplistic to others, but it&#039;s a very tangible example of data and metadata, as well as words as &#039;data&#039;. 

But red

Yellow apple rain not not worm peach. Brick fog chip! 

Very real words. Each has meaning. In this context and how they&#039;re used? Worthless. So are they data? Only to the person who really does count the cracks in the sidewalk. 

Returning to the space. I know that in your book, David, you used space to represent how even absence can be metadata, and used spaces between words as example. But spaces are not &#039;nothing&#039;, nor are they an absence of something. They are actual characters, they have their own key on the keyboard, their own width based on font, they take up real estate on a page. They don&#039;t have a visible presence, but they &#039;something&#039;. 

Now, are the metadata? Not really. They are a syntactic element, just like the alphabet that is used to form what can end up being data or even metadata. But they don&#039;t really describe anything, no more than the letter &#039;b&#039; describes anything. Well, unless we&#039;re watching Sesame Street.

Even if they were, they&#039;re not inherently a part of a word -- they&#039;re a part of the sentence, phrase, term, what have you. A word can be terminated with any number of characters, or even nothing at all. It&#039;s not dependent on a space for its termination. It&#039;s the sentence, phrase, or term that is counting on its existence.

Thisisaperfectlygooduseofwordsbutnotagreatsentence. 

This/is/a/perfectly/good/use/of/words/but/not/a/great/sentence.

Now, when is the absence of something &#039;metadata&#039;? &quot;When&#039;s the wedding?&quot; &quot;Oh, we haven&#039;t set the date yet.&quot;

But then, I&#039;m an uptight Semantic Web nerd ;-)</description>
		<content:encoded><![CDATA[<p>For anything to be meaningful, there&#8217;s must be some value with attaching a formalized concept of &#8216;metadata&#8217;, same for attaching a value of &#8216;data&#8217;.</p>
<p>A word by itself has meaning&#8211;the dictionary attests to that. But words as data&#8211;depends on the context. The title of your book consists of words, and they form data. The metadata is &#8216;title&#8217; or even &#8216;book title&#8217;. This may seem an uptight statement to make to some, or overly simplistic to others, but it&#8217;s a very tangible example of data and metadata, as well as words as &#8216;data&#8217;. </p>
<p>But red</p>
<p>Yellow apple rain not not worm peach. Brick fog chip! </p>
<p>Very real words. Each has meaning. In this context and how they&#8217;re used? Worthless. So are they data? Only to the person who really does count the cracks in the sidewalk. </p>
<p>Returning to the space. I know that in your book, David, you used space to represent how even absence can be metadata, and used spaces between words as example. But spaces are not &#8216;nothing&#8217;, nor are they an absence of something. They are actual characters, they have their own key on the keyboard, their own width based on font, they take up real estate on a page. They don&#8217;t have a visible presence, but they &#8217;something&#8217;. </p>
<p>Now, are the metadata? Not really. They are a syntactic element, just like the alphabet that is used to form what can end up being data or even metadata. But they don&#8217;t really describe anything, no more than the letter &#8216;b&#8217; describes anything. Well, unless we&#8217;re watching Sesame Street.</p>
<p>Even if they were, they&#8217;re not inherently a part of a word &#8212; they&#8217;re a part of the sentence, phrase, term, what have you. A word can be terminated with any number of characters, or even nothing at all. It&#8217;s not dependent on a space for its termination. It&#8217;s the sentence, phrase, or term that is counting on its existence.</p>
<p>Thisisaperfectlygooduseofwordsbutnotagreatsentence. </p>
<p>This/is/a/perfectly/good/use/of/words/but/not/a/great/sentence.</p>
<p>Now, when is the absence of something &#8216;metadata&#8217;? &#8220;When&#8217;s the wedding?&#8221; &#8220;Oh, we haven&#8217;t set the date yet.&#8221;</p>
<p>But then, I&#8217;m an uptight Semantic Web nerd ;-)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael R. Bernstein</title>
		<link>http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/comment-page-1/#comment-3855</link>
		<dc:creator>Michael R. Bernstein</dc:creator>
		<pubDate>Tue, 08 May 2007 20:42:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/#comment-3855</guid>
		<description>AKMA, the pauses are genrally there for languages that need them to the degree that they are needed (which can be highly variable). Note that German, for example, has words structures  that are more regular and agglutinative as a matter of course, and consequently more run-on words in print.

Also note that a speaker&#039;s pauses get more distinct when they assume the listener is less fluent, and that pauses are less necessary when vocabulary is constrained (so that words are more easily distinguished as unique utterances).

This is far from a hard-and-fast rule, just a tendency, and there are other forms of redundancy like the use of &#039;a&#039; vs. &#039;an&#039; or even regional dialects (both pronunciation and idiom) that reduce the need for pauses in many cases, but of course printed word boundaries for a given language are less susceptible to variation (precisely in order to make them more comprehensible across time and space in the same way that more distinct pauses do).

David, on top of the foregoing, pauses and stress can of course also convey metadata (the pregnant pause, sarcasm, the stress conveying a double meaning, the change in tone indicating a digression), but note that these are generally conveyed in print in other ways, if at all.

In those cases (commas, parentheses, the ellipsis) and others, we usually *are* talking about metadata (because it adds to the literal meaning of the words), but a literal interpretation of the lowly space in-and-of-itself is just that it&#039;s a simple delimiter and a data format detail (and, if I recall correctly, bullets were tried first).

It is interesting to note that the affordances of the textual equivalent may not match those of the spoken utterance (this is, in part, why you don&#039;t usually notice the micro pauses consciously). For example, few people can manage to convey by tone alone more than one level of digression, whereas nested parenthetical remarks run rampant in print. This is also why we need commas to indicate deliberate, meaningful pauses, rather than relying on double or triple spaces in print (which is the &#039;obvious&#039; solution).</description>
		<content:encoded><![CDATA[<p>AKMA, the pauses are genrally there for languages that need them to the degree that they are needed (which can be highly variable). Note that German, for example, has words structures  that are more regular and agglutinative as a matter of course, and consequently more run-on words in print.</p>
<p>Also note that a speaker&#8217;s pauses get more distinct when they assume the listener is less fluent, and that pauses are less necessary when vocabulary is constrained (so that words are more easily distinguished as unique utterances).</p>
<p>This is far from a hard-and-fast rule, just a tendency, and there are other forms of redundancy like the use of &#8216;a&#8217; vs. &#8216;an&#8217; or even regional dialects (both pronunciation and idiom) that reduce the need for pauses in many cases, but of course printed word boundaries for a given language are less susceptible to variation (precisely in order to make them more comprehensible across time and space in the same way that more distinct pauses do).</p>
<p>David, on top of the foregoing, pauses and stress can of course also convey metadata (the pregnant pause, sarcasm, the stress conveying a double meaning, the change in tone indicating a digression), but note that these are generally conveyed in print in other ways, if at all.</p>
<p>In those cases (commas, parentheses, the ellipsis) and others, we usually *are* talking about metadata (because it adds to the literal meaning of the words), but a literal interpretation of the lowly space in-and-of-itself is just that it&#8217;s a simple delimiter and a data format detail (and, if I recall correctly, bullets were tried first).</p>
<p>It is interesting to note that the affordances of the textual equivalent may not match those of the spoken utterance (this is, in part, why you don&#8217;t usually notice the micro pauses consciously). For example, few people can manage to convey by tone alone more than one level of digression, whereas nested parenthetical remarks run rampant in print. This is also why we need commas to indicate deliberate, meaningful pauses, rather than relying on double or triple spaces in print (which is the &#8216;obvious&#8217; solution).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jay Fienberg</title>
		<link>http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/comment-page-1/#comment-3816</link>
		<dc:creator>Jay Fienberg</dc:creator>
		<pubDate>Tue, 08 May 2007 16:45:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/#comment-3816</guid>
		<description>&quot;Jay, I’m surprised you don’t count words as data. Would you count words as information?&quot;

Yes, but that&#039;s not a property of the words as much as it&#039;s a property of we humans always having or creating a sense of persistent context for words. 

(Or, we&#039;re not recognizing the characters as words, and just talking about the characters as information.)

For example, someone hands you a piece of paper with three letters: &quot;die.&quot; That&#039;s already a bit of a context. If you&#039;re playing a German word game, &quot;die&quot; is the definite article (&quot;the&quot; in English). If you&#039;re a character in horror movie in English, it brings bad tidings of you imminent doom. . .

So, until writing all of that, I felt like there was no similarly persistent context / contextualizing in which we recognize words as data. I was just thinking there are specific contexts (of which, of course, there could be many). For example, &quot;die,&quot; which represents a number of points in Scrabble, is data in the context of Scrabble scoring. 

But, now I see that there is at least one persistent context / contextualizing in which we recognize words as data: the &quot;word count&quot; context. For example, this is turning into a really long comment, which means it has a lot or words: the individual words are the data of this word count context.

To get the spaces = metadata, in this context, I&#039;d suggest that a word count includes a list of characters that act as delimiters or non-word characters. That items in that list, including the space character, are data: it&#039;s not the word data, but it&#039;s a constraint on the word data, and therefore, in this context, it could rightly be called metadata.</description>
		<content:encoded><![CDATA[<p>&#8220;Jay, I’m surprised you don’t count words as data. Would you count words as information?&#8221;</p>
<p>Yes, but that&#8217;s not a property of the words as much as it&#8217;s a property of we humans always having or creating a sense of persistent context for words. </p>
<p>(Or, we&#8217;re not recognizing the characters as words, and just talking about the characters as information.)</p>
<p>For example, someone hands you a piece of paper with three letters: &#8220;die.&#8221; That&#8217;s already a bit of a context. If you&#8217;re playing a German word game, &#8220;die&#8221; is the definite article (&#8220;the&#8221; in English). If you&#8217;re a character in horror movie in English, it brings bad tidings of you imminent doom. . .</p>
<p>So, until writing all of that, I felt like there was no similarly persistent context / contextualizing in which we recognize words as data. I was just thinking there are specific contexts (of which, of course, there could be many). For example, &#8220;die,&#8221; which represents a number of points in Scrabble, is data in the context of Scrabble scoring. </p>
<p>But, now I see that there is at least one persistent context / contextualizing in which we recognize words as data: the &#8220;word count&#8221; context. For example, this is turning into a really long comment, which means it has a lot or words: the individual words are the data of this word count context.</p>
<p>To get the spaces = metadata, in this context, I&#8217;d suggest that a word count includes a list of characters that act as delimiters or non-word characters. That items in that list, including the space character, are data: it&#8217;s not the word data, but it&#8217;s a constraint on the word data, and therefore, in this context, it could rightly be called metadata.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: AKMA</title>
		<link>http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/comment-page-1/#comment-3808</link>
		<dc:creator>AKMA</dc:creator>
		<pubDate>Tue, 08 May 2007 15:36:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/#comment-3808</guid>
		<description>&quot;Spoken speech has tiny pauses between words&quot; &#8212; I&#039;m surprised too hear that; I&#8217;d be interested in learning about research that demonstrated such a thing. I can hear distinct words easily in spoken English (usually), when concentrating in spoken French (usually), with great effort in spoken German (sometimes), and only occasionally in spoken Spanish. Are the pauses &quot;there,&quot; or are they an effect of the relative fluency with which I perceive and process the words?</description>
		<content:encoded><![CDATA[<p>&#8220;Spoken speech has tiny pauses between words&#8221; &#8212; I&#8217;m surprised too hear that; I&#8217;d be interested in learning about research that demonstrated such a thing. I can hear distinct words easily in spoken English (usually), when concentrating in spoken French (usually), with great effort in spoken German (sometimes), and only occasionally in spoken Spanish. Are the pauses &#8220;there,&#8221; or are they an effect of the relative fluency with which I perceive and process the words?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Weinberger</title>
		<link>http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/comment-page-1/#comment-3790</link>
		<dc:creator>David Weinberger</dc:creator>
		<pubDate>Tue, 08 May 2007 12:13:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/#comment-3790</guid>
		<description>Jay, I&#039;m surprised you don&#039;t count words as data. Would you count words as information? 

Michael, I take the stress changes and pauses as metadata that give us important information about the data (the words).</description>
		<content:encoded><![CDATA[<p>Jay, I&#8217;m surprised you don&#8217;t count words as data. Would you count words as information? </p>
<p>Michael, I take the stress changes and pauses as metadata that give us important information about the data (the words).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael R. Bernstein</title>
		<link>http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/comment-page-1/#comment-3732</link>
		<dc:creator>Michael R. Bernstein</dc:creator>
		<pubDate>Tue, 08 May 2007 02:00:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.everythingismiscellaneous.com/2007/05/07/shelley-disagrees-with-and-dislikes-miscellaneous/#comment-3732</guid>
		<description>Spoken speech has tiny pauses between words and stress changes within words that help to denote their beginning and end, though we don&#039;t notice them in the normal course of things. Try listening to someone speaking in a monotone with no pauses to see how much it these subtle changes (and the larger pauses at the end of sentences) aid comprehension.

Running the written text together loses that, making it more difficult (though not impossible) to decode the text back into words.

So, while the spaces are data, it is just redundant data, like a checksum. With them, decoding the text stream back into words is unambiguous.

No one thinks of checksums as metadata, it is just a feature of a data *format*, and of no more semantic significance than that.</description>
		<content:encoded><![CDATA[<p>Spoken speech has tiny pauses between words and stress changes within words that help to denote their beginning and end, though we don&#8217;t notice them in the normal course of things. Try listening to someone speaking in a monotone with no pauses to see how much it these subtle changes (and the larger pauses at the end of sentences) aid comprehension.</p>
<p>Running the written text together loses that, making it more difficult (though not impossible) to decode the text back into words.</p>
<p>So, while the spaces are data, it is just redundant data, like a checksum. With them, decoding the text stream back into words is unambiguous.</p>
<p>No one thinks of checksums as metadata, it is just a feature of a data *format*, and of no more semantic significance than that.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
