<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Cleaning up scanned text &#8211; revisited</title>
	<atom:link href="http://blogs.mathworks.com/steve/2007/12/21/cleaning-up-scanned-text-revisited/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.mathworks.com/steve/2007/12/21/cleaning-up-scanned-text-revisited/</link>
	<description>Steve Eddins manages the Image &#38; Geospatial development team at The MathWorks and coauthored Digital Image Processing Using MATLAB. He writes here about image processing concepts, algorithm implementations, and MATLAB.</description>
	<lastBuildDate>Sat, 11 Feb 2012 18:27:50 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: Steve</title>
		<link>http://blogs.mathworks.com/steve/2007/12/21/cleaning-up-scanned-text-revisited/#comment-18960</link>
		<dc:creator>Steve</dc:creator>
		<pubDate>Mon, 21 Jan 2008 13:45:10 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.mathworks.com/steve/2007/12/21/cleaning-up-scanned-text-revisited/#comment-18960</guid>
		<description>Greg&#8212;Thanks for your suggestions.</description>
		<content:encoded><![CDATA[<p>Greg&mdash;Thanks for your suggestions.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Greg</title>
		<link>http://blogs.mathworks.com/steve/2007/12/21/cleaning-up-scanned-text-revisited/#comment-18660</link>
		<dc:creator>Greg</dc:creator>
		<pubDate>Tue, 15 Jan 2008 06:29:37 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.mathworks.com/steve/2007/12/21/cleaning-up-scanned-text-revisited/#comment-18660</guid>
		<description>Look up the work on reconstructing FAX images which essentially do the following:
1) Break the fax into glyphs (i.e, each character)
2) Find the glyphs that nearly match each other
3) Combine glyphs that nearly match each other (a simple naive method is to average the glyphs and then threshold the result)
4) Put the glyphs back on the page in their original location

This is quite similar to the lossy JBIG2 compression algorithm.

Another approach is to use a vertical median filter where each pixel in an X pixel (usually 5 or 7) vertical line is set to the median value of the X pixels.  This preserves vertical lines and fills in 1 pixel gaps.  Follow that with a horizontal median filter to fill in gaps in horizontal lines.</description>
		<content:encoded><![CDATA[<p>Look up the work on reconstructing FAX images which essentially do the following:<br />
1) Break the fax into glyphs (i.e, each character)<br />
2) Find the glyphs that nearly match each other<br />
3) Combine glyphs that nearly match each other (a simple naive method is to average the glyphs and then threshold the result)<br />
4) Put the glyphs back on the page in their original location</p>
<p>This is quite similar to the lossy JBIG2 compression algorithm.</p>
<p>Another approach is to use a vertical median filter where each pixel in an X pixel (usually 5 or 7) vertical line is set to the median value of the X pixels.  This preserves vertical lines and fills in 1 pixel gaps.  Follow that with a horizontal median filter to fill in gaps in horizontal lines.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve</title>
		<link>http://blogs.mathworks.com/steve/2007/12/21/cleaning-up-scanned-text-revisited/#comment-17578</link>
		<dc:creator>Steve</dc:creator>
		<pubDate>Wed, 26 Dec 2007 14:40:28 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.mathworks.com/steve/2007/12/21/cleaning-up-scanned-text-revisited/#comment-17578</guid>
		<description>Ian&#8212;Use &lt;tt&gt;hold on&lt;/tt&gt; and &lt;tt&gt;plot&lt;/tt&gt; to superimpose lines on top of an image.  See &lt;a href=&quot;http://blogs.mathworks.com/steve/2007/01/01/superimposing-line-plots/&quot; rel=&quot;nofollow&quot;&gt;this post&lt;/a&gt; for examples.</description>
		<content:encoded><![CDATA[<p>Ian&mdash;Use <tt>hold on</tt> and <tt>plot</tt> to superimpose lines on top of an image.  See <a href="http://blogs.mathworks.com/steve/2007/01/01/superimposing-line-plots/" rel="nofollow">this post</a> for examples.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ian</title>
		<link>http://blogs.mathworks.com/steve/2007/12/21/cleaning-up-scanned-text-revisited/#comment-17344</link>
		<dc:creator>ian</dc:creator>
		<pubDate>Sun, 23 Dec 2007 06:41:57 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.mathworks.com/steve/2007/12/21/cleaning-up-scanned-text-revisited/#comment-17344</guid>
		<description>Thanks for the block.Hopefully I will be doing  OCR if all goes well. Sorry to use thiss forum but I have spend 4 hours trying to figure out a solution.Checking online and trying in matlab. Am trying to a rectangular around every labelled component in the image.However I can&#039;t figure out how to do this using BoundingBox the documentation gives a brief mention. Pointing me the right direction would help. I tried following the syntax of adding a centroid to the image but even that was failure.</description>
		<content:encoded><![CDATA[<p>Thanks for the block.Hopefully I will be doing  OCR if all goes well. Sorry to use thiss forum but I have spend 4 hours trying to figure out a solution.Checking online and trying in matlab. Am trying to a rectangular around every labelled component in the image.However I can&#8217;t figure out how to do this using BoundingBox the documentation gives a brief mention. Pointing me the right direction would help. I tried following the syntax of adding a centroid to the image but even that was failure.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

