<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Justice Solutions Web &#38; Graphics &#187; Data Mining</title>
	<atom:link href="http://justicesolutionsllc.com/category/data-mining/feed/" rel="self" type="application/rss+xml" />
	<link>http://justicesolutionsllc.com</link>
	<description>THE SUPERHEROES OF WEB DESIGN, DEVELOPMENT &#38; MARKETING</description>
	<lastBuildDate>Tue, 01 May 2012 02:52:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Data Mining for Not Quite Dummies</title>
		<link>http://justicesolutionsllc.com/data-mining-for-not-quite-dummies/</link>
		<comments>http://justicesolutionsllc.com/data-mining-for-not-quite-dummies/#comments</comments>
		<pubDate>Tue, 01 May 2012 02:52:25 +0000</pubDate>
		<dc:creator>Doug</dc:creator>
				<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Screen Scraping]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[mozenda]]></category>
		<category><![CDATA[Needlebase]]></category>
		<category><![CDATA[screen scraping software]]></category>
		<category><![CDATA[selenium]]></category>
		<category><![CDATA[seleniumHQ]]></category>

		<guid isPermaLink="false">http://justicesolutionsllc.com/?p=685</guid>
		<description><![CDATA[This will certainly be an ongoing series, so definitely stay tuned, but I have the unique task of finding some really good data mining software for a client vs. trying to code it myself since these days it makes far more sense to follow the basic steps of computing software needs: Is the software the [...]]]></description>
			<content:encoded><![CDATA[<p>This will certainly be an ongoing series, so definitely stay tuned, but I have the unique task of finding some really good data mining software for a client vs. trying to code it myself since these days it makes far more sense to follow the basic steps of computing software needs:</p>
<ol>
<li>Is the software the company needs already created and will sufficiently do the tasks they need at an affordable cost to the client/company?</li>
<li>If there&#8217;s no software that&#8217;s a 100% fit either with affordability or satisfaction of the functionality needed, can one be purchased and perhaps customized?</li>
<li>If the customization of existing software (either open source or purchasing a developer&#8217;s license from a pre-built software company) isn&#8217;t available, can it be coded by taking pieces of opensource code and either pairing it together with more open source code or with custom code?</li>
<li>If all else fails, break out the checkbook because it&#8217;s going to cost you.</li>
</ol>
<p>Now this is extremely simplified, but it is a basic 4 steps of why you should or shouldn&#8217;t be developing your own software for your company&#8217;s needs.  There&#8217;s other factors not mentioned here such as, possible retail licensing of your software, partnership revenues, etc.  but you get the idea.</p>
<p>So back to the issue&#8230;data mining software.  It&#8217;s a nightmare to code and quite honestly there&#8217;s not a whole lot of good ones out there when it comes right down to it.  Even as a coder, I find the &#8220;coding&#8221; versions of some of these systems way too complicated than they need to be, and hence not worth my or my clients&#8217; time.</p>
<p>I&#8217;ve compiled some thoughts on the following systems, please feel free to comment on your own experiences or if you&#8217;ve tried something better&#8230;.I&#8217;m all ears.</p>
<h2><a href="http://justicesolutionsllc.com/data-mining-for-not-quite-dummies/screen-shot-2012-04-30-at-10-29-02-pm/" rel="attachment wp-att-686"><img class="alignleft size-medium wp-image-686" title="Screen shot 2012-04-30 at 10.29.02 PM" src="http://justicesolutionsllc.com/wp-content/uploads/2012/05/Screen-shot-2012-04-30-at-10.29.02-PM-300x211.png" alt="" width="300" height="211" /></a><a href="http://seleniumhq.org/">Selenium HQ</a></h2>
<p>Not bad software and as a matter of fact if you like Firefox plugins, this one may work for you.  The free price tag on it is equally appealing for certain.  I actually used this software before in its earlier stages, and not much has changed except they have added a full coding module to the system so you can basically run it from a number of coding systems such as asp.NET, PHP, Java, etc. which makes it very appealing to the coder in me, but honestly when it comes to data mining, who wants to spend that amount of time coding something that most likely will change the next time a site you are gathering data from, changes it&#8217;s layout or format.  Sorry Selenium&#8230;.not interested.</p>
<h2></h2>
<h2></h2>
<h2></h2>
<h2></h2>
<h2><a href="http://justicesolutionsllc.com/data-mining-for-not-quite-dummies/screen-shot-2012-04-30-at-10-23-25-pm/" rel="attachment wp-att-687"><img class="alignleft size-medium wp-image-687" title="Screen shot 2012-04-30 at 10.23.25 PM" src="http://justicesolutionsllc.com/wp-content/uploads/2012/05/Screen-shot-2012-04-30-at-10.23.25-PM-300x211.png" alt="" width="300" height="211" /></a><a href="http://mozenda.com/">Mozenda</a></h2>
<p>Now this company has the right idea when it comes to revenue generating models that work.  This site has both a web client and desktop client (Windows Only right now&#8230;grrr) but has a rather snazzy looking interface and really does a heck of a job with giving you free pages (that&#8217;s how it tracks your usage/cost) just by going through their pretty comprehensive, but not time consuming tutorials.  I really like the way the data repeaters know that you are looking for results data on this page, but when you click on the title of the product or the image, you can then tell it that you are now going to a product detail page and to continue to grab data for that same row.  It&#8217;s pretty intuitive, quick and the free version which gives you about 5,000 pages to churn through, it&#8217;s not bad.</p>
<p>Drawbacks are of course the cost.  It can get really pricey for companies who need a lot of data on a daily basis, and their engine for the desktop ran a bit slow at times which was a bit annoying.  Overall, I think this product is perfect for a company who has a small to medium sized budget for data acquisition and has a computer savvy person who would be driving the Mozenda engine, but definitely is more advanced than just your basic MS Office computing user.</p>
<h2></h2>
<h2></h2>
<h2><a href="http://justicesolutionsllc.com/data-mining-for-not-quite-dummies/screen-shot-2012-04-30-at-10-29-02-pm/" rel="attachment wp-att-686"><img class="alignleft size-medium wp-image-686" title="Screen shot 2012-04-30 at 10.29.02 PM" src="http://justicesolutionsllc.com/wp-content/uploads/2012/05/Screen-shot-2012-04-30-at-10.29.02-PM-300x211.png" alt="" width="300" height="211" /></a><a href="http://mozenda.com/">Needlebase</a></h2>
<p>This is the last product for this blog post, but it&#8217;s the one I&#8217;m currently exploring as we speak.  When I first came to their site, I said&#8230;.ugh&#8230;.this looks terrible.  However, after doing some research I came to find that it was recently acquired by Google and was made free (for the time being) for any google/gmail/chrome/blah blah blah&#8230;.registered user.  The interface and data modeling is very difficult to get a grasp on if you&#8217;re not familiar with nodes and types and how data models generally come together.  However, if you&#8217;re like me where you understand this type of computing, however need a quick and easy way to grab sources of data through a gui interface that allows you to select the various pieces of data on the actual page you are looking to grab data from (Mozenda also does this btw as does Selenium&#8217;s FF plugin), but do it a bit more efficiently than the others&#8230;you may want to give Needlebase a try.</p>
<p>Ok back to Needlepoint&#8230;.I mean Needlebasing&#8230;.I hope this helps a few of you in your searches for robust, easy to use Data Mining software.</p>
<p>&nbsp;</p>
<p><em>-Doug Justice is the CEO &amp; Chief Superhero of Justice Solutions LLC.  He is an expert in over 10 different programming languages and development methodologies and performs analysis of company development teams, new business concepts, and web marketing potentials.</em></p>

				<!-- Social Sharing Toolkit v2.0.4 | http://www.marijnrongen.com/wordpress-plugins/social_sharing_toolkit/ -->
				<div class="mr_social_sharing_wrapper"><span class="mr_social_sharing"><iframe src="https://www.facebook.com/plugins/like.php?locale=en_US&amp;href=http%3A%2F%2Fjusticesolutionsllc.com%2Fdata-mining-for-not-quite-dummies%2F&amp;layout=standard&amp;show_faces=false&amp;width=51px&amp;height=24px" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:51px; height:24px;" allowTransparency="true"></iframe></span><span class="mr_social_sharing"><div id="fb-root"></div><fb:send href="http://justicesolutionsllc.com/data-mining-for-not-quite-dummies/" font=""></fb:send></span><span class="mr_social_sharing"><a href="http://twitter.com/share?url=http%3A%2F%2Fjusticesolutionsllc.com%2Fdata-mining-for-not-quite-dummies%2F&amp;text=Data+Mining+for+Not+Quite+Dummies&amp;via=justicesolution" target="_blank" class="mr_social_sharing_popup_link"><img src="http://justicesolutionsllc.com/wp-content/plugins/social-sharing-toolkit/images/buttons/twitter.png" alt="Share on Twitter" title="Share on Twitter"/></a></span><span class="mr_social_sharing"><g:plusone size="medium" count="false" href="http://justicesolutionsllc.com/data-mining-for-not-quite-dummies/"></g:plusone></span><span class="mr_social_sharing"><a href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fjusticesolutionsllc.com%2Fdata-mining-for-not-quite-dummies%2F&amp;title=Data+Mining+for+Not+Quite+Dummies" target="_blank" class="mr_social_sharing_popup_link"><img src="http://justicesolutionsllc.com/wp-content/plugins/social-sharing-toolkit/images/buttons/stumbleupon.png" alt="Submit to StumbleUpon" title="Submit to StumbleUpon"/></a></span><span class="mr_social_sharing"><a href="http://digg.com/submit?url=http%3A%2F%2Fjusticesolutionsllc.com%2Fdata-mining-for-not-quite-dummies%2F&amp;title=Data+Mining+for+Not+Quite+Dummies" target="_blank" class="mr_social_sharing_popup_link"><img src="http://justicesolutionsllc.com/wp-content/plugins/social-sharing-toolkit/images/buttons/digg.png" alt="Digg This" title="Digg This"/></a></span><span class="mr_social_sharing"><a href="mailto:?subject=Data Mining for Not Quite Dummies&amp;body=http://justicesolutionsllc.com/data-mining-for-not-quite-dummies/"><img src="http://justicesolutionsllc.com/wp-content/plugins/social-sharing-toolkit/images/buttons/email.png" alt="Share via email" title="Share via email"/></a></span></div>]]></content:encoded>
			<wfw:commentRss>http://justicesolutionsllc.com/data-mining-for-not-quite-dummies/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Experts in Data Mining, Screen Scraping, and Import to a Single Source</title>
		<link>http://justicesolutionsllc.com/experts-in-data-mining-screen-scraping/</link>
		<comments>http://justicesolutionsllc.com/experts-in-data-mining-screen-scraping/#comments</comments>
		<pubDate>Thu, 18 Dec 2008 18:58:29 +0000</pubDate>
		<dc:creator>Doug</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Screen Scraping]]></category>
		<category><![CDATA[Web Development]]></category>
		<category><![CDATA[Web Programming]]></category>

		<guid isPermaLink="false">http://www.justicesolutionsllc.com/?p=144</guid>
		<description><![CDATA[Ok, it&#8217;s time to toot our own horn today.  I&#8217;ve seen plenty of blogs on this where businesses and individuals are looking to grab data from multiple sources (that they are allowed to scrape mind you), and combine it all into one grand application or database&#8230;often to no avail and leave very frustrated comments about [...]]]></description>
			<content:encoded><![CDATA[<p>Ok, it&#8217;s time to toot our own horn today.  I&#8217;ve seen plenty of blogs on this where businesses and individuals are looking to grab data from multiple sources (that they are allowed to scrape mind you), and combine it all into one grand application or database&#8230;often to no avail and leave very frustrated comments about their trials and tribulations.</p>
<p>Well, we&#8217;ve been doing this for quite a while and I felt as though it was time to let other folks know about it since there are a lot of opportunities for businesses to save on some soft employee costs, reduce overhead, and increase productivity and data.</p>
<p><strong>A Quick Word About Why You Should Hire a Web Development/Design Company<br />
</strong>This is where web development companies are worth their weight in gold.  Yes, I know there&#8217;s plenty of cookie cutter do it yourself websites out there, and they are great and very affordable.  But when you have an idea that is unique, or a situation that is unique, you really don&#8217;t want what is out there available for everyone else.  I mean, honestly&#8230;if Dell used a cookie cutter build your own computer plugin, would they really be all that different from the other 100 web stores that offer the same thing at approximately the same cost?  No, you&#8217;d probably go to the place that was unique, professional looking, and looked as though they actually spent some money building a better mousetrap.  Ok&#8230;so there&#8217;s my two cents why you should actually use us&#8230;.on to the good stuff.</p>
<p><strong>Why You Would Need an Expert in Data Mining and Screen Scraping</strong><br />
Let&#8217;s say you are a company in Arizona who has an idea to put together a site that puts together all of the latest reduction in homes for sale prices.  Now there&#8217;s a bunch of websites out there that you&#8217;ve contacted and said you&#8217;re going to promote this site and obviously push the business to the various real estate agents involved.  Now comes the fun&#8230;.gathering this data on a daily basis and putting it all into your application.  That&#8217;s where Justice Solutions comes in.</p>
<p><strong>Grab Data, Massage Data, Import Data, Display Data<br />
</strong>So now you hire Justice Solutions to do this seemingly impossible task.  We now take the web addresses of these various sites and create an application that will go out to each one, identify the way each site displays its data, grab it (even if it&#8217;s on multiple pages), and then perform some massaging scripts that will convert any text data into price data, etc.  We then import that data into your master database which will then be used to display the data to your website users in a unique format specific to your website.</p>
<p><strong>Why It&#8217;s So Difficult<br />
</strong>The reason this is usually such a daunting task is because if you&#8217;re not used to it, trying to identify data patterns and other things that go into dealing with data from multiple sources can be quite overwhelming.  Also, you have to be able to automate all of it so it happens quickly and without much user intervention.</p>
<p><strong>We&#8217;ll Toot Today<br />
</strong>So if you or a person you know has an idea like this, or an existing business that would benefit from this type of application, please refer them to us.  We gladly pay referral fees and would be happy to help in easing the fears of this commonly thought of, but rarely successful, business model.  Contact info@justicesolutionsllc.com for more information.</p>
<p>Until next time&#8230;happy coding&#8230;and mining&#8230;.</p>
<p>Doug.</p>

				<!-- Social Sharing Toolkit v2.0.4 | http://www.marijnrongen.com/wordpress-plugins/social_sharing_toolkit/ -->
				<div class="mr_social_sharing_wrapper"><span class="mr_social_sharing"><iframe src="https://www.facebook.com/plugins/like.php?locale=en_US&amp;href=http%3A%2F%2Fjusticesolutionsllc.com%2Fexperts-in-data-mining-screen-scraping%2F&amp;layout=standard&amp;show_faces=false&amp;width=51px&amp;height=24px" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:51px; height:24px;" allowTransparency="true"></iframe></span><span class="mr_social_sharing"><div id="fb-root"></div><fb:send href="http://justicesolutionsllc.com/experts-in-data-mining-screen-scraping/" font=""></fb:send></span><span class="mr_social_sharing"><a href="http://twitter.com/share?url=http%3A%2F%2Fjusticesolutionsllc.com%2Fexperts-in-data-mining-screen-scraping%2F&amp;text=Experts+in+Data+Mining%2C+Screen+Scraping%2C+and+Import+to+a+Single+Source&amp;via=justicesolution" target="_blank" class="mr_social_sharing_popup_link"><img src="http://justicesolutionsllc.com/wp-content/plugins/social-sharing-toolkit/images/buttons/twitter.png" alt="Share on Twitter" title="Share on Twitter"/></a></span><span class="mr_social_sharing"><g:plusone size="medium" count="false" href="http://justicesolutionsllc.com/experts-in-data-mining-screen-scraping/"></g:plusone></span><span class="mr_social_sharing"><a href="http://www.stumbleupon.com/submit?url=http%3A%2F%2Fjusticesolutionsllc.com%2Fexperts-in-data-mining-screen-scraping%2F&amp;title=Experts+in+Data+Mining%2C+Screen+Scraping%2C+and+Import+to+a+Single+Source" target="_blank" class="mr_social_sharing_popup_link"><img src="http://justicesolutionsllc.com/wp-content/plugins/social-sharing-toolkit/images/buttons/stumbleupon.png" alt="Submit to StumbleUpon" title="Submit to StumbleUpon"/></a></span><span class="mr_social_sharing"><a href="http://digg.com/submit?url=http%3A%2F%2Fjusticesolutionsllc.com%2Fexperts-in-data-mining-screen-scraping%2F&amp;title=Experts+in+Data+Mining%2C+Screen+Scraping%2C+and+Import+to+a+Single+Source" target="_blank" class="mr_social_sharing_popup_link"><img src="http://justicesolutionsllc.com/wp-content/plugins/social-sharing-toolkit/images/buttons/digg.png" alt="Digg This" title="Digg This"/></a></span><span class="mr_social_sharing"><a href="mailto:?subject=Experts in Data Mining, Screen Scraping, and Import to a Single Source&amp;body=http://justicesolutionsllc.com/experts-in-data-mining-screen-scraping/"><img src="http://justicesolutionsllc.com/wp-content/plugins/social-sharing-toolkit/images/buttons/email.png" alt="Share via email" title="Share via email"/></a></span></div>]]></content:encoded>
			<wfw:commentRss>http://justicesolutionsllc.com/experts-in-data-mining-screen-scraping/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

