<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Datamartist.com &#187; MS Excel</title>
	<atom:link href="http://www.datamartist.com/category/ms-excel/feed" rel="self" type="application/rss+xml" />
	<link>http://www.datamartist.com</link>
	<description>Reduce cost with self serve data transformation</description>
	<lastBuildDate>Mon, 26 Jul 2010 18:33:50 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Spreadsheet errors- Fear, uncertainty and doubt</title>
		<link>http://www.datamartist.com/spreadsheet-risk-and-errors-fear-uncertainty-and-doubt</link>
		<comments>http://www.datamartist.com/spreadsheet-risk-and-errors-fear-uncertainty-and-doubt#comments</comments>
		<pubDate>Mon, 11 Jan 2010 18:54:46 +0000</pubDate>
		<dc:creator>James Standen</dc:creator>
				<category><![CDATA[Business Intelligence Architecture]]></category>
		<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[MS Excel]]></category>
		<category><![CDATA[Reality Check]]></category>
		<category><![CDATA[data culture]]></category>
		<category><![CDATA[Business Intelligence trends]]></category>
		<category><![CDATA[Excel]]></category>

		<guid isPermaLink="false">http://www.datamartist.com/?p=3831</guid>
		<description><![CDATA[I love the acronym FUD which stands for "Fear, uncertainty and doubt". What I don't love is the underhanded use of FUD to manipulate peoples behavior. Spreading FUD is not about creating something new, but destroying- destroying someones confidence in something, clouding the real issue, stopping a new or creative direction from being taken. FUD [...]]]></description>
			<content:encoded><![CDATA[<p>I love the acronym FUD which stands for "Fear, uncertainty and doubt".  What I don't love is the underhanded use of FUD to manipulate peoples behavior.  Spreading FUD is not about creating something new, but destroying- destroying someones confidence in something, clouding the real issue, stopping a new or creative direction from being taken.  FUD is often used to block reform and change because FUD can cause people to do nothing- and doing nothing is good for the incumbent.</p>
<p>In the data analysis realm, spreadsheet errors are often used to try to dissuade companies from letting their people "work with the data directly".  Software vendors of all sizes, but particularly the really big ones (those incumbants) spread FUD because if they can stop people from getting at the data themselves, it increases the chance of companies buying some more business intelligence suites.</p>
<p>The argument goes something like this:</p>
<blockquote><p>Spreadsheets have been shown to be plagued with errors, many studies showing error rates above 90%.  You need to reduce the risk that spreadsheets are creating in your organization by establishing formal, documented processes that are created an managed by professionals using sophisticated tools.</p></blockquote>
<p>Then the usual nightmare scenarios are brought out, all involving rabid Auditors, Sarbane-Oxley, governance failures etc.</p>
<p><img src="http://www.datamartist.com/wp-content/uploads/2010/01/accidently-put-last-years-spreadsheet-number-into-annual-report1.jpg" alt="accidently-put-last-years-spreadsheet-number-into-annual-report" title="accidently-put-last-years-spreadsheet-number-into-annual-report" width="341" height="226" class="alignright size-full wp-image-3839" />Now, don't get me wrong, spreadsheet errors are a very real and serious problem, and there are all sorts of data applications that should never be done in Excel or other ad-hoc, user driven tools. Ever.  Formal documented processes are critically important, and there are lots of places where you better be using the right tools and professionals.  </p>
<p>I have seen the culture of the spreadsheet completely undermine initiatives that would have driven better data quality, data analysis and business processes.  The spreadsheet certainly has its dark side.</p>
<p>But the problem is that FUD paints with a broad brush.  People take it as "Spreadsheets with data in them? Bad news. Don't do it.  Individuals able to get at the data, and quickly transform it, analyze it?  Who knows what they'll do- shut them down!"</p>
<p>Sadly, from a data quality point of view, sometimes the spreadsheets have the BEST data quality- because people have fixed the issues they can't fix in the transactional system due to constraints or IT department delays.</p>
<h2>Encourage positive change with reasonable controls.</h2>
<p>Intelligent, responsible people should be encouraged to use "informal" methods and tools to do data analysis.  </p>
<p>These people will find things, learn things, and drive positive change (including change in those big formal professional systems).  </p>
<p>They should do it with a reasonable understanding that doing things in an informal way, with spreadsheets or other tools does introduce errors, and should consider this when they recommend taking action based on the results. </p>
<h2>Balance between two extremes </h2>
<p><strong>The totalitarian state:</strong> I don't think there is an  IT department in the world that is capable of stopping all unofficial data analysis.  In fact, I would suggest that the moment such an IT department comes into existence, it would kill the host company, a harsh sort of self-regulation.  People interested in data and thinking for themselves would just pack up and leave. So who would be left making the decisions and based on what?</p>
<p><strong>The twisted web of spreadsheets:</strong> Companies that allow an anything goes, visual basic code, macros and manual cut and paste direct to the annual report environment are not going to be long for the world either.  They populate the horror story pages on <a href="http://www.eusprig.org/horror-stories.htm" target="_blank">the spreadsheet risk websites.</a></p>
<h2>The zone of win.</h2>
<p>You want to be somewhere between insane spreadsheet addiction and strict formal big tool paralysis.  </p>
<p>I submit that companies that balance risk while still encouraging their smart people to "play" with the data and do analysis in new and interesting ways with new tools are going to win.</p>
<p>Again, don't let this process generate your profit and loss statement- understand where and what the informal discovery process is for- but do let it discover things.  If it discovers something interesting you'll have the chance to check for the errors.  Make sure its part of the process to do so.</p>
<p>By letting the FUD get you down, you'll never get that far and who knows what insights you might be giving up?</p>
<p>Of course,  we believe you should go even further and give those intelligent, responsible people new tools that are less error prone than spreadsheets but still provide as much or even greater flexibility.  That's why we're building Datamartist after all.</p>
<p>Openness, balance, and clear minded pragmatism will get you further than FUD every time.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.datamartist.com/spreadsheet-risk-and-errors-fear-uncertainty-and-doubt/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>MS Access vs Excel vs Datamartist</title>
		<link>http://www.datamartist.com/ms-access-vs-excel-vs-datamartist-a-do-it-yourself-guide</link>
		<comments>http://www.datamartist.com/ms-access-vs-excel-vs-datamartist-a-do-it-yourself-guide#comments</comments>
		<pubDate>Fri, 06 Mar 2009 02:33:06 +0000</pubDate>
		<dc:creator>James Standen</dc:creator>
				<category><![CDATA[Data Modelling]]></category>
		<category><![CDATA[MS Access]]></category>
		<category><![CDATA[MS Excel]]></category>
		<category><![CDATA[Datamartist Tool]]></category>
		<category><![CDATA[Excel Data Import]]></category>
		<category><![CDATA[Personal Data Marts]]></category>

		<guid isPermaLink="false">http://www.datamartist.com/?p=1251</guid>
		<description><![CDATA[When data analysis requirements really get tough, the tough get going- and start to seriously use databases. Let's face it, if you're considering Microsoft Access chances are what you need to get done is beyond what Excel does well, so you're looking for options. Its also likely that your IT department is unable or un-willing [...]]]></description>
			<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2009/03/excel-database-datamartist1.jpg" alt="excel-database-datamartist1" title="excel-database-datamartist1" width="200" height="183" class="alignright size-full wp-image-1301" />When data analysis requirements really get tough, the tough get going- and start to seriously use databases.</p>
<p>Let's face it, if you're considering Microsoft Access chances are what you need to get done is beyond what Excel does well, so you're looking for options.  Its also likely that your IT department is unable or un-willing to help you out- this being even more likely as the recession reduces reporting budgets left, right and center.</p>
<p>Two of the key things that lead someone to search for a database solution are:</p>
<ul>
<li><strong>Data Volume</strong>- More than a million rows and Excel becomes very difficult, even before that the performance suffers.</li>
<li><strong>Flexibility to Join Tables</strong> - Vlookup and VBA code only go so far- Access gives an easy way to make joins between tables, one of the powerful features of relational databases.</li>
</ul>
<p>Now, the data volume is what it is- if you have millions and millions of rows, you need something to cut it down to size before you move it into your Excel spreadsheet. </p>
<p>On the other point, however, I can hear the Excel fans saying "now wait a minute, Excel can do that, I don't really need a database" and they are right.  But they are almost always right- Excel can do almost anything. It does not mean, however that its the best tool for the job. Using Vlookup and VBA scripts to join up multiple tables is not my idea of a fun time. And even in Excel 2007 I find the pivot tables annoying and prone to break if I'm adding categories, moving data sets or heaven forbid changing number and order of columns.</p>
<p>Microsoft Access has a very nice interface for creating joins between tables, just a simple drag and drop between fields. The cross tab query capability is useful and good, and being a relational database it's more tolerant of changes to table structure because it's not messing with cell references.</p>
<p>"But", many who have used MS Access will say, "its pretty complex to learn, and even if I do start to get the query stuff down, it doesn't handle bad data well."</p>
<p>Bad data?  Who has bad data? Isn't all data pristine, as intended, correctly formatted and accurate?</p>
<p><img src="/wp-content/uploads/2009/03/enough-to-make-access-decide-its-text1.jpg" alt="enough-to-make-access-decide-its-text1" title="enough-to-make-access-decide-its-text1" width="210" height="225" class="alignright size-full wp-image-1289" />One of the huge differences between Excel and MS Access is that Excel is extremely flexible.  (Probably more flexible than your auditor would like, but thats a different story).  One source of Excels flexibility is its ability to accept different data types in the same column, and to allow editing of cells quickly. In Microsoft Access, for example, when it sees some variation it either discards the data or defaults to the data type "Text"- meaning now you can't perform the calculations you need to do on your data.<img src="/wp-content/uploads/2009/03/sales-data-import-errors.jpg" alt="sales-data-import-errors" title="sales-data-import-errors" width="365" height="232" class="alignright size-full wp-image-1289" /></p>
<p>This illustrates one of the challenges people face in trying to use a database - databases are very strict on data types.  Once you declare a data type for a column, if you import data into the table, the database will discard the values that do not conform to that data type.  In Excel, you get cell errors if you try calculations but the original data is still there.</p>
<p>One of the powerful features of the <a href="/product">Datamartist tool</a> is the fact that it has an underlying database structure that provides flexibility on data types.  Unlike MS Access and other databases, Datamartist can store dates, numbers, strings and booleans natively in a single column. (It does not convert to strings- it stores the full object).  Take a look at this example:<br />
<img src="/wp-content/uploads/2009/03/datamartist-dynamicly-handles-data-type-at-row-level1.jpg" alt="datamartist-dynamicly-handles-data-type-at-row-level1" title="datamartist-dynamicly-handles-data-type-at-row-level1" width="425" height="226" class="aligncenter size-full wp-image-1294" /></p>
<p>In each individual row, Datamartist completes the calculation if possible.  Datamartist is a database that gives you the freedom of a Spreadsheet. Of course, just like excel, if you ask for a calculation on a value that is meaningless you will get an error- but at the individual value- not a full row discard.  This means that with messy data you can still work with it, bring it in, and fix it.  In Access or another database, you can't even get it through the front door (or it defaults to text, making many calculations impossible).</p>
<p>This won't be the last time I compare these three tools- and the types of data structures and tasks each of them are most effective with.</p>
<p>In the mean time- Download <a href="/downloads">Datamartist</a>- see what I'm talking about with your own data.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.datamartist.com/ms-access-vs-excel-vs-datamartist-a-do-it-yourself-guide/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
