<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Datamartist.com &#187; Personal data mart</title>
	<atom:link href="http://www.datamartist.com/tag/personal-data-mart/feed" rel="self" type="application/rss+xml" />
	<link>http://www.datamartist.com</link>
	<description>Reduce cost with self serve data transformation</description>
	<lastBuildDate>Mon, 26 Jul 2010 18:33:50 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Adding self serve data transformation to reduce shadow systems</title>
		<link>http://www.datamartist.com/adding-self-serve-data-transformation-to-reduce-shadow-systems</link>
		<comments>http://www.datamartist.com/adding-self-serve-data-transformation-to-reduce-shadow-systems#comments</comments>
		<pubDate>Sat, 07 Nov 2009 16:04:36 +0000</pubDate>
		<dc:creator>James Standen</dc:creator>
				<category><![CDATA[Business Intelligence Architecture]]></category>
		<category><![CDATA[Data Transformation]]></category>
		<category><![CDATA[Spreadmarts]]></category>
		<category><![CDATA[Business Intelligence trends]]></category>
		<category><![CDATA[Personal data mart]]></category>

		<guid isPermaLink="false">http://www.datamartist.com/?p=2203</guid>
		<description><![CDATA[Do you have lots of unoffical spreadsheets in your organization being used for data analysis? Is the data warehouse use low to non-existent, yet somehow lots of data is appearing in power point presentations and excel spreadsheets all over the company? I believe a key to understanding how information moves around your organization is to [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.datamartist.com/wp-content/uploads/2009/11/spreadsheet-data-is-official-its-just-seasoned1.jpg" alt="spreadsheet-data-is-official-its-just-seasoned" title="spreadsheet-data-is-official-its-just-seasoned" width="376" height="291" class="alignright size-full wp-image-3424" />Do you have lots of unoffical spreadsheets in your organization being used for data analysis? Is the data warehouse use low to non-existent, yet somehow lots of data is appearing in power point presentations and excel spreadsheets all over the company?</p>
<p>I believe a key to understanding how information moves around your organization is to think of it as a mini economy. (I know, the economy is not our favorite subject right now, but bear with me).</p>
<p>There are information suppliers, and information consumers.  The consumers are willing to pay more or less for different types of information, and different methods of supplying information have different costs.  In the end, the market decides what gets done and what does not get done.</p>
<p>And like many markets, there is also a underground economy- places consumers go if the official prices don't make sense, or the products they want are not available on the open market.</p>
<p>In many companies, the IT department in theory has a monopoly on information supply, however the underground is active and constitutes a significant supply.  The underground in this case is all the excel spreadsheets, the MS Access databases etc. used to make the shadow systems and spread marts.  Spreadmarts seem to exist in the majority of enterprises- I've mentioned an <a href="/spreadmarts-and-data-shadow-systems-the-debate/" target="_blank">interesting study regarding these shadow systems</a> previously, and the attitudes people have.</p>
<h2>To help illustrate this I am going to make up some data and put it in colorful graphs.</h2>
<p><img src="/wp-content/uploads/2009/05/relative-cost-data-warehouse-data-mart-spreadmart2.jpg" alt="relative-cost-data-warehouse-data-mart-spreadmart2" title="relative-cost-data-warehouse-data-mart-spreadmart2" width="376" height="221" class="alignleft size-full wp-image-2211"/></p>
<p>Looking at the first graph, in broad terms a data warehouse based approach will have higher costs than one based on data marts (because data warehouse provide more cross enterprise integration, which requires more effort), and the spreadmarts will have the lowest perceived cost.  It's important to note that the actual cost of spreadmarts are higher, but <strong>percieved</strong> cost is what drives the consumers choice.</p>
<p>The trick is that because the percieved cost of spreadmarts is so low, and because there is no sanctioned enterprise solution to compete, a significant amount of effort is put in to these systems for any type of analysis that is percieved to be possible.  Of course for certain data volumes or complexities there is no alternative to a full fledged data warehouse or data mart project, but for almost everything else, business users and analysts will often try to go it alone creating a chaos of spreadsheets and data bases.</p>
<p>The problem is, even "experts" can't accurately estimate how much effort the data analysis is.  So estimates for how long it will take to "whip it up in excel" by non-experts are almost always low by orders of magnitude.</p>
<h2>Don't dictate.  Engage with sanctioned tools that work the way people want to work.</h2>
<p>The key to adjusting this market imbalance is to introduce a new sanctioned product line, in effect undercutting the "black market".<br />
<img src="/wp-content/uploads/2009/05/relative-cost-data-warehouse-data-mart-spreadmart-plus-self-serve.jpg" alt="relative-cost-data-warehouse-data-mart-spreadmart-plus-self-serve" title="relative-cost-data-warehouse-data-mart-spreadmart-plus-self-serve" width="470" height="274" class="alignnone size-full wp-image-2213" /></p>
<p>This is exactly what self serve data transformation is about.  Rather than leaving users to do it themselves in Excel- IT can provide specific tools, and thereby reduce the amount of completely opaque data transformation going on, while still providing users with the ability to get what they need. </p>
<h2>So why is that better?</h2>
<ul>
<li><strong>It opens up the dialog</strong> -  Talking is better than having a "Us" vs "Them" mentality.  It lets you meet the people involved, lets you discuss their challenges with them, and provides an opening for discussion of important topics like data quality, master data management and data security.</li>
<li><strong>You'll know who the power users are</strong> -  Right now, it is potentially anyone who has Excel- chances are that's everyone in your organisation.</li>
<li><strong>It gives you visibility on what matters to the business</strong> - If you know what the hot topics are, it can help you keep the official systems relevant and prioritize your efforts where they will do the most good.</li>
</ul>
<p>What has to be different in this new relationship, however, is that IT has to understand about the "self" in self-serve.  People will do things that no self-respecting ETL developer or data warehouse architect would ever sanction.  If you clamp down and stop them, they will abandon the tools and return to the wild west.  IT believes that it has the power in the relationship, but in fact the users are able to walk at any time.  So add value, communicate, educate, but don't dictate.  If your relationship with the business users, and the "Kings of the spreadmart" is poor to start, you have to give it time to evolve.</p>
<h2>"But we just can't let them do that."</h2>
<p>Resist the urge to clamp down.</p>
<p>Keep your systems secure, guard your infrastructure, but don't have any illusions that you can stop people from analyzing and transforming their data.</p>
<p>If they want to calculate net sales in a particular way then they'll do it in excel, and it will be the number that the CEO sees.  The business is made up of grownups, after all.  IT has a responsibility to explain the issues and challenges that shadow systems and rampant spreadsheeting can cause, but I have yet to see or hear of a company where an authoritarian approach works.   As Princess Leia said- <a href="http://www.entertonement.com/clips/qswvtcydps--Star-Wars-Episode-IV-A-New-Hope-Carrie-Fisher-Princess-Leia-Organa-The-more-you-tighten-your-grip-Tarkin-the-more-star-systems-will-slip-through-your-fingers">"The more you tighten your grip, Tarkin, the more star systems will slip through your fingers."</a></p>
<h2>Arming the rebels</h2>
<p>The business intelligence vendors are all realizing what the crowd pleasers are-  really good integration into office applications, excel at the forefront.  People want at their data.</p>
<p>Microsoft has of course long provided the main weapons for the shadow systems, MS Excel and MS Access- and they are going nuclear with the addition of "Power Pivot" to Excel 2010-  although it is largely a presentation layer tool, and probably won't be used widely for data transformation itself.</p>
<p>Trying to fight all this with the standard tools of closing down the ability to export data, hiring an army of report writers, and constantly raving about the dangers and pitfalls of run away spreadsheets is like pushing on a rope.  </p>
<h2>Provide a safe, legal alternative to the free for all.</h2>
<p>Talk to your business users.  Understand their needs.  Provide them with tools.  Work with them to both empower responsible analysts, and avoid the worst issues that existing shadow systems are creating.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.datamartist.com/adding-self-serve-data-transformation-to-reduce-shadow-systems/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>MS Access query example and comparision to Datamartist</title>
		<link>http://www.datamartist.com/microsoft-access-query-example-and-comparision-to-datamartist</link>
		<comments>http://www.datamartist.com/microsoft-access-query-example-and-comparision-to-datamartist#comments</comments>
		<pubDate>Tue, 31 Mar 2009 22:59:55 +0000</pubDate>
		<dc:creator>James Standen</dc:creator>
				<category><![CDATA[Business Intelligence Architecture]]></category>
		<category><![CDATA[Data Modelling]]></category>
		<category><![CDATA[Data Transformation]]></category>
		<category><![CDATA[MS Access]]></category>
		<category><![CDATA[Microsoft Excel]]></category>
		<category><![CDATA[Access]]></category>
		<category><![CDATA[Data Mart Example]]></category>
		<category><![CDATA[Excel]]></category>
		<category><![CDATA[Personal data mart]]></category>

		<guid isPermaLink="false">http://www.datamartist.com/?p=1321</guid>
		<description><![CDATA[Microsoft Access allows users to create complex queries and analyze large data sets. However, it can be complicated to use compared to Excel. In this post, I'll talk about ms access queries and the equivalent way to perform the same data transformation in the Datamartist tool- visually and simply. Microsoft Access has a clear role [...]]]></description>
			<content:encoded><![CDATA[<p>Microsoft Access allows users to create complex queries and analyze large data sets.  However, it can be complicated to use compared to Excel.  In this post, I'll talk about <a href="/help-support/tutorials/microsoft-access-examples-and-tutorials">ms access queries</a> and the equivalent way to perform the same data transformation in the <a href="/product">Datamartist tool</a>- visually and simply.</p>
<p>Microsoft Access has a clear role to play when a small, light database application is required.  However, it has a learning curve, and is not necessarily the best tool for data analysis.</p>
<h2>Product Segmentation Query Example</h2>
<p>Lets look at an example ms access query or two and see how we can do the same thing Datamartist, only without the queries and without any SQL. For this example, lets say that we have two sets of sales data from different time periods, and a product list, and we want to define some product segments based on color and price.  We want to get a summary of the sales Qty and average price sold by month, broken out by the new categories which are as follows:</p>
<ul>
<li> "Red and High Priced" If the product is Red and its minimum price is more than $1000</li>
<li> "Red Low Price wide price range" If the product is Red, has a minimum price less than $1000 but has a min to max price of more than $200</li>
<li> "Red Low Price small price range" If its Red and not in the first two segments</li>
<li> "Yellow" if the product is yellow. </li>
<li> "Other" for all the rest</li>
</ul>
<p>The three data tables we have are as follows:</p>
<ol>
<li> Sales 03-06 with about 120 000 rows, which contains sales data from 2003 - 2006</li>
<li> Sales 2007  with about 30 000 rows, which contains sales data for 2007</li>
<li> Products  which contains the colors for all the products and their minimum and maximum prices</li>
</ol>
<p>So- first step is to combine the two data tables, in Access, this is done with a UNION query with the following SQL code:</p>
<blockquote><p>select * from [Sales Data 03-06] UNION select * from [Sales Data 2007];</p></blockquote>
<p>In Datamartist, we simply connect the two tables up to a combine block.<br />
<img src="/wp-content/uploads/2009/03/segmentation-example-datamartist-combine1.jpg" alt="segmentation-example-datamartist-combine1" title="segmentation-example-datamartist-combine1" width="264" height="234" class="alignnone size-full wp-image-1394" /></p>
<p>Next, we need to define the segmentation-  again in Access this is done with a Query, this time by nesting IIF statements to add a new column called "Product_Segment" to the resulting query.</p>
<blockquote><p>SELECT Products.Product_ID, Products.Product_Name, Products.Product_Group, Products.Product_Category, Products.Product_SubCategory, Products.Shipping_Weight, Products.Color, Products.Price_Min, Products.Price_Max, IIf([Color]="Red" And [Price_Min]>1000,"Red and High Priced",IIf([Color]="Red" And ([Price_max]-[Price_min])>200,"Red Low Price wide price range",IIf([Color]="Red","Red Low Price small price range",IIf([Color]="Yellow","Yellow","Other")))) AS Product_Segment<br />
FROM Products;</p></blockquote>
<p>In Datamartist, we use a segmentation block to do the same thing.  The interface is graphical, and the syntax is the same as you would use in Excel.  There is no need to nest any IF statements, because the overall block is designed to do that.  Heres what the blocks look like-  the MS Access import block on the left, and the segmentation rule block on the right.<br />
<img src="/wp-content/uploads/2009/03/segmentation-example-datamartist-segment-block.jpg" alt="segmentation-example-datamartist-segment-block" title="segmentation-example-datamartist-segment-block" width="418" height="211" class="alignnone size-full wp-image-1428" /><br />
Each segment has the statement that defines if a row is in the segment or not.   The block tests each segment rule in order, starting at the top- the first statement that solves as "TRUE" defines the value for the Product_Segment column for that row. Dragging the segments up and down changes what order the rules are checked.</p>
<p><a href="/resources/images/Segmentation-Example-Product.jpg" target="_blank" onClick="javascript: pageTracker._trackPageview('/screenshots/Segmentation-Example-Product'); "><img src="/resources/images/Segmentation-Example-Product-Thumb.jpg">
<p style="padding:8px;">(Click to Enlarge)</p>
<p></a></p>
<p>Then we have to Join this new product dimension (with the segmentation column) to the sales data, and summarize.</p>
<p>In MS Access, this is done with more queries-  Heres what Access looks like when we're done.<br />
<img src="/wp-content/uploads/2009/03/segmentation-example-access-gui1.jpg" alt="segmentation-example-access-gui1" title="segmentation-example-access-gui1" width="450" height="485" class="alignnone size-full wp-image-1405" /><br />
Compare that list of Tables and Queries to the visual, left to right layout of the Datamartist data canvas that does the same thing.  Without ever having to write any SQL code:</p>
<h2>The VISUAL way to do it</h2>
<p><img src="/wp-content/uploads/2009/03/segmentation-example-solved-canvas.jpg" alt="segmentation-example-solved-canvas" title="segmentation-example-solved-canvas" width="406" height="314" class="alignnone size-full wp-image-1403" /></p>
<p><a href="/resources/images/Segmentation-Example-Datamartist-full-app-shot.jpg" target="_blank" onClick="javascript: pageTracker._trackPageview('/screenshots/Segmentation-Example-Datamartist-full-app-shot'); "><img src="/resources/images/Segmentation-Example-Datamartist-full-app-shot-Thumb.jpg" class="alignright size-full wp-image-1430" ></a><br />
In Datamartist you can see the flow of the data, the row counts are clearly displayed, and clicking on the connectors will bring up the underlying data set in the data viewer.  Its clear which block feeds which, and by adding more blocks and connecting them at the desired point in the data flow, new analysis can be created.</p>
<p>Take Datamartist for a trial run-  <a href="/downloads">download it now</a> because maybe you don't have to learn microsoft access queries after all.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.datamartist.com/microsoft-access-query-example-and-comparision-to-datamartist/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>A Cost comparision between Data Marts and a Data Warehouse</title>
		<link>http://www.datamartist.com/a-cost-comparision-between-data-marts-and-a-data-warehouse</link>
		<comments>http://www.datamartist.com/a-cost-comparision-between-data-marts-and-a-data-warehouse#comments</comments>
		<pubDate>Mon, 19 Jan 2009 02:19:51 +0000</pubDate>
		<dc:creator>James Standen</dc:creator>
				<category><![CDATA[Business Intelligence Architecture]]></category>
		<category><![CDATA[Cost Reduction]]></category>
		<category><![CDATA[Personal Data Marts]]></category>
		<category><![CDATA[Bill Inmon]]></category>
		<category><![CDATA[Data mart]]></category>
		<category><![CDATA[Data warehouse]]></category>
		<category><![CDATA[Personal data mart]]></category>

		<guid isPermaLink="false">http://www.datamartist.com/?p=712</guid>
		<description><![CDATA[I've noticed a fair bit of search traffic focusing on cost questions, particularly which is cheaper; a series of data marts or a single enterprise data warehouse.  I think it's a bit like the question of lease vs buy.  Starting off building a single departmental data mart will represent a much smaller cash flow out. [...]]]></description>
			<content:encoded><![CDATA[<p><a href="/wp-content/uploads/2009/01/data-warehouse-vs-data-mart-cost.jpg"><img class="alignright size-medium wp-image-715" title="data-warehouse-vs-data-mart-cost" src="/wp-content/uploads/2009/01/data-warehouse-vs-data-mart-cost-300x272.jpg" alt="" width="300" height="272" /></a>I've noticed a fair bit of search traffic focusing on cost questions, particularly which is cheaper; a series of data marts or a single enterprise data warehouse.  I think it's a bit like the question of lease vs buy.  Starting off building a single departmental data mart will represent a much smaller cash flow out.  But by the time you've built all the data marts, and then have to redo them all again to integrate between subject areas and departments, I'd have to say that I'm with Bill Inmon when he says no number of data marts add up to a data warehouse.</p>
<p>With data marts (just like leasing a car) you get behind the wheel quickly, and it gets you where you want to go in style.  And the monthly payment is something you can afford now.  However, long term, well, in three years you don't own it, and have paid a bundle.</p>
<p>But let's be realistic.  Just as having all the cash on hand to buy the car outright just might not be in the cards,  a true data warehouse might require a very significant outlay before anything comes out the other end, making it unaffordable.  A quick, focused departmental data mart could be delivering value in a matter of weeks with relatively little investment.  (Your actual mileage may vary- depending on where you're at, its always dangerous to believe someone when they say "a matter of weeks" when software and people are involved.)</p>
<p>Will that departmental data mart, or even a number of data marts lead you to a single version of the truth?  Will it give you deep competitive advantage through a culture of data analytics and cross enterprise master data management? In my honest opinion, No.</p>
<p>But is it something you can afford in today's economy, and will you learn things about your data, your company's information culture, and your business that will be useful if in the future you embark on a true data warehouse initiative.  Yes.  Yes it is, and yes you will.</p>
<p>And I'll take it one (blatantly promotional) step further.  Is a personal data mart on your desk top as good as a full fledged departmental data mart with an army of highly paid developers maintaining it?  Probably not.</p>
<p>Is the personal data mart on your desk basicly free in comparision to the servers, software and hired help the data mart requires?- Yes. And does it, just like the data mart does for the data warehouse, prepare the ground for the next evolution when the economy turns around? Yes. Yes it does.</p>
<p>In difficult times companies that are pragmatic, and do what is possible, preparing for the day when more will be, survive to see that day.</p>
<p>It seems obvious that doing nothing because you can't afford to do the best thing is a bad strategy- but we need to ask ourselves, how often do we make that exact choice through inaction?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.datamartist.com/a-cost-comparision-between-data-marts-and-a-data-warehouse/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Personal Data Marts: cost effective, powerful Analysis</title>
		<link>http://www.datamartist.com/personal-data-marts-cost-effective-powerful-analysis</link>
		<comments>http://www.datamartist.com/personal-data-marts-cost-effective-powerful-analysis#comments</comments>
		<pubDate>Sat, 10 Jan 2009 03:14:56 +0000</pubDate>
		<dc:creator>James Standen</dc:creator>
				<category><![CDATA[Business Intelligence Architecture]]></category>
		<category><![CDATA[Datamartist Tool]]></category>
		<category><![CDATA[Personal Data Marts]]></category>
		<category><![CDATA[Business Intelligence trends]]></category>
		<category><![CDATA[Personal data mart]]></category>

		<guid isPermaLink="false">http://www.datamartist.com/?p=668</guid>
		<description><![CDATA[Huge, expensive data warehouse projects often either deliver a huge compromise in terms of functionality, or are impossibly complex and expensive. A bunch of them fail outright. Why do so many people turn to Excel for their analysis needs? Because with excel they are in control.  With Excel it IS personal. Massive, expensive data warehouses [...]]]></description>
			<content:encoded><![CDATA[<p>Huge, expensive data warehouse projects often either deliver a huge compromise in terms of functionality, or are impossibly complex and expensive. A bunch of them fail outright.</p>
<p>Why do so many people turn to Excel for their analysis needs?  Because with excel they are in control.  With Excel it IS personal. Massive, expensive data warehouses are one size fits none. Excel is the ultimate in customisation. </p>
<blockquote><p>“Despite years of investing in BI, many IT organisations have difficulty connecting BI with the business, and to get business users fully involved and out of the ‘Excel culture’”<br />
- Bill Hostmann, Vice President and distinguished analyst at Gartner</p></blockquote>
<p> </p>
<p><strong>Excel isn't always enough</strong><br />
<a href="/wp-content/uploads/2009/01/excel-and-database1.jpg"><img class="alignright size-medium wp-image-683" title="excel-and-database1" src="/wp-content/uploads/2009/01/excel-and-database1-300x190.jpg" alt="" width="300" height="190" /></a></p>
<p>Many users frustrated by the limitations in Excel  in terms of data volume and functionality want more.  Some of them take up programming macros- others spend money they'd rather not on throw away database development.  But demand encourages supply, and new tools are beginning to emerge.</p>
<p>As <a href="http://datadoodle.com/2008/10/23/recessions-benefits-for-bi/" target="_blank">Ted Cuzzillo</a> put it so well in his <a href="http://www.tdwi.org/News/display.aspx?id=9262" target="_blank">trends for 2009 blog post</a> recently:</p>
<blockquote><p>“on business desktops, new, highly individualised tools will sprout.”</p></blockquote>
<p>The Datamartist vision is that of a user driven tool that provides the flexibility of a spreadsheet with the key functionality of a database without the database. </p>
<p>Personal data marts can for the first time be built directly on the desktop, without programming, yet handling millions of rows of data, and implementing truly useful data analysis.</p>
<p><a href="/wp-content/uploads/2009/01/datamartist.jpg"><img class="aligncenter size-medium wp-image-684" title="datamartist" src="/wp-content/uploads/2009/01/datamartist-300x123.jpg" alt="" width="300" height="123" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.datamartist.com/personal-data-marts-cost-effective-powerful-analysis/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
