<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Datamartist.com &#187; Data mart</title>
	<atom:link href="http://www.datamartist.com/tag/data-mart/feed" rel="self" type="application/rss+xml" />
	<link>http://www.datamartist.com</link>
	<description>Reduce cost with self serve data transformation</description>
	<lastBuildDate>Mon, 26 Jul 2010 18:33:50 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Build vs Buy the Allure of Out of the box business intelligence</title>
		<link>http://www.datamartist.com/build-vs-buy-the-pros-and-cons-of-premade-data-warehouses-and-data-marts</link>
		<comments>http://www.datamartist.com/build-vs-buy-the-pros-and-cons-of-premade-data-warehouses-and-data-marts#comments</comments>
		<pubDate>Thu, 28 May 2009 03:10:00 +0000</pubDate>
		<dc:creator>James Standen</dc:creator>
				<category><![CDATA[Business Intelligence Architecture]]></category>
		<category><![CDATA[Software in General]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Data mart]]></category>
		<category><![CDATA[ERP Projects]]></category>

		<guid isPermaLink="false">http://www.datamartist.com/?p=2296</guid>
		<description><![CDATA[The big software vendors claim they can sell you an "out of the box" data warehouse including reporting that installs onto your Oracle, Microsoft or SAP ERP package. Are these things really that easy? Is it actually cheaper to build it yourself? How do you determine if a given product is a good fit for [...]]]></description>
			<content:encoded><![CDATA[<p>The big software vendors claim they can sell you an "out of the box" data warehouse including reporting that installs onto your Oracle, Microsoft or SAP ERP package.  Are these things really that easy?  Is it actually cheaper to build it yourself? How do you determine if a given product is a good fit for your situation?</p>
<p>Here's the secret- If you answer yes to all of the following its probably a good idea to buy:</p>
<ul>
<li>You've configured your Enterprise Resource Planning (ERP) software <em>exactly</em> as the vendor who wrote the data warehouse or data mart expected you to.</li>
<li>You use ALL the modules of the ERP- and do not have any "best of breed" applications for any areas.</li>
<li>You are interested in the key performance indicators that they have included and they calculate them the same way your company does</li>
<li>Your end users like the layout, formating and color choices in the reports included and have promised not to ask for changes.</li>
</ul>
<p>In fact if all this is true, then you will probably save HUGE amounts of money by using a pre-built package.</p>
<p>But.  Lots of things tend to get in the way of that dream. More often then not it's not the pre-built package that needs to be focused on- it's what you have in your system and how you've configured it.</p>
<h2> ERP Customization</h2>
<p><img src="http://www.datamartist.com/wp-content/uploads/2009/05/erp-customization-well-a-little-bit1.jpg" alt="erp-customization-well-a-little-bit1" title="erp-customization-well-a-little-bit1" width="364" height="260" class="alignright size-full wp-image-2312" /><br />
Anyone who has been involved in a large ERP project knows that there is always at least a bit of customization.  If the ERP is a good fit to your industry, and the project is focused on avoiding customization and using all those "flex fields" and "category codes" as they were intended, then the pre-built data warehouse that is available for that ERP might be a very cost effective route.</p>
<p>But here's the rub;  in many ERP projects rather than change the existing processes (which by the way is often the justification for installing an ERP- process improvement), the business users find reasons to "keep things as they are"- and as a result, the ERP bends to the existing process rather than the other way round.</p>
<p>This means that new tables are created to hold information that normally would be in a different format in one of the standard ERP tables.  This means that that code in field "X" that normally would mean something is overloaded to mean three or four things.  This means that field "Y" that isn't needed by your company ends up being used for something completely different.</p>
<p>Well, when the pre-built data mart plugs into your ERP, it doesn't know anything about those tables you've added, and its assuming that the data is in the standard tables.  Its already configured its ETLs, and star schemas to use those two fields that you've high-jacked, and so all those reports and cubes don't make any sense without modifications.</p>
<p>To be fair, many of the pre-built data warehouse frameworks have evolved to have lots of flexibility and configurability in them- you can modify and map the ETL jobs provided to fit your data.  You can turn off KPIs so that they don't load in false data.</p>
<p>But the trick is the more you have to modify, map and configure, the more effort you spend.  The more you shut off, the less you have.  In the end you can find that you are always trying to fit your specific requirements into the general, vanilla requirements that came out of the box.  Often, this configuration is more effort and demands more compromises than just building what you need.</p>
<h2>Non ERP transactional systems</h2>
<p>In some ERP implementations, certain modules of the ERP are not used and instead a third party software provides the functionality.  Often, key information is then moved in and out of the ERP as required for transactions via an enterprise application integration (EAI) tool.  In many cases, the reason the additional system is used is because it has more functionality (and therefore stores more detailed and varied information) than the ERP module.  As a result, the EAI tool cannot move all of the information into the ERP, only summaries and selected data.  This means that for the functional area that is treated by that module at least some of the detail users are interested in does not exist in the ERP.  And if it's not in the ERP, it won't be picked up by the pre-built solution.</p>
<h2>Diverse and specific report requirements</h2>
<p>Its amazing how the layout (column order, summarization, which key performance indicators where) of a report can be very very important to report consumers.  Its also amazing how much work can be required to modify the hundreds of reports and cubes that came with the pre-built system.  Report writers everywhere are nodding their heads. Enough said.</p>
<h2>Data Quality Issues</h2>
<p>Even if you've used all the tables and fields in the ERP exactly as needed, and the reports look just fine to your users, if they have not already been given access to all this information you will probably find that there is lots of data quality work to be done.  This cost is there for both the build and the buy options- just be aware of it, because it means the buy option is going to be more than just the sticker price.</p>
<h2>One ray of hope-  very specific industry solutions</h2>
<p>Hold on, the vendors say, but we have industry specific data warehouses and data marts- these are built just for your industry so they know your business.  We have one for banking data marts, another one for pharmaceutical data marts, you can get a sales data warehouse specifically for retail, and so on. </p>
<p>Without a doubt the more focused the pre-built data warehouse or data mart is the more likely it is that its going to save you time and money.</p>
<h2>Conclusion?  Often when something sounds too good to be true...</h2>
<p>The bottom line is, although you can tell I'm not head over heals for prebuilt datawarehousing, I think the field has evolved.  But then again, so have the prices.</p>
<p>The key is to be realistic regarding how much modification will be required now, and going forward, and how many of those hundreds of cubes and reports that are included are actually going to be applicable to your business.</p>
<p>It is possible that a package exists that will provide the best value.  Just be very very careful when you are comparing the costs- because its never quite as sweet a deal as it first seems.</p>
<p>Smaller, very focused pre-built data marts, particularly targeted to applications or modules that are used fully by an enterprise are probably much lower risk and more likely to provide real value than a massive "out of the box, install it in weeks not years" data warehouse.</p>
<p>My personal experience with pre-built products was that in the end, when the dust settled, we had touched almost every ETL job, and the reports that were the most used and useful were ones we had built from scratch, largely using data loaded out of custom ERP tables.</p>
<p>If you have customization in your ERP, you have to think hard about this.  The reason the custom tables were put in there was because they hold information that was important enough to justify the development.  This probably means that the information in them is going to be important to the business intelligence you are building.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.datamartist.com/build-vs-buy-the-pros-and-cons-of-premade-data-warehouses-and-data-marts/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A Cost comparision between Data Marts and a Data Warehouse</title>
		<link>http://www.datamartist.com/a-cost-comparision-between-data-marts-and-a-data-warehouse</link>
		<comments>http://www.datamartist.com/a-cost-comparision-between-data-marts-and-a-data-warehouse#comments</comments>
		<pubDate>Mon, 19 Jan 2009 02:19:51 +0000</pubDate>
		<dc:creator>James Standen</dc:creator>
				<category><![CDATA[Business Intelligence Architecture]]></category>
		<category><![CDATA[Cost Reduction]]></category>
		<category><![CDATA[Personal Data Marts]]></category>
		<category><![CDATA[Bill Inmon]]></category>
		<category><![CDATA[Data mart]]></category>
		<category><![CDATA[Data warehouse]]></category>
		<category><![CDATA[Personal data mart]]></category>

		<guid isPermaLink="false">http://www.datamartist.com/?p=712</guid>
		<description><![CDATA[I've noticed a fair bit of search traffic focusing on cost questions, particularly which is cheaper; a series of data marts or a single enterprise data warehouse.  I think it's a bit like the question of lease vs buy.  Starting off building a single departmental data mart will represent a much smaller cash flow out. [...]]]></description>
			<content:encoded><![CDATA[<p><a href="/wp-content/uploads/2009/01/data-warehouse-vs-data-mart-cost.jpg"><img class="alignright size-medium wp-image-715" title="data-warehouse-vs-data-mart-cost" src="/wp-content/uploads/2009/01/data-warehouse-vs-data-mart-cost-300x272.jpg" alt="" width="300" height="272" /></a>I've noticed a fair bit of search traffic focusing on cost questions, particularly which is cheaper; a series of data marts or a single enterprise data warehouse.  I think it's a bit like the question of lease vs buy.  Starting off building a single departmental data mart will represent a much smaller cash flow out.  But by the time you've built all the data marts, and then have to redo them all again to integrate between subject areas and departments, I'd have to say that I'm with Bill Inmon when he says no number of data marts add up to a data warehouse.</p>
<p>With data marts (just like leasing a car) you get behind the wheel quickly, and it gets you where you want to go in style.  And the monthly payment is something you can afford now.  However, long term, well, in three years you don't own it, and have paid a bundle.</p>
<p>But let's be realistic.  Just as having all the cash on hand to buy the car outright just might not be in the cards,  a true data warehouse might require a very significant outlay before anything comes out the other end, making it unaffordable.  A quick, focused departmental data mart could be delivering value in a matter of weeks with relatively little investment.  (Your actual mileage may vary- depending on where you're at, its always dangerous to believe someone when they say "a matter of weeks" when software and people are involved.)</p>
<p>Will that departmental data mart, or even a number of data marts lead you to a single version of the truth?  Will it give you deep competitive advantage through a culture of data analytics and cross enterprise master data management? In my honest opinion, No.</p>
<p>But is it something you can afford in today's economy, and will you learn things about your data, your company's information culture, and your business that will be useful if in the future you embark on a true data warehouse initiative.  Yes.  Yes it is, and yes you will.</p>
<p>And I'll take it one (blatantly promotional) step further.  Is a personal data mart on your desk top as good as a full fledged departmental data mart with an army of highly paid developers maintaining it?  Probably not.</p>
<p>Is the personal data mart on your desk basicly free in comparision to the servers, software and hired help the data mart requires?- Yes. And does it, just like the data mart does for the data warehouse, prepare the ground for the next evolution when the economy turns around? Yes. Yes it does.</p>
<p>In difficult times companies that are pragmatic, and do what is possible, preparing for the day when more will be, survive to see that day.</p>
<p>It seems obvious that doing nothing because you can't afford to do the best thing is a bad strategy- but we need to ask ourselves, how often do we make that exact choice through inaction?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.datamartist.com/a-cost-comparision-between-data-marts-and-a-data-warehouse/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Warehouse vs Data Mart</title>
		<link>http://www.datamartist.com/data-warehouse-vs-data-mart</link>
		<comments>http://www.datamartist.com/data-warehouse-vs-data-mart#comments</comments>
		<pubDate>Tue, 02 Dec 2008 17:20:00 +0000</pubDate>
		<dc:creator>James Standen</dc:creator>
				<category><![CDATA[Business Intelligence Architecture]]></category>
		<category><![CDATA[Data Modelling]]></category>
		<category><![CDATA[Datamartist Tool]]></category>
		<category><![CDATA[Personal Data Marts]]></category>
		<category><![CDATA[Bill Inmon]]></category>
		<category><![CDATA[Data mart]]></category>
		<category><![CDATA[Data warehouse]]></category>
		<category><![CDATA[Gartner]]></category>
		<category><![CDATA[Ralph Kimball]]></category>

		<guid isPermaLink="false">http://www.datamartist.com/?p=451</guid>
		<description><![CDATA[Very often, the question is asked- what's the difference between a data mart and a data warehouse- which of them do I need? Data warehouse or Data Mart? Data Warehouse: Holds multiple subject areas Holds very detailed information Works to integrate all data sources Does not necessarily use a dimensional model but feeds dimensional models. [...]]]></description>
			<content:encoded><![CDATA[<p>Very often, the question is asked- what's the difference between a data mart and a data warehouse- which of them do I need?</p>
<h2>Data warehouse or Data Mart?</h2>
<p><img src="/wp-content/uploads/2008/12/hello-im-a-datawarehouse.jpg" alt="hello-im-a-datawarehouse" title="hello-im-a-datawarehouse" width="275" height="184" class="alignright size-full wp-image-1513" /><br />
Data Warehouse:</p>
<ul>
<li>Holds multiple subject areas </li>
<li>Holds very detailed information</li>
<li>Works to integrate all data sources</li>
<li>Does not necessarily use a dimensional model but feeds dimensional models.</li>
</ul>
<p>Data Mart</p>
<ul>
<li>Often holds only one subject area- for example, Finance, or Sales</li>
<li>May hold more summarised data (although many hold full detail)</li>
<li>Concentrates on integrating information from a given subject area or set of source systems</li>
<li>Is built focused on a dimensional model using a star schema.</li>
</ul>
</p>
<h2> More Info about Data mart models</h2>
<ul>
<li><a href="/data-mart-data-modelling-101">Data mart Modelling 101</a></li>
<li><a href="/tag/data-mart-example">Data mart example</a></li>
<li><a href="/dimensional-tables-and-fact-tables">Dimension tables and Fact Tables</a></li>
</ul>
</p>
<h2> More Detail regarding Data Warehouse Vs Datamart:  and Inmon vs Kimball</h2>
<p>As the concept of decisional systems, and data warehouses and data marts evolved, two major points of view came into existence. There are two giants in this field.  Bill Inmon, and Ralph Kimball.  </p>
<p>There are some that argue the best approach is to start with data marts, department by department, then merge them together to form a data warehouse- this is more in line with Kimballs approach.</p>
<p>Now, Bill Inmon is an advocate of the Data warehouse.  Here's <a href="http://www.dmreview.com/dmdirect/19991120/1675-1.html" target="_blank">one of his articles</a>, which contains the following quote that makes it clear what he thinks about the idea:</p>
<p>"<em>You can catch all the minnows in the ocean and stack them together and they still do not make a whale,"</em> <strong>Bill Inmon, January 8, 1998.</strong></p>
<p><a href="http://www.rkimball.com/" target="_blank">Ralph Kimball</a>, on the other hand, advocates what he calls a bus architecture data warehouse.  His methodology specifies conformed dimensions, where multiple fact tables share common dimensional tables.  For me, each of these fact tables represents a data mart.  The row of dimensional tables that all the fact tables plug into is the bus,  and because, for example, the finance and the sales data marts both use the same product dimension table there is integration between departments.</p>
<p>The more extreme data mart strategy is that of the completely stand alone data mart, the concept being that its fast, easy, cheap, and delivers value immediately. I'm a supporter of this at the desktop level- thats the point of the Datamartist tool afterall.  But I don't buy this for server based architectures- what is really fast, easy and cheap when you have to buy servers, create a project and form a commitee?  In my mind if you've decided you need  a central server solution, some level of integration is needed, and don't pretend its going to be magic. </p>
<p><a href="/wp-content/uploads/2008/12/datawarehouse-and-datamart.jpg"><img class="alignright size-medium wp-image-492" title="datawarehouse-and-datamart" src="/wp-content/uploads/2008/12/datawarehouse-and-datamart-300x231.jpg" alt="" width="300" height="231" /></a></p>
<p>The interesting thing about these approaches, is that the harder you work on really conforming your dimensions, the more your data marts look like the data warehouse that Inmon advocates. (Data modellers in the know will be jumping up and down right now shouting NO they don't- but this is a high level conversation...)  But the reality is, even in a data warehouse, issues will arise that require compromise- things that just don't map or conform, and budget, schedule and business reality will mean that nothing is ever perfect, and in the end the world is full of data warehouses that are less conformed than some data mart clusters. Its just not simple.</p>
<p>Generally, it is probably true that data warehouses provide a solution that is closer to the "single version of the truth", but they do take a HUGE amount of effort, and an ability to coordinate across the entire organisation. If you have not already built at least half a dozen data marts, don't think you can estimate how much effort a data warehouse will take. You can't. And bring your cheque book.</p>
<p>Whereas data marts might deliver some value early, if built without sufficient effort on cross functional mapping and data cleanup they are just more silo systems and have their own set of costs and issues. Don't measure payback on datamarts in years-  nothing is the same in a few years, you'll be back to the drawing board shortly. </p>
<p>It's a real dilemma.  So which one? Data warehouse?  Data mart?  In my view, the right answer is "it depends" and "yes". However, never launch a data warehouse project as your first shot.  A successful strategy will balance the fast, pain point addressing solutions, with a medium and long term plan, and investment in infrastructure and competencies to build the technology, processes and culture that a company needs to manage information. Depending on your industry and how sucessful you are, a massive data warehouse might be in your future.  But sorry, no magic bullet.</p>
<h2>Build a multi-level data strategy</h2>
<ul>
<li>Level 1- Get the data to the people</li>
<li>Level 2- Build Departmental Data Marts</li>
<li>Level 3- Plan long term infrastructure and architecture</li>
</ul>
<p>Don't do these things in order- this isn't step 1, 2, 3-  actively work on all three levels at once and ensure the plans at each level are coordinated.</p>
<h3>Data to the People</h3>
<p>People are building spreadsheets, and spending money on data base development now- you know they are.  Give them better tools, help them better use the spreadsheets, and formalize the way they do.  The do-it-yourself exists- but it doesn't have to be completely informal.</p>
<p>The <a href="/product" target="_blank">Datamartist tool</a> is adding another capability that can accelerate the process-  letting you move more quickly, proto-typing and analysing to determine which areas are ready for additional analysis capability. </p>
<p> In some cases Datamartist itself might simply be the best choice for certain types of analysis, cutting costs dramatically. In addition, if your end users are building their own data marts, when it comes time to build server based data marts they know the concept, they understand the structure, and can even provide concrete examples of the dimensions they need.</p>
<p>But whatever you do- don't make the mistake of thinking this is all you need.  Work on all levels at once.</p>
<h3>Build Departmental Data marts</h3>
<p>If a whole department is flying blind, and big money is on the table, then don't launch a three year data warehouse project- create some departmental data marts. These projects should be designed to be 3-6 months long, and be sold to management honestly and clearly as being for short term gains, and as part of a broader discovery process.  The hardware and software licenses will be reusable- but be clear that the data marts will have a limited lifespan- they always do.</p>
<p>And trust me, when you build these data marts you will discover all sorts of things about your data, your organisation, and your definitions and business processes.  You will discover that the sales organisation needs to analyse product segments in a way that is fundamentally inconsistent with how finance has been reporting it for years- and neither group is willing to budge (and they may both be right).  You will discover that 80% of your sales orders have errors on them or are incomplete.  These discoveries will help you build the next data mart, and assess if a data warehouse is possible.  They should also send you back to your transactional system and business processes to work to clean up the problems.</p>
<h3>Build the infrastructure and deal with the foundation</h3>
<p>There are lots of pitfalls in creating a decisional architecture-  this <a href="http://www.gartner.com/it/page.jsp?id=774912&amp;goback=%2Ehom" target="_blank">short list of from Gartner</a> resonates with me- I've seen and battled these issues on project after project.</p>
<p>Set standards in terms of tools, project management etc.  Buy infrastructure for multiple projects, not on a project by project basis- don't have multiple servers when one with multiple data marts is radically cheaper.</p>
<p>And probably most important, I honestly think that you can build anything you want, any way you want, but it will not succeed if you don't have your definitions, both data and information, under control. (See Gartner issue #8)  In the end, it's not what language you speak, it's if you have a dictionary or not, and if everyone is using the same dictionary across your organisation.</p>
<p>You should have common reference data sets that are used by all levels. Datamartist can load in and use reference data that is coordinated with departmental data marts and the eventual warehouse. Make these data sets available to everyone- you'll be amazed that if they are easy to get and use, people will put them in their spreadsheets, and things might actually start matching up.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.datamartist.com/data-warehouse-vs-data-mart/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
