Tag archive for ‘Fixing Data’
-
Self Serve Business Intelligence
Self serve business intelligence dreams of letting everyone whip up any report or analysis they want. The reality is that it's often not the report that's the problem; it's the underlying data and model. So self serve business intelligence is a wonderful idea, but the problem is that it's not all about pretty [...]
-
Excel auto formatting is getting into your genes
We often give Excel our data and trust it to do the right thing. A link posted on MetaFilter today sparked some lively discussion amongst the crowd. The Excel auto formatting "feature" loves to scramble common genetic nomenclature. It turns out that in the genetics field, common codes get converted to incorrect [...]
-
Duplicate Data and removing duplicate records
Duplicate records, doubles, redundant data, duplicate rows: whatever you call them, they are one of the biggest problems in any data analyst's life. There are lots of different types of data quality problems, but in this post I'll focus on duplicates. I'll share some hints on how to find and remove duplicate records, [...]



