Data Te Ching, ch 4
The cost of no data may be high
The cost of some data may be low
But the cost of a confirmed assumption
is the highest cost of all
When we seek patterns
we find them
it is what we do
Normalized data helps
but what does it help?
Conformance
Well-formed queries help
but what do they help?
Ignorance
When answers to questions come
they are valid in the data
even when the questions
are the wrong ones to ask
Surface the exceptions
Embrace the edge cases
Contradict the biases
Then simplify the data
When we don’t know
the cost we pay
we always spend
more than we can afford