Mean and median filters bad

From BanghamLab
Jump to navigation Jump to search
Siv1-meanmedian.png</wikiflv>


OK its a crap Figure. I spent ages getting the trace above each new one to show the moving window used to compute the output of each stage.

Left panel: running mean. Stage (scale) 1 window of 3. Stage 2 window of 5 on the previous stage. And so forth. Each stage simplifies the signal until, at the bottom, it fades away. BUT it does not preserve scale-space and that is bad.

Right panel: running median. Median filters had many followers. It was thought that they preserved the edges in a meaningful way. Stage (scale) 1 window of 3. The filter removes extrema of length (scale) 1. Stage 2 window of 5 on the previous stage and this filter removes extrema of scale 2. And so forth. Each stage simplifies the signal until, at the bottom, it fades away. It is tempting to imagine that extrema at each scale are removed in a nice regular way BUT no, it all falls apart at larger scales it does not preserve scale-space and that is bad.

So here is a gripe. When I first sent a paper on all to a reputable journal I wrote just that ('... falls apart ...') and the paper was all about how to fix it - sieves. The reviewer rejected the paper because 'everyone knows that median filters aren't useful so how could a stack of them be any better'. Well - that's exactly what the paper was about. Oh well, get over it. Write it more clearly, etc. etc. So here is the paper. The implementation is very literal and slow. Each stage is run and re-run to idempotency. I called the filter a data-sieve. The twists in this story is firstly to switch to multiple pass median filters(Bangham, 1993)<ref>Bangham, J. Andrew, "Properties of a Series of Nested Median Filters, Namely the Data Sieve," IEEE Trans Sig. Process. Vol. 41. NO. I. Jan 1993</ref>, then being excited enough to look for something faster/better: 'recursive median filters''(Bangham, Chardaire et. al. 1996)<ref>Bangham, J. Andrew, "Multiscale nonlinear decomposition: the sieve decomposition theorem" IEEE Trans Pat. Anal. Mach. Intelligence. Vol. 18. NO. 5. Jan 1996</ref> and sieves. Each twist was patented. The first was (I think) in 1988. Patents are difficult to produce, and enforce. I was supported in this by Cambridge Consultants Limited (CCL) who had taken over the company. Fantastic engineers but the vision community had not (or we had not noticed) at that time appreciated the coming possibilities of SIFT. Scale-invariant feature transform (or SIFT) is an algorithm in computer vision to detect and describe local features in images. The algorithm [1] was published by David Lowe in 1999.[1]

References

<references />