slur - e

master 1Branch 0Tags

Clone

HTTPS

SSH

Open with VS Code

Diffstat (limited to 'src/iter.rs')

-rwxr-xr-x

src/iter.rs

258

1 files changed, 258 insertions, 0 deletions

diff --git a/src/iter.rs b/src/iter.rs
new file mode 100755
index 0000000..7fc7e38
--- /dev/null
+++ b/src/iter.rs

@@ -0,0 +1,258 @@

+//! This module implements the [`StackBlur`] struct and the improved version of

+//! the Stackblur algorithm.

+//!

+//! ## The improved Stackblur algorithm

+//!

+//! As previously stated, this crate implements a modified version of Stackblur,

+//! and understanding the original algorithm is key to understanding the

+//! improvements that have been made to it.

+//!

+//! The original Stackblur is essentially a weighted box-blur. Instead of taking

+//! the average of a flat array of pixels, like this:

+//!

+//! ```text

+//! ( 00 + 01 + 02 + 03 + 04 + 05 + 06 ) / 7

+//! ```

+//!

+//! it takes a weighted average of the pixels:

+//!

+//! ```text

+//! ( 03 +

+//! 02 + 03 + 04 +

+//! 01 + 02 + 03 + 04 + 05 +

+//! 00 + 01 + 02 + 03 + 04 + 05 + 06 ) / 16

+//! ```

+//!

+//! This is a rough approximation of a Gaussian blur, and in fact it's already

+//! most of the way there to being a complete algorithm on its own. You can just

+//! multiply each pixel by something like `radius + 1 - center_dist` when you

+//! make your sum, then divide by `radius * (radius + 2) + 1`.

+//!

+//! But there are two problems with this:

+//!

+//! 1. That would be *O*(n(m * 2 + 1)²) complexity, where `n` is the number of

+//! pixels in the image, and `m` is the radius of the blur. This is basically

+//! just as expensive as running a proper convolution filter.

+//!

+//! 2. How do we handle pixels off the edge of the image?

+//!

+//! I'm scared of #1 so I'm going to handle #2 first. What most implementations

+//! choose to do is just repeat the edge of the image:

+//!

+//! ```text

+//! ( 00 +

+//! 00 + 00 + 01 +

+//! 00 + 00 + 00 + 01 + 02 +

+//! 00 + 00 + 00 + 00 + 01 + 02 + 03 ) / 16

+//! ```

+//!

+//! But this creates even more problems, because the edge of the blur will be

+//! quite biased towards the pixels that aren't even in the image. This is known

+//! as "edge bleeding", where a single pixel at the edge of a blur can cause

+//! very large and ugly artifacts to show up.

+//!

+//! The solution, of course, is to not calculate the denominator using that

+//! equation from earlier, and instead incrementally update the denominator as

+//! pixels are scanned, allowing you to sum a varying number of pixels:

+//!

+//! ```text

+//! ( 00 +

+//! 00 + 01 +

+//! 00 + 01 + 02 +

+//! 00 + 01 + 02 + 03 ) / 10

+//!

+//! ( 01 +

+//! 00 + 01 + 02 +

+//! 00 + 01 + 02 + 03 +

+//! 00 + 01 + 02 + 03 + 04 ) / 13

+//!

+//! ( 02 +

+//! 01 + 02 + 03 +

+//! 00 + 01 + 02 + 03 + 04 +

+//! 00 + 01 + 02 + 03 + 04 + 05 ) / 15

+//!

+//! ( 03 +

+//! 02 + 03 + 04 +

+//! 01 + 02 + 03 + 04 + 05 +

+//! 00 + 01 + 02 + 03 + 04 + 05 + 06 ) / 16

+//! ```

+//!

+//! This is one of the improvements made to the Stackblur algorithm that is

+//! implemented by this crate.

+//!

+//! Now for #1 - the complexity problem. It is possible to make a streaming

+//! Stackblur that does not need to read any pixels out of order or more than

+//! once, or even know how long the input is. In fact, that's exactly what this

+//! crate does.

+//!

+//! First you fill up the cache with `radius + 1` pixels in order to be able to

+//! make the first sum, then you start producing results until you run out of

+//! pixels, then you produce the last `radius` results using what you have in

+//! cache. I don't have the skill to explain the algorithm in full detail, but

+//! it's open-source so you can look at it if you want.

+//!

+//! In this crate the "cache" is not actually the actual heap of values, as that

+//! would be too slow. Instead, the "cache" is a list of changes to make to the

+//! rate of change of the sum, and the denominator is updated incrementally in

+//! response to the number of leading and trailing values changing.

+//!

+//! The reason the whole rate of change thing exists is so that adding to the

+//! sum and registering it in the cache becomes *O*(1) instead of *O*(2n+1)

+//! (where `n` is the radius). It's basically the most important thing that

+//! makes the algorithm constant-time.

+use std::collections::VecDeque;

+use crate::traits::StackBlurrable;

+/// An iterator that implements an improved Stackblur algorithm.

+///

+/// For any [`StackBlurrable`] element `T` and any iterator `I` over items of

+/// type `T`, [`StackBlur`] will yield the same amount of elements as `I`,

+/// blurred together according to the improved Stackblur algorithm as described

+/// by the [`iter`][crate::iter] module documentation.

+///

+/// ## Usage

+///

+/// [`StackBlur`] just wraps another iterator with a radius and a cache (called

+/// `ops` by [`StackBlur::new`]):

+///

+/// ```compile_fail

+/// # use std::collections::VecDeque;

+/// # use std::num::Wrapping;

+/// # use stackblur_iter::iter::StackBlur;

+/// #

+/// let arr = [255u8, 0, 0, 0, 127, 0, 0, 0, 255u8];

+/// let iter = arr.iter().copied().map(|i| Wrapping(i as usize));

+/// let blur = StackBlur::new(iter, 2, VecDeque::new());

+/// ```

+///

+/// That example unfortunately doesn't compile because `Wrapping<usize>` doesn't

+/// implement `Mul<usize>` and `Div<usize>`, which are required for the

+/// averaging that [`StackBlur`] performs. It's recommended to create a newtype

+/// wrapper around whatever you plan on blurring, and implement all the traits

+/// required by [`StackBlurrable`].

+///

+/// A [`StackBlur`] always yields exactly as many items as its inner iterator

+/// does. Additionally, a non-fused iterator which repeats will cause the

+/// [`StackBlur`] to repeat as well.

+pub struct StackBlur<'a, T: StackBlurrable, I: Iterator<Item = T>> {

+ iter: I,

+ radius: usize,

+ sum: T,

+ rate: T,

+ dnom: usize,

+ ops: &'a mut VecDeque<T>,

+ leading: usize,

+ trailing: usize,

+ done: bool,

+ first: bool,

+impl<'a, T: StackBlurrable, I: Iterator<Item = T>> StackBlur<'a, T, I> {

+ /// Creates a new [`StackBlur`] from the provided iterator, radius, and

+ /// [`VecDeque`].

+ ///

+ /// The iterator is not advanced until a call to [`StackBlur::next`].

+ pub fn new(iter: I, radius: usize, ops: &'a mut VecDeque<T>) -> Self {

+ Self {

+ iter,

+ radius,

+ sum: T::default(),

+ rate: T::default(),

+ dnom: 0,

+ leading: 0,

+ trailing: 0,

+ ops,

+ done: true,

+ first: true,

+ }

+ fn init(&mut self) {

+ self.done = false;

+ for sub in 0..=self.radius {

+ let Some(item) = self.iter.next() else { break };

+ if sub == 0 {

+ let start = self.radius + 1;

+ let needed = start * 2 + 2;

+ self.ops.reserve(needed.saturating_sub(self.ops.capacity()));

+ self.ops

+ .iter_mut()

+ .take(start)

+ .for_each(|place| *place = T::default());

+ self.ops.resize_with(start, T::default);

+ self.sum = T::default();

+ self.rate = T::default();

+ self.dnom = 0;

+ self.leading = 0;

+ self.trailing = 0;

+ }

+ let mul = self.radius + 1 - sub;

+ self.sum += item * mul;

+ self.rate += item;

+ self.dnom += mul;

+ if self.dnom > mul {

+ self.trailing += 1;

+ }

+ self.ops[sub] -= item * 2;

+ self.ops.push_back(item);

+ }

+ if self.dnom == 0 {

+ self.done = true;

+ }

+impl<T: StackBlurrable, I: Iterator<Item = T>> Iterator for StackBlur<'_, T, I> {

+ type Item = T;

+ #[inline]

+ fn next(&mut self) -> Option<Self::Item> {

+ if self.done {

+ self.init();

+ if !std::mem::replace(&mut self.first, false) {

+ return None;

+ }

+ let result = self.sum / self.dnom;

+ self.rate += self.ops.pop_front().unwrap();

+ self.sum += self.rate;

+ if self.leading < self.radius {

+ self.leading += 1;

+ self.dnom += self.radius + 1 - self.leading;

+ }

+ if self.radius == 0 || self.trailing == self.radius {

+ if let Some(item) = self.iter.next() {

+ self.sum += item;

+ self.rate += item;

+ self.ops[self.radius] -= item * 2;

+ self.ops.push_back(item);

+ } else if self.radius > 0 {

+ self.dnom -= self.radius + 1 - self.trailing;

+ self.trailing -= 1;

+ } else {

+ self.done = true;

+ }

+ } else if self.trailing > 0 {

+ self.dnom -= self.radius + 1 - self.trailing;

+ self.trailing -= 1;

+ } else if self.trailing == 0 {

+ self.done = true;

+ }

+ Some(result)

+ }