Yeah, I struggle with that too; it's just not a very elegant implementation.
It might help to think about what happens under the hood, to better conceptualize it:
- All audio signals are 16-bit values.
- The "mixer" object simply adds the signals together.
- Any part of the signal which exceeds...
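
To make the points above concrete, here is a rough sketch in Python of what "just adding the signals" means. It assumes the samples are signed 16-bit (range -32768 to 32767), which the list doesn't actually say, and `mix` and `out_of_range` are made-up helper names, not anything from the real library. It only shows that a plain sum can land outside the 16-bit range; it doesn't model what the mixer then does with those samples.

```python
# Assumption: samples are signed 16-bit integers.
INT16_MIN, INT16_MAX = -32768, 32767


def mix(*channels):
    """Add the channels sample by sample, with no range handling at all."""
    return [sum(samples) for samples in zip(*channels)]


def out_of_range(samples):
    """Return the indices of samples that no longer fit in 16 bits."""
    return [i for i, s in enumerate(samples) if not INT16_MIN <= s <= INT16_MAX]


# Two hypothetical input signals, each valid 16-bit audio on its own.
a = [1000, 20000, -30000]
b = [2000, 20000, -10000]

mixed = mix(a, b)
print(mixed)                # [3000, 40000, -40000]
print(out_of_range(mixed))  # [1, 2]
```

So two inputs that are each fine on their own can still add up to something that doesn't fit, which is why the input levels matter so much.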