The README sounds like it's describing Python's venerable iterator protocol. The example code looks like higher order functions and iterator composers from the standard functools and itertools modules. It should probably explain how this is different from those if the audience is Python programmers.
Transducers arise out of realizing that transformations between (possibly infinite) lists can be encoded as a "reducer transformations", "transducers". When you work in this encoding you automatically get lazy transformation (since everything goes through a function call) and the avoidance of intermediate data structures (since everything goes through a function call!).
Moreover, there's a neat property of this encoding, a CPS encoding, which is that function composition still "works", it just works "backwards".
Yes, although transducers are more general than iterators. Iterators model lazy series from which you must 'pull' elements. Transducers can also be used with elements that are 'pushed', such as events coming from channels, or whatever. Essentially they decouple the transformation from the mechanisms by which input elements arrive and result elements are received, whilst still allowing separate transformations to be composed into more complex composite transformations.
You could use the python-transducer library <https://github.com/sixty-north/python-transducers> I mentioned in my other comments to do this. It supports using the same transducers with iterables and push-event coroutine generators.
Exactly. This is a lib created by someone that didn't get, at all, the point of iteration in Python. We have lazy coroutine exactly for that. It's embeded in the language. You got itertools, sum, any, all et comprehensions to use them.
Iteration is everything in Python. It's the core philosophy.
We already got tools to do this in better, lighter way.
"Stateful" doesn't mean "bad". I know functional programming is all the rage right now, but being stateless doesn't turn magically something into a better version of something stateful. If the stateful version is better integrated, have a heavier stateless version is not going to make your code better.
This code by Cognitect is a fairly literal port of Clojure transducers to Python. If you'd like to see a more Pythonic interpretation of transducers, take a look at Sixty-North's alternative https://github.com/sixty-north/python-transducers
Transformations created with premap are stateless, they depend only on the current input. So one can't build UTF8 decoders with them, for example. That said, if you supply a function to premap, the result typechecks as a "Transduction".
The idea is that you can define a transformation separate from its use and then reuse it in many contexts. Some of the win sits in decoupling what you are doing from when and where you are doing it.
You can transform map or filter into a transducer by decoupling the transformation from the data. When you call map, you usually call it on some data, or some stream. When you create a transducer, and you call for example (map inc), you're saying "whenever this transducer is applied, on whatever it is applied to, increase each unit by one".
You can lug around the transformation itself as if it were an object, almost, separate from the data it's meant to transform.
A key component to transducers is to realize that map, filter, take, can all be expressed as special cases of the reduce function.
So a curried map function can be expressed as a reduction operation, for example, in python you could do..
# map that adds 1 to each element
# same as curried map(sub1)([1,2,3])
sub1 = lambda x: x - 1
map(sub1, [1,2,3])
# same outcome using reduce
reduce(lambda acc, x: acc + [sub1(x)], [1,2,3], [])
Notice that when writing a map operation using the reduce function, we've put two pieces of logic on each step. (1) we have a transformation called sub1, and (2) we have a step operation (returning an array with the transformed x appended to it).
We can use this same logic to create filters with an if statement..
A critical thing to notice in each of these cases is that if we ran a curried filter and map (with transformation / filtering functions already passed), then they would loop over the data twice. However, if you look at the example on this github repo, the want to
* take 3 elements from the sequence
* perform a transformation (map operation) to those elements
Notice that in this case take is being performed before the map, but you could imagine doing something like
* map items from geometric series to float
* filter items that are greater than .15
* take 3 of the remaining items
How would you do that with map and filter functions without looping over the entire (inexhaustible) sequence?
This is the beauty of transformers and transducers, they provide a composable approach to filtering, transforming, and taking elements from a (possibly infinite) sequence.
Of course, all of this could be done in a for loop, but often it introduces complexity..
# assume geom_series is the generator from the github example
take = 3
for el in geom_series:
if float(el) < .15:
result.append(el)
if len(result) == take: break
However, this for loop is using the same ideas: we have a transformation / if (predicate) operation, and step operation to append the data we want. The problem transducers try to solve is writing a series of these types of operations compactly and cleary as a pipeline of functions (erm, transformers), so those functions can be replaced/moved around quickly, and the pipelines can be combined together simply.
A good article that covers tranducers in javascript (sorry!) is here:
Python's map and filter already run over iterators. If you're talking about creating filters that are composable, so that you could do
filter1 = filter(predicate1) # curried filter
filter2 = filter(predicate2) # curried filter
pipeline = compose(filter1, filter2)
pipeline(<generator of some kind>)
and make it so that the sequence of operations will be
for el in some_generator:
if predicate1(el) and predicate2(el): <accumulate value>
Then I would say defining them in this way is useful and important--transducers are exactly one way of doing this! A critical aspect I didn't go into detail on is the idea of a take function. In the github repo, T.take(3) is the portion that allows the transducer to operate of an infinite stream of values.
This is the piece your workflow example would need to take into account. How could I apply a filter followed by something that takes 3 passing values from an infinite sequence? I'm sure you could come up with a way, and it would be worth comparing to the transducer approach :).
(it's worth noting that currying map, filter, etc.. are very complimentary to the transducer approach)
Well, in Clojure they solve the problem that the standard functions are inefficient when you chain a bunch of list manipulations together (because they all return lazy lists) and neatly sidestep the fact that Clojure doesn't have generators.
That's almost literally what happens. (comp (map inc) (filter even?)) when applied takes all even numbers from a collection/channel/what-have-you and adds one to them. Filter and map normally take a collection as their last argument, but when none are supplied, a transducer is returned instead.