Monday morning thoughts: dynamic language

qmacro · ‎10-22-2018

In this post, I think about a particular function definition in JavaScript, and how it represents beauty and practicality in equal measure.

On Friday evening, I tweeted:

Heading to the woodstore with a beer and cigar. I was reminded of this little gem just now. Lovely.
const push = (xs, x) => (_ => xs)(xs.push(x))

It was something I'd come across while idly perusing some source code, and I thought it was quite beautiful. I had a few responses to the tweet. In one of them, speri asked for an explanation, and then julie.plummer suggested I do it in the form of a post in this Monday morning thoughts series. This was a great idea, as I was already on my way to the woodstore.

The woodstore, where I read and think about things (and enjoy a beer or two).

The context

The source code in question is a utility program, reuseTableData.js, written by christian.georgi and my Language Ramblings partner in crime chris.whealy. It's part of the base repository SAP/cloud-sample-spaceflight, which contains the base Core Data & Services data model for the Application Programming Model sessions at SAP TechEd this year (see "Application Programming Model for SAP Cloud Platform - start here").

Why was I reading this? Well, out of curiosity, and a desire to learn more, of course. I have heard that programmers spend only around 10% of their time writing code, and 90% reading it. That sounds extreme, but I can imagine it being true in some circumstances. I'm guessing this covers not only reading code to work out what it does before extending or modifying it, but also reading code for pleasure, to learn how other people write.

I remember researching for my first book "Programming Jabber" in the early 2000s. I spent many hours in the local coffee shop reading through the source code of the reference implementation of the Jabber (now XMPP) protocol - the jabberd source code, written in C by Jeremie Miller and others. I learned a lot from it, and I enjoy reading other people's code now and then - not least to see how each author's character gently but inevitably leaks through into the code.

So back to the program in question. To add a little context, here's the line of code again, with the comment that accompanies it:

// A useful version of Array push that returns the modified array
// rather than the index of the newly added item...
const push = (arr, newEl) => (_ => arr)(arr.push(newEl))

In JavaScript, Array is a global object used to represent lists, and has a number of built-in functions, or methods, that operate on Array object instances.

For example, you can create an array and then use the Array object's join() function like this:

> const colours = ['red', 'green', 'blue']
undefined
> colours.join(' and ')
'red and green and blue'
>

(Code samples here are from Chrome console sessions, using ES6).

Another built-in function for the Array object is push(), one of a group of four functions that I think of as a family, as they do very similar things:

  shift() <---+           +---> pop()
              |           |
              |           |
             [1, 2, 3, 4, 5]
              ^           ^
              |           |
unshift() ----+           +---- push()

The functions shift() and unshift() operate on the front of an array, removing or adding elements respectively. The functions pop() and push() operate on the end of an array, removing or adding elements respectively.
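As a quick sketch of the whole family in action (the variable names here are mine, purely for illustration), this shows what each call does to the array and what it evaluates to:

```javascript
// Each call below mutates `nums` in place; the comments show the value returned.
const nums = [1, 2, 3, 4, 5]

const first = nums.shift()     // 1 (removed from the front) -> nums is [2, 3, 4, 5]
const lenA = nums.unshift(0)   // 5 (the new length)         -> nums is [0, 2, 3, 4, 5]
const last = nums.pop()        // 5 (removed from the end)   -> nums is [0, 2, 3, 4]
const lenB = nums.push(9)      // 5 (the new length)         -> nums is [0, 2, 3, 4, 9]

console.log(first, lenA, last, lenB, nums)
```

The removing functions hand back the element that was removed, while the adding functions hand back a number.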

There are also other functions for manipulating elements in other places in an array, functions such as splice() and slice(), but what we want to concentrate on right now is push() and its sibling unshift(). Both add one or more elements to the array. And in both cases, what's returned is the length of the new array. For example:

> const colours = ['red', 'green', 'blue']
undefined
> colours.push('yellow')
4
>

This is not unreasonable, but it's also less useful than you might think.

Practicality

In many situations, you'll want to create something (an array, a map, another higher level object) and manipulate it. One approach to this, which feels quite natural to many, is to use method chaining, which is effectively like saying: "do this, then that, then the other".

This is common in UI5, where standard controls, such as the Button control in the sap.m library, have methods, many of which return the Button instance upon which they're operating, explicitly to allow method chaining (see for example attachPress). So we can end up with a single statement in which, say, four methods are "chained" together.
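The idea can be sketched without UI5 at all. The class below is a hypothetical stand-in, not the real sap.m.Button, and its method names are chosen for illustration only. Each method returns the instance it was called on, which is what lets four calls be chained:

```javascript
// Hypothetical stand-in for a UI control; NOT the real sap.m.Button API.
class FakeButton {
  constructor() {
    this.props = {}
    this.handlers = []
  }
  // Every method returns `this`, which is what makes chaining possible
  setText(text) { this.props.text = text; return this }
  setType(type) { this.props.type = type; return this }
  setEnabled(on) { this.props.enabled = on; return this }
  attachPress(fn) { this.handlers.push(fn); return this }
  firePress() { this.handlers.forEach(fn => fn(this)); return this }
}

const pressed = []
const button = new FakeButton()
  .setText('Launch')
  .setType('Emphasized')
  .setEnabled(true)
  .attachPress(b => pressed.push(b.props.text))
```

If any one of those methods returned something other than the instance, the next call in the chain would fail.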

In a similar way, you might want to create a list of elements, rearrange it by some means, add something to the end of it, and then map over each of the final collection of elements to end up with what you really need. Like this, for example:

['red', 'white', 'blue']
	.push('orange')
	.sort()
	.map(x => x.toUpperCase())

You'd hope to get this:

["BLUE", "ORANGE", "RED", "WHITE"]

However, you'd end up with this:

Uncaught TypeError: ["red","white","blue"].push(...).sort is not a function
    at <anonymous>:3:3

This is because the push() function returns the new length of the array (4) rather than the array itself. Not ideal.
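Jumping ahead slightly: with the push wrapper from the tweet at the top of this post, which evaluates to the array itself, the same chain goes through. A minimal sketch:

```javascript
// The wrapper from the top of the post: pushes, then returns the array
const push = (arr, newEl) => (_ => arr)(arr.push(newEl))

const result = push(['red', 'white', 'blue'], 'orange')
  .sort()
  .map(x => x.toUpperCase())

console.log(result) // BLUE, ORANGE, RED, WHITE
```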

What's perhaps worse is that it makes it cumbersome to employ higher-order functions like reduce() in combination with the shorter ES6 arrow function style of function definition. With arrow functions, the function definition itself is much shorter and (without curly braces) there's an implicit return of whatever the body expression evaluates to.

If you're not familiar with using reduce() and taking advantage of the new ES6 arrow function style, here's a contrived example:

> [1,2,3,4,5].reduce((a, x) => a * x, 1)
120

Here, we multiply a list of numbers, and take advantage of the fact that the function definition being passed to reduce:

(a, x) => a * x

implicitly returns the result of the expression a * x, to be fed into the next element iteration until the list of elements is exhausted.
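To make that "fed into the next iteration" idea concrete, here's roughly what reduce() is doing for us, written out as a plain loop. This is only a sketch of the behaviour for this particular call, not how any engine implements reduce(), and the names are mine:

```javascript
// A hand-rolled equivalent of [1,2,3,4,5].reduce((a, x) => a * x, 1)
const multiply = (a, x) => a * x

let acc = 1 // the initial value passed as reduce's second argument
for (const x of [1, 2, 3, 4, 5]) {
  // Whatever the function returns becomes the accumulator for the next element
  acc = multiply(acc, x) // 1, 2, 6, 24, 120
}

console.log(acc) // 120
```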

In other words, if you want to use Array's push() function as the function definition passed in a reduce() scenario, you can't, or at least, what it evaluates to (the new length of the array) is almost certainly not what you want fed into the next iteration - you want the new (modified) array*.

*the keen readers amongst you will guess that I'm currently slightly uncomfortable at completely ignoring the fact that push() is mutating the array, which is generally a Bad Thing(tm) - but I'm ignoring it deliberately, as that's a whole other subject for another time.
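Here's a small sketch of that problem, with names of my own choosing. The first attempt passes the standard push() result straight back to reduce() and fails on the second element; the second attempt uses the wrapper and works:

```javascript
const numbers = [1, 2, 3]

let failed = false
try {
  // Iteration 1: a is [], and a.push(1) evaluates to 1 (the new length).
  // Iteration 2: a is now the number 1, which has no push method.
  numbers.reduce((a, x) => a.push(x), [])
} catch (e) {
  failed = e instanceof TypeError
}

// With a push that returns the array, the accumulator stays an array
const push = (arr, newEl) => (_ => arr)(arr.push(newEl))
const copy = numbers.reduce((a, x) => push(a, x), [])

console.log(failed, copy)
```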

If you read further on in the reuseTableData.js program, you'll see that there's a reduce() function being employed in line 73 to gather table names together:

var tableNames = _getTableDataSync(path.join('db/src/csv')).
  reduce(
    (accOuter, filePath) =>
      JSON.
        parse(fs.readFileSync(filePath)).
        imports.
        reduce((accInner, entry) => push(accInner, entry.target_table), accOuter)
  , [])

This use of

push(accInner, entry.target_table)

is not the standard push() function from the Array object. Rather, it's our push() function defined earlier, the subject of this post:

const push = (arr, newEl) => (_ => arr)(arr.push(newEl))
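To see the nested reduce() pattern in isolation, here's a sketch that swaps the file reading for in-memory data. The parsedFiles structure and the table names are invented for illustration; only the shape (objects with an imports list whose entries have a target_table) is taken from the code above:

```javascript
const push = (arr, newEl) => (_ => arr)(arr.push(newEl))

// Hypothetical stand-ins for the parsed contents of two JSON files
const parsedFiles = [
  { imports: [{ target_table: 'TABLE_A' }, { target_table: 'TABLE_B' }] },
  { imports: [{ target_table: 'TABLE_C' }] }
]

// The inner reduce starts from the outer accumulator and keeps pushing onto it,
// so all the names end up in one flat array
const tableNames = parsedFiles.reduce(
  (accOuter, file) =>
    file.imports.reduce((accInner, entry) => push(accInner, entry.target_table), accOuter),
  []
)

console.log(tableNames) // TABLE_A, TABLE_B, TABLE_C
```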

Now we understand the context of where it's used, and why the standard push() function is no good, let's dig in to this definition to see how it works, and why it can be used.

Beauty

So first off, we can see it's a function definition using the fat arrow (=>) syntax from ES6. We're now already somewhat familiar with that, but it still looks a little odd, with that strange looking underscore, and what initially looks like a slightly uncomfortable number of brackets.

Remember that contrived example using a reduce() function above? You can see that the function passed to reduce() is one with two parameters - the accumulator 'a', and 'x' to represent the elements that are passed in, one iteration at a time. So also here we have a function definition with two parameters, again, an accumulator 'arr' (array) and 'newEl' representing the elements passed to the function as reduce's list mechanics iterate over the array, one element at a time.

So what's the actual function definition here? Well, it's this bit:

(_ => arr)(arr.push(newEl))

The first part, (_ => arr) is yet another function definition. What the heck does it do?

Well, it takes one parameter, and promptly ignores it, simply returning the value of 'arr'. The use of the underscore for this ignored parameter is convention - it tells the reader "we're expecting a parameter to be passed to this function, but we're not actually interested in it". (It's often used where there's more than one parameter in a function definition signature, and the intention is to ignore one or more of them.)
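Two small illustrations of the underscore convention, with names made up for the purpose:

```javascript
// Only the index (the second parameter) is of interest; the element is ignored
const indices = ['red', 'green', 'blue'].map((_, i) => i)

// A function that accepts one argument, ignores it, and always returns the same value
const alwaysFortyTwo = _ => 42

console.log(indices)               // 0, 1, 2
console.log(alwaysFortyTwo('meh')) // 42
```

There's nothing special about the underscore as far as JavaScript is concerned; it's an ordinary identifier, and the meaning is purely a signal to the reader.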

The body of this function is simply 'arr' which returns whatever value 'arr' has. Which is what?

To answer that, we need to look at the other part of the definition. This is the 'arr.push(newEl)' bit, but we can't and shouldn't ignore all those brackets. Breaking those brackets down, we have a first pair surrounding '_ => arr', a second pair surrounding 'arr.push(newEl)' and the pair around 'newEl'.

We can be happy enough with this last pair, it's just a standard invocation of the push() function, where the value of 'newEl' is the new element being pushed onto the end of 'arr'.

So what about the other two pairs of brackets? Well, in JavaScript, you can define an anonymous function on the fly, and call it immediately. When you do this, you wrap the anonymous function definition in brackets:

(_ => arr)

and then call it, passing any arguments in brackets, as normal. In this case, there's one argument to pass (to match up with the single '_' parameter the function is expecting). This argument is the result of the evaluation of this:

arr.push(newEl)

which, as we know, will be the new length of the 'arr' array. But by the time this is evaluated, the 'arr' array will already have had 'newEl' added as a new last element, and so we don't have to worry about the value returned from this (the new length), as we're going to capture it in the '_' parameter of '(_ => arr)' and ignore it anyway. The sole purpose of the '(_ => arr)' function is to return the value of 'arr' -- which now has the value of 'newEl' on the end.
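If the one-liner still feels dense, it may help to see the same steps spelled out with intermediate names. This is just my own expansion for illustration (pushExpanded, returnArr and newLength are not in the original program):

```javascript
// Defining a function and calling it immediately, in its simplest form
const doubled = (x => x * 2)(21) // 42

// The same shape as the push definition, one step at a time
const pushExpanded = (arr, newEl) => {
  const returnArr = _ => arr        // the inner function: ignores its argument
  const newLength = arr.push(newEl) // mutates arr and evaluates to the new length
  return returnArr(newLength)       // the length goes in, is ignored, and arr comes back
}

const colours = ['red', 'green', 'blue']
const returned = pushExpanded(colours, 'yellow')

console.log(doubled)              // 42
console.log(returned === colours) // true: it's the very same array, now ending in 'yellow'
```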

The upshot of this definition of a custom push function is that we can use it to push an element onto the end of an array, and have it return the infinitely more useful modified array, rather than the almost useless new length of the array. We can then employ it in higher order function invocations such as those with reduce().

Beyond the ability to define anonymous functions (to use in higher order functions) and generally program in this dynamic way, the mechanic that allows this whole idea to work is called a closure.

const push = (arr, newEl) => (_ => arr)(arr.push(newEl))

The value of 'arr', initially passed to the outer function definition as the accumulator, by reduce's list mechanics, is available on the right hand side of the main fat arrow, both in the inner function definition as well as the on-the-fly call to push that is passed as the argument to that inner function.
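A closure is perhaps easiest to see in a smaller example first. In this sketch (makeAdder is a name I've picked for illustration), the inner function still has access to 'n' after the outer function has returned:

```javascript
// The inner function "closes over" n from the enclosing function
const makeAdder = n => x => x + n

const addTen = makeAdder(10)
console.log(addTen(5)) // 15

// In the same way, (_ => arr) closes over the outer function's arr parameter
const push = (arr, newEl) => (_ => arr)(arr.push(newEl))
const list = push([], 'first')
console.log(list) // an array containing just 'first'
```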

Dynamic programming in general, and this style of extension in particular, is a wonderful thing, and I consider this particular definition of push to be quite beautiful.

There are alternatives, of course - my friend Martin Rue suggested this:

const push = (xs, x) => xs.push(x) && xs

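which achieves the same effect in a different way. The call to xs.push(x) evaluates to the new length of the array, which is always at least 1 and therefore truthy, so the && operator goes on to evaluate its right hand operand, and that becomes the value of the whole expression. As the function body is a single expression (again, without curly braces), that value is what's implicitly returned. In this case, that is simply 'xs'*. Also very elegant.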

*I note that Martin used 'xs' to represent a list of 'x' elements, which is a rather nice meme, or at least an idea, that I picked up from various functional programming treatises and that is used, for example, by Erik Meijer in his wonderful series on Haskell and F#. Perhaps more on that another time.
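A short sketch of how the && in Martin's version behaves (the extra variable names are mine):

```javascript
const push = (xs, x) => xs.push(x) && xs

// && evaluates to its right hand operand whenever the left hand one is truthy,
// and to the left hand operand otherwise
const truthyCase = 1 && 'right' // 'right'
const falsyCase = 0 && 'right'  // 0

// push() on an array returns a length of at least 1, so we always get the array back
const letters = push(['a', 'b'], 'c')

console.log(truthyCase, falsyCase, letters)
```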

So, here's a hat tip to Chris Whealy who I'm guessing had some influence on the distinctive style of this reuseTableData.js code. I know Chris and have started to recognise his 'signature' in code. And I'll leave you with a recommendation to study other people's code every now and then. It can be a lot of fun, and educational to boot. Happy reading!

This post was brought to you by a cold Monday morning, before setting off to the airport for Barcelona and the European instance of SAP TechEd. If you're there, stop by the Developer Garage for a chat about this or anything else!

Read more posts in this series here: Monday morning thoughts.

former_member181883 · ‎10-22-2018

LOL!

“I know Chris and have started to recognise his ‘signature’ in code.”

I guess I code with a strong accent....


Chris W

qmacro · ‎10-22-2018

I had a summer job at IBM in 1985 (between my first and second year at Uni) and was tasked with documenting a system that had been written in CLIST on one of the mainframes (running MVS) that the department used to run its affairs. It had been the work of a team of people, and over the weeks I worked on the project, I got to be able to recognise which bits of code were from which people. It was then I formed the idea about how someone's character will naturally leak into code. Happy memories.

former_member181883 · ‎10-22-2018

Ah, CLISTs...

Another one of my former specialist topics

The very first “system” of any size or usefulness I wrote was a Change Management system that used ISPF panels for the user interface.

That was 30+ years ago now...


qmacro · ‎10-22-2018

Gosh, that takes me back. Working at Deutsche Telekom on a massive R/2 installation in the early 1990s, I was on a team that built a system to create, destroy, and recreate SAP R/2 systems (and the massive array of VSAM datasets that went with them, for the DB layer) … and that was also implemented using ISPF panels and a combination of CLIST and Rexx. I feel a combination of beer and reminiscing coming on this week in Barcelona … maybe grey hair is a prerequisite

speri · ‎10-23-2018

Great post DJ! This makes your tweet much clearer.

qmacro · ‎10-23-2018

Glad it helped! 🙂

nabheetscn · ‎10-23-2018

Wow, super post, it made my Tuesday, apart from the #SAPTechEd keynotes! I saw this tweet and, like others, was confused about what the heck it was :) This one post covers so many things: chaining, anonymous functions, closures and what not. Superb. Arrays in JavaScript never stop amazing me. For example, just today I discovered the filter function, which I did not know about. Earlier I was writing meaningless loops, and now it's all one-line stuff. I'm amazed by the endless possibilities with arrays.

Nabheet

qmacro · ‎10-23-2018

Thanks Nabheet! Glad you enjoyed the keynote - I did too.

BTW, you may have seen this already, if not ... I wrote a short document "Programming in a more functional style" which covers some of the stuff you mention above, including filter:

Programming in a more functional style

Let me know what you think. Cheers!

nabheetscn · ‎10-23-2018

You know what, I learnt to use map from one of your web sheets. In fact, I think I had read about filter too, but forgot. Keeping track of too many things is also a challenge I am facing. How do we remember what we have read? Any tips from your experience?

Nabheet

qmacro · ‎10-24-2018

I'm afraid I struggle like anyone else. I have to rely on the fact that if I find it interesting, I remember it. Otherwise I don't, but I can at least remember some vague notion of having learnt about it, and can usually look up my notes or re-Google for the details 🙂

mmcisme1 · ‎10-24-2018

I'm glad I'm not the only one that can spend 90% of my day reading code. Yes, it's an exaggeration sometimes. But sometimes, it is a struggle.

I really like the new syntax. I really hate the new syntax. It's easy to use, but it makes it harder to read depending on how the variables and/or comments are written. Comments? This is self-documenting code, right? Sometimes those are so very nice to have.

Also, my time is normally split 20/60/20 for old code: 20 coding, 60 reading the code, 20 user requirements. For new projects it's 80/20: 80 requirements, 20 code. And for the VERY old code, sometimes it's 95 reading and 5 coding. And of course testing is part of coding for me.

I hope you are having a great Teched! Such fun I am jealous.

qmacro · ‎10-24-2018

The new syntax, shorter, yes. I’m musing whether that means, with smaller, succinct and pure functions, the requirement to document is reduced (self-documenting code works better when the code is in small pieces and is built and tested separately as small building blocks). I dunno – I’m half convinced.

TechEd is fun, yes – lots of learning and sharing. Thanks for the comments, as always!

mmcisme1 · ‎10-24-2018

The requirement documentation should be reduced. But the comments - I don't think so. While slowly moving to the new syntax, it would be nicer to fellow programmers to comment.

Small pieces? It's slowly driving me insane, debugging some of SAP code. Sometimes it is one statement - that statement sends you to another method. Really? Yes, really.

qmacro · ‎10-24-2018

If the methods (or functions) are well-defined and understood (and ideally small and pure) then that's a good thing, in my view. 🙂

mmcisme1 · ‎10-24-2018

Crazy person. OK, I'll get used to it.

julieplummer20 · ‎10-25-2018

Phew. Have just learnt so much. Never again will I be intimidated by an underscore. Thank you, DJ. I hope you had a good TechEd.

qmacro · ‎10-27-2018

Thanks Julie. Love that thought about intimidation and underscores 🙂 And yes, TechEd was great, thank you. Cheers!
