Julio Biason
6 years ago
2 changed files with 196 additions and 0 deletions
@ -0,0 +1,152 @@
|
||||
+++ |
||||
title = "Don't Diminish Types" |
||||
date = 2019-06-03 |
||||
categories = "code" |
||||
|
||||
[taxonomies] |
||||
tags = ["programming languages", "dynamic types", "types", "en-au"] |
||||
+++ |
||||
|
||||
In a previous life, I had a long discussion on why adding booleans was a bad |
||||
idea. And just recently one of the core Python developers suggested the same |
||||
thing -- adding booleans, that is. This is a long rant on why such things are |
||||
bad. |
||||
|
||||
<!-- more --> |
||||
|
||||
Once at some previous job, I blocked a code review in which the other developer |
||||
did something like |
||||
|
||||
```python |
||||
if boolean1 + boolean2: |
||||
do_thing() |
||||
``` |
||||
|
||||
Why this was bad, in my view? 'Cause it was reducing the booleans into simple |
||||
integers. While it is true that booleans are, internally, in Python, integers, |
||||
what they _represent_ is not integers, but actually true or false. People got |
||||
it -- or, at least, they say they got it, but the code changed, anyway -- and |
||||
live moved on. |
||||
|
||||
But, last month, I had to see Raymond Hettinger, one of the core Python |
||||
developers, post |
||||
[this on twitter](https://twitter.com/raymondh/status/1123950707273551878): |
||||
|
||||
> #python tip: The boolean values False and True are equal to zero and one. |
||||
> |
||||
> Replace: |
||||
> sum(1 for x in data if pred(x)) |
||||
> |
||||
> with: |
||||
> sum(pred(x)) |
||||
|
||||
Obviosly, what he meant was to use |
||||
|
||||
```python |
||||
sum(pred(x) for x in my_list) |
||||
``` |
||||
|
||||
... instead of |
||||
|
||||
```python |
||||
sum(1 if pref(x) else 0 for x in my_list) |
||||
``` |
||||
|
||||
Again, basing on the fact that Python uses integers behind the scenes to deal |
||||
with booleans. |
||||
|
||||
And, as you can guess, that really annoyed me. |
||||
|
||||
## What Are Types |
||||
|
||||
Let me explain, in a long rant, why "booleans are integers" is bad. |
||||
|
||||
Imagine the computer memory. Imagine one specific memory location, being used, |
||||
with this value: |
||||
|
||||
> 65 |
||||
|
||||
What does it mean? That's where languages and their types come in. |
||||
|
||||
For example, let's imagine that this location is being managed by a C program |
||||
and the program and this program marked this value as a `char`. While `char`s |
||||
in C represent 8 bit integers[^1], they have being for a long time used to |
||||
represent one character in a string -- a sequence of `char`s actually |
||||
represents a string[^2]. So, when other developers see `char`, they think "Ok, |
||||
this is the character with code 65" (which is "A", by the way). |
||||
|
||||
If the same code use `int`, other developers would think "Yeah, this is the |
||||
_number_ 65". |
||||
|
||||
And, just to screw things up, it could be an `enum`, in which the value |
||||
represents the 65th variant (element) of that enum. |
||||
|
||||
And that's the reason types exist: -- instead of, say, developers managing |
||||
memory directly and just changing their representation, like in Assembly -- |
||||
they provide consistent representation over the internal storage. |
||||
|
||||
## The Problem With Adding Booleans |
||||
|
||||
So, we saw that memory is just a bunch of bytes and what gives meaning for |
||||
those bytes -- in programming languages, that is -- are types[^3]. |
||||
|
||||
Now let's see another developer seeing the code above; they go from the top of |
||||
the code to the bottom, and reach the line of `sum(pred(x) for x in my_list`. |
||||
The first thought they come is that `sum` acts on numbers, so obviously `pred` |
||||
is a function that return numbers. But what number it represents? So they go |
||||
check `pref` and see it returning `True` or `False`. Now they have to trace |
||||
back and rethink what the line did, leaving them with [cognitive |
||||
dissonance](https://en.wikipedia.org/wiki/Cognitive_dissonance), which is a |
||||
clever way of saying "they have to rethink what they already though". |
||||
|
||||
And too many situations with cognitive dissonance is what makes code |
||||
"unreadable" -- the line above is still readable, but it doesn't actually |
||||
represent what it shows. |
||||
|
||||
## Respect Your Types |
||||
|
||||
Python is very loose with its type system[^4], but it doesn't mean one could |
||||
play "fast and furious" over it. Let's say that, at some point, Python |
||||
developers decide to change `True` and `False` from their integer roots to be |
||||
actually symbols -- things that simply "exist" and have no value[^5][^6]. Then |
||||
everyone that managed booleans as integers would see their code misbehaving or |
||||
crashing, simply because they didn't thread booleans _as booleans_. |
||||
|
||||
Now let's see the other option: `sum(1 if pred(x) else 0 for x in my_list)`. |
||||
This line is (a) longer and (b) slower due the branching during execution (the |
||||
`if`). But when you read something like this you see that there is a function |
||||
where its value isn't being checked, which probably means it returns a |
||||
boolean[^7]; if it is true, returns 1; if it is false, returns 0; and you're |
||||
actually doing a sum of ones and zeroes -- as numbers. |
||||
|
||||
No cognitive dissonance, no messing around and just because we treated types |
||||
as types. |
||||
|
||||
--- |
||||
|
||||
Footnotes: |
||||
|
||||
|
||||
[^1]: They could use more than 8 bits, depending on the architecture, and due |
||||
the fact that the C Standard is very flexible in this concept |
||||
|
||||
[^2]: It doesn't mean that every single `char` is a character in a string, it |
||||
could be used exactly as an 8 bit integer |
||||
|
||||
[^3]: I'm being very lose here about types, there is a lot more complex |
||||
context in them, but I'm going to stick with this "representation" for now. |
||||
|
||||
[^4]: Maybe nose as loose as C, which lets you "convert" a memory that |
||||
represents a float into a integer with no sign. |
||||
|
||||
[^5]: Or, better yet, that can have _any_ value and would still work. |
||||
|
||||
[^6]: As far as I know, Python standard already forced booleans to be |
||||
integers, so that will never happen, but let's add this for the sake of |
||||
discussion. |
||||
|
||||
[^7]: This is one of times I feel jealous of Clojurist, which can use `?` in |
||||
their functions and actually have a coding style that says that predicates |
||||
-- functions that either return True or False -- end with `?`; so not only |
||||
someone reading a piece of Clojure code seeing a `is_valid?` knows it |
||||
returns a boolean, it actually _reads_ like a boolean check. |
@ -0,0 +1,44 @@
|
||||
+++ |
||||
title = "Modern Java in Action - Raoul-Gabriel Urma" |
||||
date = 2019-02-08 |
||||
|
||||
[taxonomies] |
||||
tags = ["books", "en-au", "review", "java", "java 8", "java 9", "raoul-gabriel urma"] |
||||
+++ |
||||
|
||||
Java 8 Lambdas in Action is a clearly-written guide to Java 8 lambdas and |
||||
functional programming in Java. It begins with a practical introduction to the |
||||
structure and benefits of lambda expressions in real-world Java code. The book |
||||
then introduces the Stream API and shows how it can make collections-related |
||||
code radically easier to understand and maintain. Along the way, you'll |
||||
discover new FP-oriented design patterns with Java 8 for code reuse, code |
||||
readability, exception handling, data manipulation, and concurrency. For |
||||
developers also exploring other functional languages on the JVM, the book |
||||
concludes with a quick survey of useful functional features in Scala and |
||||
Clojure. |
||||
|
||||
<!-- more --> |
||||
|
||||
{{ stars(stars=2) }} |
||||
|
||||
To be short: It's a good book, but it is extremely (and unnecessary) verbose. |
||||
|
||||
It covers the new stuff on Java 8 (like streams and lambdas) and some of Java 9 |
||||
(like the Flow/reactive interface). It does a good job on Streams and Lambdas, |
||||
but seems to fall a bit short on the reactive interface, maybe 'cause it's just |
||||
an interface, although I found the examples a little bit missing in some |
||||
points, like showing the Three interfaces, Subscriber, Subscription and |
||||
Publisher, but showing examples without the Publisher. A bit worse (IMHO), is |
||||
that, in order to produce an example, instead of publishing some data, it shows |
||||
a generic publisher of sequential numbers, in which it calls the real |
||||
publisher. |
||||
|
||||
Also, the authors seem a bit too enthusiastic about lambdas. Even when the code |
||||
becomes less readable -- specially in the Flow examples -- they still use |
||||
lambdas. They are so into it that the example of a `for` being converted to a |
||||
stream is shown at least 4 times. |
||||
|
||||
There are too many "as follows"; there are too many "in the next section" just |
||||
before the next section; there is too much repetition that shouldn't be there. |
||||
|
||||
Again, the content is good, but the text is terrible. |
Loading…
Reference in new issue