Recursion: Difference between revisions

From JookWiki
(Save current page incarnation)
(→‎Language support: Add personal opinion)
(25 intermediate revisions by the same user not shown)
Line 1: Line 1:
Warning: This article is a work in progress! No refunds if it moves!
[[Category:Research]]
Recursion is a fantastic and often ignored feature of programming languages. Most introductions show an example you'd never use in practice, so this article is my attempt at showing some better ones using Lua.


TODO: This article is about tail calls
==Loops==
Recursion can create loops without language constructs.


- introduction/overview
Here's an infinite loop:
function infinite_loop()
  print("Hello there!")
  return infinite_loop()
end
infinite_loop()


- 'magic of recursion'?
This is a bit longer than a non-recursive example.


- lua
Here's a counting loop:
function count_down(number)
  if number == 0 then return end
  print(number)
  return count_down(number - 1)
end
count_down(100)


- lambdas, etc
A non-recursive version of this would likely use some kind of for or while loop.


== Function calls==
Here's a loop that asks a user to pick a valid choice:
Let's look at a hypothetical hello world program:
  function get_choice(choices)
  function main()
  local line = io.read()
   value = 0
   choice = choices[line]
   print("Hello world!")
  if choice then
   return value
    return choice
   else
    print("Invalid choice! Try again")
    return get_choice(choices)
   end
  end
  end
print("Select a letter to get a number: A, B, C")
choice = get_choice({A=1, B=2, C=3})
print("You picked number " .. choice)
Without recursion this code would look a lot more confusing, at least to me.


The main function does two things here:
==State machines==
Not all recursion has to be direct. Indirect recursion lets you represent state machines easily.


*Creates a variable with the value 0
Here's a tiny adventure game with the player choosing state transitions:
*Calls the print function with the value "Hello world!"
function dark_room()
* Returns with the value of the variable
  print("You are in a dark room.")
 
  print("Pick a door: fuzzy or metal")
Calling a function is fairly easy:
  choice = get_choice({fuzzy=1, metal=2})
 
  if choice == 1 then return fuzzy_room()
* You save your work and current location
  elseif choice == 2 then return metal_room()
* You set the function's arguments
  end
* You jump to the beginning of the function  
end
 
function fuzzy_room()
Returning from a function is just as easy:
  print("This room feels pretty fuzzy...")
 
  print("Pick a door: dark, metal")
* You restore the previous work
  choice = get_choice({dark=1, metal=2})
*You set the return value
  if choice == 1 then return dark_room()
* You jump to the previous location
  elseif choice == 2 then return metal_room()
This system works well in most cases.
  end
 
end
== Tail calls==
function metal_room()
 
  print("This room feels really metallic.")
== Tail call elimination ==
  print("Pick a door: dark, fuzzy or win")
What if we
  choice = get_choice({dark=1, fuzzy=2, win=3})
 
  if choice == 1 then return dark_room()
- we've been writing code as there's no stack
  elseif choice == 2 then return fuzzy_room()
 
  elseif choice == 3 then return metal_room()
- tail call elimination
  end
end
function win_room()
  print("You found the treasure!")
  return
end
dark_room()
Without recursion you'd likely need to put everything in a single function with a loop and state variable.


- NOT an optimization, how many optimizations decide which way you can program?
Some things just make more sense when implemented recursively, to me at least.
 
- hints deeper at function calls vs jumps
 
- structured programming, goto wars
 
==Loops==
- recursion can implement loops!
 
- implementing a for loop
 
- implementing a while loop
 
- implementing a do while loop
 
- each loop iteration only shares global and function args
 
==State machines==
- implementing a stateful algorithm


- some kind of menu system
== Tail call optimization ==
There is a caveat with recursive programs: Each function call takes up stack space. The deeper you recurse, the more likely you are to run out of stack space and crash your program. This makes recursion useless in most programming languages.


- the code makes sense to read
However there is a compromise: If a return in a function is just a call to another function then that return call is a 'tail call'. Languages that implement tail call optimization will re-use the current function call's stack for the function you're calling, solving the issue of stack space.


- this is mutual recursion
All the examples on this page use tail calls and run in Lua which implements tail call optimization. This means every program on this page is immune to stack overflows.


- very hard to do in a traditional structured language
The 1977 [https://en.wikisource.org/wiki/Index:AIM-443.djvu AI Lab Memo 443] talks more broadly about how tail calls are like goto statements that you can pass arguments to. Huge shout-out to the folks at Wikisource that transcribed this to an accessible text form.


==Lambdas==
The significant downside of tail call optimization is that it can make debugging more difficult as you lack a proper stack trace.
- lambdas to actually replace looping constructs/switch statements


- most mainstream languages support lambas
==Language support==
Despite languages slowly adding features from functional languages developed 40 years ago, tail call optimization is still unpopular. I'm guessing that the reason is because not many people see the use of recursion.


- recursion-based control flow
Here's an incomplete list of languages that support it automatically:


==Mainstream support==
* Haskell
- functional programming languages
* Erlang (and Elixir)
*Any Scheme implementation (Chez Scheme, Chibi Scheme, Chicken Scheme, TinyScheme)
*Lua (see [https://www.lua.org/pil/6.3.html Programming in Lua 6.3 - Proper Tail Calls])
*Steel Bank Common Lisp (See [http://www.sbcl.org/manual/#Debug-Tail-Recursion SBCL Debug Tail Recursion])
*Squirrel (See [http://squirrel-lang.org/squirreldoc/reference/language/threads.html Squirrel's Threads page])
*Racket (See [https://docs.racket-lang.org/guide/Lists__Iteration__and_Recursion.html#%28part._tail-recursion%29 The Racket Guide 2.3.3 - Tail Recursion])
Here's an incomplete list of languages that require explicit support:
* Clang C and C++ (see the [https://clang.llvm.org/docs/AttributeReference.html#musttail Clang musttail attribute])
* Tcl (see [https://www.tcl.tk/man/tcl/TclCmd/tailcall.html Tcl's tailcall manual page])
*OCaml (See [https://ocaml.org/manual/attributes.html OCaml's tailcall attribute])
* Perl (See [https://perldoc.perl.org/functions/goto Perl's goto function], specifically the goto &NAME variant)
*Unix (See [https://jeapostrophe.github.io/2012-05-28-exec-vs--post.html exec and Tail-call Optimization])
*Assembly (Instead of returning set up registers and jump)
*Ruby (See [https://nithinbekal.com/posts/ruby-tco/ Ruby's tailcall_optimization compile option])
*Zig (See [https://ziglang.org/documentation/master/#call Zig's always_tail call option])


- lua
Here's an incomplete list of popular languages that don't support it:


- clang mustcall
* C and C++
* Go
* Rust
* Swift
* PHP
*Python
*Raku
* Anything running on the JVM (Java, Clojure, Scala, Kotlin)
* Anything running on .NET (C#, F#)
* Anything running on WebAssembly
* Anything JavaScript or transpiling to JavaScript (TypeScript, CoffeeScript)
Things look decent for desktops, but not so much for phones or web browsers.


- webassembly
Personally I lean towards the idea of languages adding new keywords or explicit support for this, such as a 'goto' or 'jump' keyword. It helps in a debugger when you have stack frames by default, and it helps make it clear that it's important that this tail call is optimized.

Revision as of 06:29, 13 October 2022

Recursion is a fantastic and often ignored feature of programming languages. Most introductions show an example you'd never use in practice, so this article is my attempt at showing some better ones using Lua.

Loops

Recursion can create loops without language constructs.

Here's an infinite loop:

function infinite_loop()
  print("Hello there!")
  return infinite_loop()
end
infinite_loop()

This is a bit longer than a non-recursive example.

Here's a counting loop:

function count_down(number)
  if number == 0 then return end
  print(number)
  return count_down(number - 1)
end
count_down(100)

A non-recursive version of this would likely use some kind of for or while loop.

Here's a loop that asks a user to pick a valid choice:

function get_choice(choices)
  local line = io.read()
  choice = choices[line]
  if choice then
    return choice
  else
    print("Invalid choice! Try again")
    return get_choice(choices)
  end
end
print("Select a letter to get a number: A, B, C")
choice = get_choice({A=1, B=2, C=3})
print("You picked number " .. choice)

Without recursion this code would look a lot more confusing, at least to me.

State machines

Not all recursion has to be direct. Indirect recursion lets you represent state machines easily.

Here's a tiny adventure game with the player choosing state transitions:

function dark_room()
  print("You are in a dark room.")
  print("Pick a door: fuzzy or metal")
  choice = get_choice({fuzzy=1, metal=2})
  if choice == 1 then return fuzzy_room()
  elseif choice == 2 then return metal_room()
  end
end
function fuzzy_room()
  print("This room feels pretty fuzzy...")
  print("Pick a door: dark, metal")
  choice = get_choice({dark=1, metal=2})
  if choice == 1 then return dark_room()
  elseif choice == 2 then return metal_room()
  end
end
function metal_room()
  print("This room feels really metallic.")
  print("Pick a door: dark, fuzzy or win")
  choice = get_choice({dark=1, fuzzy=2, win=3})
  if choice == 1 then return dark_room()
  elseif choice == 2 then return fuzzy_room()
  elseif choice == 3 then return metal_room()
  end
end
function win_room()
  print("You found the treasure!")
  return
end
dark_room()

Without recursion you'd likely need to put everything in a single function with a loop and state variable.

Some things just make more sense when implemented recursively, to me at least.

Tail call optimization

There is a caveat with recursive programs: Each function call takes up stack space. The deeper you recurse, the more likely you are to run out of stack space and crash your program. This makes recursion useless in most programming languages.

However there is a compromise: If a return in a function is just a call to another function then that return call is a 'tail call'. Languages that implement tail call optimization will re-use the current function call's stack for the function you're calling, solving the issue of stack space.

All the examples on this page use tail calls and run in Lua which implements tail call optimization. This means every program on this page is immune to stack overflows.

The 1977 AI Lab Memo 443 talks more broadly about how tail calls are like goto statements that you can pass arguments to. Huge shout-out to the folks at Wikisource that transcribed this to an accessible text form.

The significant downside of tail call optimization is that it can make debugging more difficult as you lack a proper stack trace.

Language support

Despite languages slowly adding features from functional languages developed 40 years ago, tail call optimization is still unpopular. I'm guessing that the reason is because not many people see the use of recursion.

Here's an incomplete list of languages that support it automatically:

Here's an incomplete list of languages that require explicit support:

Here's an incomplete list of popular languages that don't support it:

  • C and C++
  • Go
  • Rust
  • Swift
  • PHP
  • Python
  • Raku
  • Anything running on the JVM (Java, Clojure, Scala, Kotlin)
  • Anything running on .NET (C#, F#)
  • Anything running on WebAssembly
  • Anything JavaScript or transpiling to JavaScript (TypeScript, CoffeeScript)

Things look decent for desktops, but not so much for phones or web browsers.

Personally I lean towards the idea of languages adding new keywords or explicit support for this, such as a 'goto' or 'jump' keyword. It helps in a debugger when you have stack frames by default, and it helps make it clear that it's important that this tail call is optimized.