MakeCode Languages: Blocks, Static TypeScript and Static Python

MakeCode programs can be authored in Blocks, Static TypeScript or Static Python.

Both Blocks and Static Python are converted to Static TypeScript before being compiled to lower-level languages. Blocks is implemented using Google Blockly.

Static TypeScript is a subset of TypeScript. Currently, we are using TypeScript version 2.6.1. TypeScript itself is a superset of JavaScript, and many MakeCode programs, especially at the beginner’s level, are also just plain JavaScript. There are more technical details about the language and the compiler in this MPLR 2019 paper.

MakeCode is meant for teaching programming first, and JavaScript second. For this reason, we have stayed away from concepts that are specific to JavaScript (for example, prototype inheritance), and instead focused on ones common to most modern programming languages (for example, loops, lexically scoped variables, functions, lambdas, classes).

Support for Static Python may still be experimental for some MakeCode editors. If Python isn’t shown as a language choice in the editor you’re using, you can enable it by clicking the gear icon ⚙️, going to About , then clicking Experiments. Experiments are released selectively, so if an experiment you want does not show yet, please be patient — it will be rolled out to your editor soon!

Static TypeScript

Supported language features

variable declarations with let, const
functions with lexical scoping and recursion
top-level code in the file; hello world really is console.log("Hello world")
if ... else if ... else statements
while and do ... while loops
for(;;) loops
for ... of statements (see below about for ... in)
break/continue; also with labeled loops
switch statement (on numbers, strings, and arbitrary types - the last one isn’t very useful)
debugger statement for breakpoints
conditional operator ? :; lazy boolean operators
namespaces (a form of modules)
all arithmetic operators (including bitwise operators)
strings (with a few common methods)
string templates (`x is ${x}`)
arrow functions () => ...
passing functions (with up to 3 arguments) as values
classes with static and instance fields, methods and constructors; new keyword
array literals [1, 2, 3]
enumerations (enum)
asynchronous functions that look synchronous to the user
method-like properties (get/set accessors)
basic generic classes, methods, and functions
class inheritance
classes implementing interfaces (explicitly and implicitly)
object literals { foo: 1, bar: "two" }
typeof expression
public/private annotations on constructor arguments (syntactic sugar to make them into fields)
initializers for class fields
lambda functions with more than three arguments
using generic functions as values and nested generic functions
binding with arrays or objects: let [a, b] = ...; let { x, y } = ...
exceptions (throw, try ... catch, try ... finally)
downcasts of a superclass to a subclass
function parameter bi-variance
explicit or implicit use of the any type
union or intersection types
using a generic function as a value
class inheritance for generic classes and methods
delete statement (on object created with {...})
object destructuring with initializers
shorthand properties ({a, b: 1} parsed as {a: a, b: 1})
computed property names ({[foo()]: 1, bar: 2})

Unsupported language features

Static TypeScript has nominal typing for classes, rather than the structural typing of TypeScript. In particular, it does not support:

interface with same name as a class
casts of a non-class type to a class
interface that extends a a class
inheriting from a built-in type
this used outside of a method
function overloading

Things you may miss and we may implement:

spread and reset operators (statically typed)
support of enums as run-time arrays
new on non-class types
using a built-in function as a value

Things that we are not very likely to implement due to the scope of the project or other constraints (note that if you don’t know what a given feature is, you’re unlikely to miss it):

file-based modules (import * from ..., module.exports etc); we do support namespaces
yield expression and function*
await expression and async function
tagged templates tag `text ${expression} more text` are limited to special compiler features like image literals; regular templates are supported
with statement
eval
for ... in statements (for ... of is supported)
prototype-based inheritance; this pointer outside classes
arguments keyword; .apply method
JSX (HTML fragments as part of JavaScript)

Static TypeScript has somewhat stricter ideas of scoping than regular TypeScript. In particular var is not allowed (let and const are supported), and identifiers defined with function can only be used after all variables from outer scopes have been defined. (The closure objects for functions that are used before definition is constructed right after last used variable have been defined. For functions defined before usage, the closure is constructed at the point of definition.) Both of the following examples will yield a compile error.

function foo1() {
    bar()
    let x = 1
    function bar() {
        let y = x // runtime error in JavaScript
    } 
}
function foo1() {
    const tmp = bar
    let x = 1
    tmp() // no runtime error in JavaScript
    function bar() { let y = x } 
}

For JS-only targets we may implement the following:

regular expressions

Note, that you can use all of these while implementing your runtime environment (simulator), they just cannot be used in user’s programs.

Semantic differences against JavaScript

As such, it isn’t really feasible to run a full JavaScript virtual machine in 3k of RAM, and thus PXT programs are statically compiled to native code to run efficiently.

PXT used to support a legacy compilation strategy, where numbers were represented as 32 bit signed integers, and all types were static. This is used by the v0 branch of micro:bit (but not the current v1) and the Chibitronics editors, but is no longer included in the main PXT code base.

PXT follows nominal typing for classes. This means that if you declare x to be of class type C, and at runtime it happens to be not of this type, then when you try to access fields or methods of x you will get an exception, just as if x was null.

It is also impossible to monkey-patch classes by overriding methods on class instance. As prototype chains are not accessible or even used, it’s also not possible to monkey-patch these.

Finally, classes are currently not extensible with arbitrary fields. We might lift this in future.

Object.keys(x) is not yet supported when x is dynamically a class type. It is supported when x was created with an object literal (eg., {} or { a: 1, b: "foo" }). The order in which properties are returned is order of insertion with no special regard for keys that looks like integer (JavaScript has really counter-intuitive behavior here). When we support Object.keys() on class types, the order will be the static order of field definition.

Execution environments

PXT programs are executed in at least three different environments:

microcontrollers, with native code compilation (ARM)
browsers
server-side JavaScript engines (node.js, etc)

We refer to the browser execution environment as the “simulator” (of the microcontroller), even though for some targets it’s the only environment.

The node.js execution is currently only used for automated testing, but one can easily imagine a programming experience for scripts running on headless devices, either locally or in the cloud.

In case of microcontrollers, PXT programs are compiled in the browser to ARM Thumb assembly, and then to machine code, resulting in a file which is then deployed to the microcontroller, usually via USB mass-storage interface.

For browsers and node.js, PXT programs are compiled to continuation-passing style JavaScript. This utilizes the TypeScript abstract syntax tree as input, but does not use TypeScript JavaScript emitter. On the plus side, this allows for handling of async calls, even if the browser doesn’t support yield statement, as well as cross-browser and remote debugging. On the other hand, the generated code is not really human readable.

Numbers are either tagged 31-bit signed integers, or if they do not fit boxed doubles. Special constants like false, null and undefined are given special values and can be distinguished. We’re aiming at full JavaScript compatibility here.

Static compilation vs a dynamic VM

PXT programs are compiled to native code. The only currently supported native target is ARM Thumb. PXT used to support two different AVR ports, but these have been removed together with the legacy compilation strategy.

Compared to a typical dynamic JavaScript engine, PXT compiles code statically, giving rise to significant time and space performance improvements:

user programs are compiled directly to machine code, and are never in any byte-code form that needs to be interpreted; this results in much faster execution than a typical JS interpreter
there is no RAM overhead for user-code - all code sits in flash; in a dynamic VM there are usually some data-structures representing code
due to lack of boxing for small integers and static class layout the memory consumption for objects is around half the one you get in a dynamic VM (not counting the user-code structures mentioned above)
while there is some runtime support code in PXT, it’s typically around 100KB smaller than a dynamic VM, bringing down flash consumption and leaving more space for user code

The execution time, RAM and flash consumption of PXT code is as a rule of thumb 2x of compiled C code, making it competitive to write drivers and other user-space libraries.

Interfacing C++ from PXT is easier than interfacing typical dynamic VMs, in particular for simple functions with numbers on input and output - there is no need for unboxing, checking types, or memory management.

The main disadvantage of using static compilation is lack of dynamic features in the language (think eval), as explained above.

While it is possible to run a dynamic VM even on an nRF51-class device (256KB of flash, 16KB of RAM), it leaves little space for innovative features on the software side, or more complex user programs and user-space (not C++) drivers.

Smaller int types

As noted above, when performing computations numbers are treated as doubles. However, when you store numbers in global variables or (soon) record fields you can choose to use a smaller int type to save memory. Microcontrollers typically have very little memory left, so these few bytes saved here and there (especially in commonly used packages) do add up.

The supported types are:

uint8 with range 0 to 255
uint16 with range 0 to 65536
int8 with range -128 to 127
int16 with range -32768 to 32767
int32 with range -2147483648 to 2147483647
uint32 with range 0 to 4294967295

If you attempt to store a number exceeding the range of the small int type, only the lowest 8 or 16 bits will be stored. There is no clamping nor overflow exceptions.

If you just use number type (or specify no type at all) in tagged strategy, then if the number fits in signed 31 bits, 4 bytes of memory will be used. Otherwise, the 4 bytes will point to a heap-allocated double (all together, with memory allocator overhead, around 20 bytes).

In legacy strategy, number is equivalent to int32, and there is no uint32.

Limitations

arrays of int types are currently not supported; you can use a Buffer instead
locals and parameters of int types are not supported

Near future work

There are following differences currently, which should be fixed soon. They are mostly missing bridges between static, nominally typed classes, and dynamic maps.

default parameters are resolved at call site; they should be resolved in the called method so eg. virtual methods can have different defaults
x.foo, where x is class and foo is method cannot be currently used as a value; we could make it equivalent to JavaScript’s x.foo.bind(x)
Object.keys() is currently not implemented for classes; when it will be the order of fields will be static declaration order
how to validate types of C++ classes (Pin mostly)?

Python

Supported language features

The following language features should be fully supported and work according to the Python 3 language specification.

lists
dictionaries
function definitions
function calling
method calling
calling into any MakeCode library code
literals: strings, numbers, boolean, None
many list methods: pop, clear, index, count, len
many string methods: casefold, capitalize, center, count, endswith, find, index, isalnum, isalph, isascii, isdigit, isnumeric, isspace, isdecimal, isidentifier, islower, isprintable, istitle, issupper, join, ljust, lower, lstrip, replace, rfind, rindex, rjust, rsplit, rstrip, split, splitlines, startswith, strip, swapcase, title, upper, zfill, ord
many math functions: int, min, max, abs, randint
while loop
for-in loop with range(), array or string literal
break, continue
conditional statements (if, elif, else)
pass statement
variables*
if expression / ternary operator
comparison (in, notin)
byte literal
type annotations using “:” syntax
slice notation**
lambda

*: variable semantics have slightly different scoping rules than Python 3 and global & nonlocal keywords are unsupported. **: some slice notation is not yet supported

Not supported language features

The following language features are not yet supported.

with
assert
classes
__constructor
super()
global & nonlocal
raise
try
generators
attributes
import, import from
sets
list comprehensions
set comprehensions
dictionary comprehensions
await
yield, yield from
format strings
arrays
all list, string, math methods not listed above
*args / varargs

Edit this page on GitHub
Edit template of this page on GitHub