NAME
ngslang - Next Generation Shell language reference.
What is NGS?
NGS is an alternative shell. At its core is a domain-specific language that was specifically designed to be a shell language.
NGS is under development. The language part is already good enough to write some useful scripts. The CLI still doesn’t exist and will be written using the same language.
Running NGS scripts
ngs script_name.ngs
You can put the following line as first line of your script:
#!/usr/bin/env ngs
If you do that, you can run the script as ./script_name.ngs
or /full/path/to/script_name.ngs
(you must make your script executable, by running chmod 755 script_name.ngs
).
See more about running NGS in NGS(1).
Terminology
- type - Built-in or user-defined data type, similar to Python, Ruby and other languages.
- object - Instance of a type, similar to Python, Ruby and other languages. The phrase "MyType object" refers to an Instance of "MyType".
- field - A named slot of an object, similar to field in Python, Java, etc.
- attributes - Slot for auxiliary data on most types of objects (the ones that are references, i.e. not Int,Bool,Null), typically a
Hash
ornull
. Attributes are not fields and are accessed differently (usingattrs
method). - method - Built-in or user-defined function. User defined methods can be closures. Methods are also called functions in several places in the documentation.
- multimethod - A MultiMethod object containing ordered list of methods. When called, the appropriate method is selected from the list to perform the computation.
Language principles
This section is about principles behind NGS language design.
Systems engineers’ language
NGS is a domain-specific language. It is aimed to solve common system tasks in a convenient manner. Some examples:
fetch('myfile.json')
will decode the JSON and return the data structure. (Useread()
to get raw contents).- The
``my_command``
will decode the command output (JSON for example) and return the data structure. Note that``aws ...``
will be parsed even further (not just JSON) to return more usable data structures. my_array.my_prop
returns an array of.my_prop
properties of each element.
Uniformity
NGS tries to be uniform wherever possible to minimize surprises. The idea here is to match between expected and actual behaviour of given method plus parameters.
Power
As a rule, trade-offs between power and not allowing to shoot yourself in the foot are resolved in favor of the power solution. The language is aimed at experienced engineers which use their own judgement. The language should be powerful enough to shoot all feet in the building at once.
Simple methods naming for less guess work
For example, the multimethod +
:
1 + 2
adds the numbersarr1 + arr2
adds (concatenates) arrayshash1 + hash2
produces merged hash.
Extensibility
fetch('your_file.super-format')
can be extended to decode your format.read
, which reads from a file, can be extended to support HTTP or S3.- Most of the syntax (for example
my_var.my_field
ormy_var[my_index]
) is just sugar for calling methods. This behaviour lets you, the user, define any operator for existing or your custom types.
Simplicity
Very small number of core concepts in the language:
- Types with a simple type system, geared only toward multiple dispatch. No classes.
- Multimethods, which allow using same method name for operations on different types, as in the
+
example above. - Closures
- Exceptions
Familiarity
Many concepts and syntax constructs come from other languages.
Naming convention
var_name
CONST_NAME
- Currently NGS does not support constants, it’s only a naming conventionmethod_name
(multimethod name)TypeName
TransformationName
- example:Strs
(converts to array of strings),Argv
(constructs command line arguments array),ExitCode
(converts anything to integer exit code).
Reasoning behind TransformationName
:
- Transforms data into something else, like many other constructors
- The output data might get it’s own data type some day
The two syntaxes
NGS has two syntaxes: commands syntax and code syntax.
Motivation: I can not imagine any syntax for a "real" programming language that I would like which is built on top of the commands syntax, which is similar to bash. That’s why NGS has code syntax for "real" programming and commands syntax for running external programs and i/o redirection, where this syntax is very convenient.
Commands syntax
This is the resembles-bash syntax geared towards running external programs and i/o redirection. Command syntax is the syntax at the top level of every NGS script. The most simple NGS scripts might look very similar to bash scripts.
Commands are separated by either newlines or by semicolon (;
).
cat a.txt; touch myfile echo mystr >myfile ls | wc -l
Code syntax
Code syntax resembles other high-level languages such as Python or Ruby.
Example:
1 + 2 * 3; %[abc def ghi].without('ghi').each(echo)
Expressions are separated by either newlines or by semicolon (;
). It is also permitted to have trailing semicolon in the end of a line:
my_func(); my_other_func()
code syntax is the syntax of -e, -E, -p, -pi, -pl, -pj, -pjl, and -pt switches to ngs
interpreter. Example:
$ ngs -p '(1...10).filter(F(num) num % 2 == 0)' # Output: [2,4,6,8,10]
Switching between commands and code syntaxes
In commands syntax it is possible to switch to code syntax in one of the following ways:
ls { code syntax here } ls ${ code that computes the file name and returns a string, spaces don't matter, expanded into single argument of ls } # Expands to zero or more positional arguments to ls ls $*{ code that computes the files names and returns array of strings. Spaces don't matter, each element is expanded into single argument of ls }
In code syntax it is possible to switch to commands syntax in one of the following ways:
# Capture output out = `commands syntax` my_var = "mystring\n" + `my other command` # Capture output and decode it parsed_data_structure = ``commands syntax`` n = ``curl https://example.com/myservice/stats``.number_of_items_in_storage # Get reference to a process my_process = $( commands syntax )
Comments
Comments are allowed in both commands and code syntax. At the beginning of a line, a comment starts with a hash sign (#
). A comment that starts in a middle of the line starts with space and a hash sign (#
) and continues to end of the line.
# comment till end of line run_external_program --input my_file # COMMENTS HERE ARE NOT IMPLEMENTED YET { a = 1 # comment till end of line }
Also, lines that start with "TEST" (note the trailing space) are considered to be comments. They are ignored. The purpose is to allow an external testing script to extract and run these tests.
Variables
As in other languages, variables are named locations that can store values. Variables’ names should consist of ASCII letters (a-z, A-Z) and numbers (0-9). Variable name must start with a letter. Assignment to variables looks the same in both commands and code syntax. Referencing a variable in the code syntax is just the variable’s name while referencing it in commands syntax or inside string interpolation is $my_var
(not recommended) or ${my_var}
(recommended).
Advanced topic: more precisely, the naming restrictions for the variables mentioned above are naming restrictions on identifiers. NGS can have variables that are named not by the rules above.
Note that it is not recommended to use names which are not valid identifiers. As an exception to this rule, it’s OK for methods which correlate with NGS syntax to have names which are not valid identifiers. Example of such methods are binary operators (+
, -
, etc.). See more about methods’ naming in "Methods and multimethods" section below.
Assigning to a variable works in both commands and code syntax.
# Assigning value to a variable. # Note that syntax to the right of = is code syntax even if the surrouning syntax is commands syntax. a = 1 + 2 # Referencing a variable in code syntax. # Not that arguments to method call are in code syntax even if the surrouning syntax is commands syntax. echo(a) # Output: # 3 my_file = '1.txt' # Referencing a variable inside commands syntax. $(ls $my_file) $(ls ${my_file}) # Referencing a variable inside string interpolation. echo("A is ${a}") # Output: # A is 3 # In-place assignment ( -=,+=,*=,/=,%= ) a += 100 # The above is exactly the same as a = a + 100
Referencing undefined variable will cause GlobalNotFound
or UndefinedLocalVar
exceptions.
Checking whether a variable is defined
echo(defined a) a = 100 echo(defined a) # Output: # false # true
It’s a rare circumstance that one needs to use defined
. Please try to avoid such situations.
Variables’ scoping rules
Variables scoping is similar to Python’s.
Variables scope types:
local
- local to a methodupvar
- local to enclosing methodglobal
- global
In a method, any variable that is not assigned to inside the method is looked up as an upvar
(enclosing methods) and as global
.
a = 1 F f() { echo(a) } f() # Output: 1 a = 1 F f() { a = 2 F g() { echo(a) } g() } f() # Output: 2
In a method, any variable that is mentioned in any of the enclosing methods is automatically upvar
- references the variable in the outer scope.
a = 1 F f() { a = 2 F g() { a = 10 } g() echo(a) } f() echo(a) # Output: 10 # Output: 1
In a method, any variable that is assigned to (including the i
in construct for(i;10) ...
) in the method is automatically local
unless it’s an upvar
as described above. Inner methods are also local
by default.
a = 1 F f() { a = 2 echo(a) F inner_func() "something" } f() echo(a) # Output: 2 # Output: 1 # Can not call inner_func from outside f()
Modifying variables’ scoping
You can modify default scoping using the global
and local
keywords.
a = 1 F f() { a = 2 F g() { # local instead of upvar local a a = 3 } g() echo(a) } f() # Output: 2 a = 1 F f() { a = 2 F g() { # global instead of upvar global a a = 3 } g() echo(a) } f() # Does not work yet due to a bug, "a" stays upvar # Output: 3
Planned features for variables
Destructuring assignment. Something like the following:
[a, b, *rest] = my_arr
Syntax for Hash
destructuring is not clear yet:
... = my_hash
Destructuring should also be available in for
:
# Hash iterator gives Arr of two elements each iteration. # Destructuring would place the two elements in k and v varaibles. for [k, v] in my_hash { ... }
Methods and multimethods
Methods’ names are composed of letters (a-z, A-Z), digits (0-9) and the following characters: _
, -
, |
, =
, !
, @
, ?
, <
, >
, ~
, +
, *
, /
, %
, (
, )
, $
, .
, ``
, "
, :
, (space), [
, ]
.
Multi-dispatch
Each value in NGS has a type, similar to many other languages. One of the main features of NGS is choosing the correct method of a multimethod, based on types of the arguments. This feature is called multi-dispatch.
Let’s start with the following example:
F +(a:Int, b:Int) { ... } F +(s1:Str, s2:Str) { ... } { 1 + 1 # 2 'a' + 'b' # 'ab' }
The +
is a multimethod. It has a few methods. You can see definitions of two of the methods in the example above. One method adds numbers. Another method concatenates strings. How NGS knows which one of them to run? The decision is made based on arguments’ types. NGS scans the methods list backwards and invokes the method that matches the given arguments (this matching process is called multiple dispatch).
When a MultiMethod
is called, the multimethod’s methods list is searched from last element to first element. The method which parameters’ types match is executed. This model is different from many other languages, where the most specific method is called. In NGS, if two methods’ parameters both match the arguments, that one that was defined last is called. Typically, methods for more specific types are defined later in code so the behaviour is similar to other languages. If matched method is executed and a guard
(see below) fails, the search is continued as if the method did not match.
Consider the following types:
{ type Vehicle type Car(Vehicle) }
Typical MultiMethod definition:
F park(v:Vehicle) { ... } F park(c:Car) { ... } park(Vehicle()) # calls the first method park(Car()) # calls the second method
Leveraging the NGS MultiMethod call behaviour: one can for example do debugging by adding the third method to the park
MultiMethod:
F park(v:Vehicle) { debug("Parking vehicle ${v}") super(v) # Execute one of the two methods defined above }
Calling a method - simple syntax
This syntax is available in both commands and code syntax.
echo('Hello world') # Output: # Hello world
Calling a method - object-style syntax
This syntax is available in code syntax only.
As a syntactic sugar, method call f(a, b, c)
can be written as a.f(b, c)
.
{ 'Hello world'.echo() } # Output: # Hello world
Methods for operators
In NGS, binary operators are just syntactic sugar for calling methods. The variety of permitted characters in methods’ names allow using same method name as the operator.
# Define the + operator for Fun objects (Fun objects are callable objects: methods and types) F +(f:Fun, g:Fun) { F composed_function(*args) { f(g(*args)) } } ... F reject(something:Eachable, predicate) { p = Pred(predicate) something.filter(not + p) # Usage }
Current methods with special names are listed below.
# Output wrapped manually for your convenience. # All of them correspond to a syntax, mostly binary operators. $ ngs -p 'globals().filterv(Fun).keys().filter(/[^a-zA-Z0-9_]/).join(", ")' ==, ., [], +, *, /, -, <, <=, >, >=, .=, []=, %, ===, ::, is not, ::=, ?, \, +?, !=, ~, not in, .., ..., !==, ~~, "$*", $(), ``, ````, %(), //
Binary operators (methods) and precedence
Higher numbers mean higher precedence. Note that short-circuit operators can not and are not implemented as method calls - that would require eager evaluation of the arguments.
tor 40 "Try ... or", short-circuit questionable_code tor default_value tand 50 "Try ... and", short-circuit (not sure when it's needed, don't use it) or 60 Logical or, short-circuit and 65 Logical and, short-circuit in 70 Value-in-container check 1 in [1, 2, 3] "a" in {"a": 1} not in 70 Value-not-in-container check 10 not in [1, 2, 3] "b" not in {"a": 1} is 90 Instance-of check 1 is Int is not 90 Not-instance-of check 1 is not Str | 120 "Pipe", currenty not used === 130 "Same as" v = [1, 2]; v === v !== 130 "Not same as", [1, 2] !== [1, 2] == 130 "Equals", [1, 2] == [1, 2] != 130 "Not equals" [1, 3] != [1, 2] <= 150 "Less than or equals" < 150 "Less than" >= 150 "Greater or equals" > 150 "Greater" ~ 150 "Match" "a1b2c" ~ /[0-9]/ ~~ 150 "Match all" "a1b2c" ~~ /[0-9]/ ... 160 "Inclusive range" 0...5 # 0,1,2,3,4,5 .. 160 "Exclusive range" 0..5 # 0,1,2,3,4 + 190 "Plus" +? 190 "Plus maybe" "a" + "b" # "ab" null + "b" # null "a" + null # null - 190 "Minus" * 200 "Multiply" or "repeat" 3 * 5 # 15 "ab" * 3 # "ababab" EmptyBox * 2 # two values of EmptyBox type % 200 "Modulus" or "each" 3 % 2 # 1 ['a', 'b'] % echo # Outputs a and b on different lines / 200 "Divide" or "map" 10 / 5 [1, 2, 3] / F(x) x * 2 ? 200 "Filter" [1, 2, 3] ? F(x) x > 1 \ 200 "Call" [1, 2, 3] \ echo
Defining a method
Method definition works in both commands and code syntax.
When defining a named method, NGS automatically creates a MultiMethod
with the given name (if it does not exist) and appends the new method to the multimethod’s list of methods.
F my_method() { echo("wassup?") echo("my method is running") }
Method with optional parameters
F mysum(a:Int, b:Int=100) a+b echo(mysum(5)) # Output: 105 echo(mysum(5, 200)) # Output: 205
Method with "rest" parameter
F print_with_prefix(prefix:Str, *strings) { strings.each(F(s) { echo("$prefix$s") }) echo("Printed ${strings.len()} lines with prefix") } print_with_prefix('-> ', 'abc', 'def') # Output: # -> abc # -> def # Printed 2 lines with prefix
Method with "rest keywords" parameter
F print_properties(separator:Str, **kw_args) { kw_args.each(F(name, value) { echo("$name$separator$value") }) } print_properties(' => ', a=10, b=20) # Output: # a => 10 # b => 20
Method guards
Method guards restrict method execution to specific conditions, typically the conditions are functions of parameters’ values. If the guard condition evaluates to false
, NGS considers the method as if it did not match the arguments and continues to search for other methods to run in the multimethod’s methods list.
F gg(i:Int) { echo("First gg active") echo(i*10) } # gg is now MultiMethod with one method gg(1) gg(5) # Output: # First gg active # 10 # First gg active # 50 F gg(i:Int) { echo("Second gg checking guard") guard i > 3 echo("Second gg active") echo(i*100) } # gg is now MultiMethod with two methods gg(1) gg(5) # Output: # Second gg checking guard # First gg active # 10 # Second gg checking guard # Second gg active # 500
Calling super methods
Super methods are methods higher in the multimethod’s list of methods.
F sup(x) x+1 F sup(x) super(x) * 10 echo(sup(5)) # Output: 60
Anonymous function (method) literal
Syntactically, anonymous method is F
not followed by a name.
f = F(item) { echo("Item: $item") } echo("F is $f") # Output: F is <UserDefinedMethod <anonymous> at 1.ngs:1> each([10,20,30], f) # Output: # Item: 10 # Item: 20 # Item: 30 # Anonymous method as map() parameter echo([1,2,3].map(F(x) x*5)) # Output: [5,10,15]
Anonymous function (method) literal using magical X, Y or Z variables
Three special variables can be used as a syntactic sugar for creating anonymous methods. A method call which uses one of the special variables X, Y, or Z is wrapped in anonymous function as follows: F(X=null, Y=null, Z=null) { method_call(....) }
.
Operators are just syntactic sugar for method calls so X*5
for example is also processed as described.
# X*5 is same as F(X=null, Y=null, Z=null) { X*5 } which in out case is roughly equivalent to F(X) X*5 echo([1,2,3].map(X*5)) # Output: [5,10,15] # "Key:" + X is same as F(X=null, Y=null, Z=null) { "Key:" + X } echo({"a": "one", "b": "two"}.map("Key:" + X)) # Output: ['Key:a','Key:b'] echo({"a": "one", "b": "two"}.map("Val:" + Y)) # Output: ['Val:one','Val:two'] echo([1,2,3,11,12].count(X>10)) # Output: 2
If the special variables X, Y, or Z are found in a string interpolation, such as "Key $X, Value $Y"
, the string is wrapped in anonymous function as described above.
echo({"a": 1, "b": 2}.map("Key $X, Value $Y")) # Output: ['Key a, Value 1','Key b, Value 2']
Anonymous function (method) literal using magical A, B or C variables
Additional syntactic sugar for anonymous function is { ... }
. This is equivalent to F(A=null, B=null, C=null) { ... }
.
# X*5 + 1 would not work as X*5 itself would be anonymous function echo([1,2,3].map({ A*5 + 1 })) # Output: [6,11,16]
Returning values from methods
Method evaluates to the last expression, as in Lisp and Ruby. Additionally following expressions allow immediate return from a method.
return
- returnsnull
return EXPR
- returns the value of EXPRCOND returns
- if COND evaluates to true, returnsnull
COND returns EXPR
- if COND evaluates to true, returns the value of EXPR
Examples of return
, returns
and last expression value as result of a method.
F flow_ret(x) { if x < 0 { unrelated_calculation = 1 return "negative" } x == 0 returns "zero" "positive" } echo(flow_ret(-1)) echo(flow_ret( 0)) echo(flow_ret( 1)) # Output: # negative # zero # positive
Returning from inner method
Use Return
type to return from inner methods.
F find_the_one(haystack:Arr, needle) { ret_from_find_the_one = Return() echo(ret_from_find_the_one) haystack.each(F(elt) { elt == needle throws ret_from_find_the_one("Found it!") }) "Not found" } echo([10,20].find_the_one(20)) echo([10,20].find_the_one(30)) # Output: # <Return closure=<UserDefinedMethod find_the_one at 1.ngs:2> depth=7 val=null> # Found it! # <Return closure=<UserDefinedMethod find_the_one at 1.ngs:2> depth=7 val=null> # Not found
Referencing specially named methods
Since identifiers are alphanumeric (with underscores), my_method
can be referenced simple by name, while +
for example can not. Here is example of referencing specially named methods:
# (+) references the method + F sum(something:Eachable1) something.reduce(0, (+))
Types
NGS is a dynamically typed language: values (and not variables) have types.
a = 1 a = "ok" # 'a' had value of one type and then value of another type
NGS is a "strongly typed" language: values are not implicitly converted to unrelated types. This makes the language more verbose in some cases but helps catch certain type of bugs earlier.
echo(1+"2") # ... Exception of type MethodNotFound occurred ... # That means that NGS has no method that "knows" how to add an Int and an Str echo(1+Int("2")) # Output: 3
NGS has several pre-defined types. A programmer can define additional types. Types roughly correspond to classes in other languages (Python, Ruby, Java) but without defined fields and methods.
Creating object of given type
Creating object of a given type is calling the type. Syntactically, it’s type name followed by parenthesis which optionally contain arguments.
# Create object of type Path p = Path("/") # Create EmptyBox object eb = EmptyBox()
Defining your own types
Type definition is only supported in code syntax.
# Switch to code syntax inside { ... }. "type" currently does not work in command syntax { # Define type type Vehicle # Define sub-type type Car(Vehicle) type RedThing # Multiple inheritance type RedCar([Car, RedThing]) } echo(Vehicle) echo(Car) echo(RedCar.parents) # Output: # <Type Vehicle> # <Type Car> # [<Type Car>,<Type RedThing>]
Type constructors and object creation
- When type definition (
type T
) is executed, NGS defines the.constructors
MultiMethod that includes one method. This method creates object of typeT
. The type of this method isNormalTypeConstructor
. - After defining a type
T
, one can add methods to.constructors
usingF T(...) {...}
definitions. If you add your method to the.constructors
field, please make sure it returns an object of typeT
or object of a type that is a subtype ofT
. Otherwise, it will be surprising for the callers. - When you call a user defined type, its
.constructors
MultiMethod is called. - If the default constructor added by NGS (method of type
NormalTypeConstructor
) gets executed, it does the following:- Creates object of type
T
- Runs
init()
MultiMethod with the new object as the first argument and the arguments of the call toT(...)
as second and on arguments toinit(...)
. - Note that calling
init()
might triggerMethodNotFound
exception. It is ignored if and only if the call toT()
had no arguments.
- Creates object of type
Examples:
# Customizing Box object creation (code from stdlib) F Box(x) FullBox(x) F Box(n:Null) EmptyBox() F Box(a:Arr, idx) { ... } F Box(h:Hash, k) { ... } # FullBox initialization, called when FullBox(x) is called F init(b:FullBox, val) b.val = val # Note that there is no init(eb:EmptyBox) as there is nothing to initialize Box(1) # FullBox with 1 as value Box(null) # EmptyBox
Loading and running additional code
require
The require
method reads, compiles (to bytecode) and executes the given file. Currently absolute paths and paths relative to current directory are supported. In future, paths relative to current directory will not be supported but rather paths relative to the file that does require()
call will be supported, similar to Node.js. This needs work which was not done yet, there was never intention to support paths relative to current directory; it’s just wrong.
require('/home/someone/my_module.ngs')
require
returns the value of last expression in the file. This feature plays nice with namespaces (see below) in cases when the whole file is an ns { ... }
expression.
Auto-loading
When an undefined global variable is referenced, global_not_found_handler
is called. This handler tries to load a module (currently from the lib/autoload
directory) that will define the referenced variable. Here are some modules that are autoloaded and additional variables/methods they define.
- AWS
- IP (IPAddr, IPNet)
- Iter (ArrIter, ConstIter, HashIter, RangeIter, …)
Namespaces
Namespaces are used to prevent names collisions and hide methods. One of the main usages for namespaces is in modules.
Let’s consider a file named my_ns.ngs
:
ns { # This method is only visible to other methods in the namespace F _private_helper_method() { ... } F accessible_method() { ... _private_helper_method() ... } global global_method F global_method() { ... _private_helper_method() ... } }
Another file, example.ngs
could be using the above namespace utilizing the require
method:
arbitrary_ns_name = require('my_ns.ngs') global_method() # :: is namespace member access operator arbitrary_ns_name::accessible_method()
How namespace definition works
Namespace definition works in both commands and code syntax.
The ns { ... }
definition is equivalent to the following code:
F() { _exports = {} .... _exports }() # Call as soon as the method is defined
Inside the F() { ... }
, all variables and methods which do not start with underscore (_
) are automatically added to the _exports
hash. The underscore exclusion prevents from _exports
itself to be added to the hash. This implementation also allows arbitrary manipulations of the returned value.
# The simple case ngs -pi 'ns { F func() 7; }' Hash of size 1 [func] = <MultiMethod with 1 method(s)> # Prefix all exported methods and variables with "my_" ngs -pi 'ns { F func() 7; _exports .= mapk("my_$X") }' Hash of size 1 [my_func] = <MultiMethod with 1 method(s)> # Make "ns" return arbitrary value $ ngs -pi 'ns {_exports=7}' Int: 7 # Show exclusion of underscore prefixed items $ ngs -pi 'ns { F visible() 7; F _invisible() 7; }' Hash of size 1 [visible] = <MultiMethod with 1 method(s)> # Not only methods but also variables are exported $ ngs -pi 'ns { visible=1; _invisible=2; }' Hash of size 1 [visible] = 1
Different names for external variables in a namespace
Namespaces also support additional syntax ns(PARAMETERS) { ... }
. This syntax is converted to F(PARAMETERS) { ... }()
. This allows referencing external variables by another names. Inside the namespace one can now define variables and methods which would shadow the global (or upvar) variables and still access them.
Here is example from lib/autoload/AWS.ngs
:
ns(GlobalRes=Res, GlobalResDef=ResDef, global_test=test) { ... doc AWS resource type Res(GlobalRes) ... doc AWS Resource definition type ResDef(GlobalResDef) ... }
Note that when ns(PARAMETERS) { ... }
syntax is converted to F(PARAMETERS) { ... }
, the method is called without any arguments so all parameters must have default values or an exception will occur:
$ ngs -pi 'ns(a) { 7 }' ... Exception of type ArgsMismatch occurred
Planned future changes to namespaces
The ns { ... }
construct will probably not return a Hash
object but object of a new type, Namespace
which will probably be subtype of Hash
. In any case, the ::
operator will be defined in such way that my_namspace::my_method()
will continue to work.
Frequently used types and methods
This section is not a reference but rather an overview. It includes the most important types and methods. For full documentation regarding types and methods see the generated documentation.
Null
null
signifies absence of data. null
is the only value of type Null
.
echo("abcd".pos("c")) echo("abcd".pos("x")) # Output: # 2 # null if null echo("if null") # No output, null converts to false
Bool
The type Bool
represents a boolean. The values true
and false
are the only values of type Bool
.
if true echo("if true") if false echo("if false") # Output: # if true echo(1 == 1) # Output: # true echo(1 == 2) # Output: # false echo(true and false) # Output: # false echo(true or false) # Output: # true
When a boolean value is needed, such as in if EXPR {...}
, the EXPR
is converted to boolean by calling Bool(EXPR)
.
NGS defines Bool
multimethod (technically Bool
type constructors), with methods for many types. These methods cause the following values to be converted to false:
- 0
- null
- false
- empty array
- empty hash
- empty string
- an EmptyBox object, for example
[1,2,3].Box(10)
(which boxes the element at index 10) - a Failure object, for example
Result({ 1 / 0 })
- regular expression that did not match, for example
"abc" ~ /XYZ/
Unless specified otherwise, all other values are converted to true
.
To define how your user-defined types behave as boolean, define Bool(x:YOUR_TYPE)
method.
Int
Int
represents integer numbers
echo(1 + 2 * 3) # Output: # 7 { 3.each(echo) } # Output: # 0 # 1 # 2 { 3.map(X*2).each(echo) } # Output: # 0 # 2 # 4
Str
Str
represents string of bytes.
Strings - string interpolation
a = 1 echo("A is now $a") echo('A is now $a') # Output: # A is now 1 # A is now $a echo("Calculation result A: ${10+20}") echo("Calculation result B: ${ [1,2,3].map((*), 10).join(',') }") # Output: # Calculation result A: 30 # Calculation result B: 10,20,30
Strings - some basic methods that operate on strings
echo("abc" + "abc") # Output: # abcabc echo("abc:def:ggg".split(":")) # Output: # ['abc','def','ggg'] echo("abc7def8ggg".split(/[0-9]/)) # Output: # ['abc','def','ggg'] echo("abc7def8ggg".without(/[0-9]/)) # Output: # abcdefggg if "abc" ~ /^a/ { echo("YES") } else { echo("NO") } # Output: # YES m = "abc=120" ~ /=/ if m { echo("Name=${m.before} Value=${m.after}") } # See RegExp type # Output: # Name=abc Value=120
Arr
Arr
represents an ordered list which can be accessed by zero-based index. Currently Arr
is implemented as consecutive memory locations.
Arrays - basics
x = ["first", "second", "third", "fourth"] echo(x) # Output: # ['first','second','third','fourth'] echo(x.len()) # Output: # 4 echo('first' in x) # Output: # true echo('fifth' in x) # Output: # false echo(x[1]) # Output: # second echo(x[1..3]) # Output: # ['second','third'] echo(x == %[first second third fourth]) # Output: # true x = [ "blah" 2 'some text' ] echo(x) # Output: # ['blah',2,'some text'] echo(x[10]) # ... Exception of type IndexNotFound occurred ...
Arrays - some basic methods that operate on arrays
{ [1,2].each(echo) } # Output: # 1 # 2 { [1,2].each(F(item) { echo("Item: $item") }) } # Output: # Item: 1 # Item: 2 echo([1,2].map(X * 10)) # See "Anonymous function literal using magical X, Y or Z variables" # Output: # [10,20] echo(Hash([["a",1],["b",2]])) # Converts array consisting of pairs into Hash # Output: # {a=1, b=2}
Hash
Hash
represents a mapping between keys and values. Hash
is implemented as a hash table.
Hashes - basics
x = {"a": 1, "b": 2} echo(x) # Output: # {a=1, b=2} echo(x['a']) echo(x.a) # Output: # 1 # 1 { x.b = 20 } echo(x.len()) # Output: # 2 echo(x.keys()) # Output: # ['a','b'] echo(x.values()) # Output: # [1,20] x = { "c": 1 "d": 2 } echo(x) # Output: # {c=1, d=2} echo('c' in x) # Output: # true echo(1 in x) # Output: # false echo(x.get('d')) # Output: # 2 echo(x.get('e')) # Output: # null echo(x.get('e', 'my_default')) # Output: # my_default echo(x.e) # ... Exception of type KeyNotFound occurred ... x = %{akey avalue bkey bvalue} echo(x) # Output: # {akey=avalue, bkey=bvalue}
Hashes - some basic methods that operate on Hashes
{ h = {"a": 1, "b": 2} h.each(F(k, v) { echo("$k = $v") }) } # Output: # a = 1 # b = 2 h = {"a": 1, "b": 2} echo(h.map(F(k, v) "key-$k value-$v")) # Output: # ['key-a value-1','key-b value-2'] h = {"a": 1, "b": 2} echo(h.mapk("key-${X}")) # Output: # {key-a=1, key-b=2} h = {"a": 1, "b": 2} echo(h.mapv(X*10)) # Output: # {a=10, b=20} h = {"a": 1, "b": 2} echo(h.mapkv({ ["key-$A", B*100] })) # Output: # {key-a=100, key-b=200} h = {"a": 1, "b": 2} echo(Arr(h)) # Converts Hash into array consisting of pairs # Output: # [['a',1],['b',2]]
Higher-order functions (methods)
This section is an overview. The purpose is to give overall impression of how higher-order functions feel in NGS. For full documentation regarding methods (including higher-order functions) see the generated documentation.
Higher order functions are functions that get another function as a parameter.
echo([1,2,3].all(Int)) # Output: # true echo([1,2,3,"a","b"].all(Int)) # Output: # false echo([1,2,3,"a","b"].any(Str)) # Output: # true echo([1,2,11,12,3].none(X>100)) # Output: # true echo([1,2,"a","b",3].filter(Int)) # Output: # [1,2,3] echo([1,2,"a","b",3].reject(Int)) # Output: # ['a','b'] echo([1,2,11,12,3].reject(X>10)) # Output: # [1,2,3] echo([1,2,11,12,3].count(X>10)) # Output: # 2
Exceptions
Similar to other languages (such as Python, Ruby, Java, etc), NGS has exceptions which can be thrown and caught.
{ type MyError(Error) try { # "e1 throws e2" is same as "if e1 { throw(e2) }" 1 == 2 throws Error("This can't be!") throw MyError("As usual, very helpful message") } catch(e:MyError) { echo("[Exceptions] This error was expected: $e") } catch(e:Error) { echo("[Exceptions] Unexpected error: $e") throw e } # Output: [Exceptions] This error was expected: ... }
The catch
clauses are implemented using multimethod. If an exception occurs, the catch
multimethod is called with a single argument, the exception. Unlike all other situations, the search of a method that matches the exception is done from the beginning to the end of methods list.
Implementation of catch
using multimethod prevents using return
inside catch
clauses in the expected way. That’s why this implementation might change in the future.
Ignoring exceptions using "try" without "catch"
myhash = {"a": 1, "b": 2} v = try myhash.a echo(v) # Output: 1 v = try myhash.c echo(v) # Output: null v = try { unrelated = 1+2; myhash.c } echo(v) # Output: null
Default value for expression that causes exception
The short-circuit binary operator tor
(try-or) allows specifying default value for an expression that might throw an exception.
# .backtrace might not exist, .frames might not exist # Accessing non-existent fields using the . notation will cause KeyNotFound exception parent_frames = parent.backtrace.frames tor []
EXPR tor null
is functionally equivalent to try EXPR
.
Alternative mechanism for handling exceptions
See generated documentation for the Result
type.
Flow control
If
If works in both commands and code syntax.
if my_var > 10 { a += 100 b = "x" } else { b = "y" } result = if my_var then "xyz" else "ww". result = if my_var 10 20 # result is now either 10 or 20
In if
, while
, and for
, where {...}
code block is expected, if the block consists of only a single expression, curly braces are optional. In if
, the then
and else
keywords are optional.
Note that if
is an expression. if
without else where the condition is equivalent to false
, evaluates to null
.
Loops
Loops work in both commands and code syntax.
for(i=0; i<5; i+=1) { if i == 3 { continue } echo("Regular loop, iteration $i") } # Output: # Regular loop, iteration 0 # Regular loop, iteration 1 # Regular loop, iteration 2 # Regular loop, iteration 4 for(i;5) { i == 3 continues echo("Shorthand loop, iteration $i") } # Output: # Shorthand loop, iteration 0 # Shorthand loop, iteration 1 # Shorthand loop, iteration 2 # Shorthand loop, iteration 4 for i in [1,5,10,20,50] { echo(i) } # Output: # 1 # 5 # 10 # 20 # 50 # See "Iterators" i = 0 while i<10 { echo("While loop, iteration $i") i += 1 # Same as "if i == 2 { break }" i == 2 breaks } # Output: # While loop, iteration 0 # While loop, iteration 1
Switch and switch-like expressions
Switch and switch-like expressions work in both commands and code syntax.
- The first match activates the related code, there is no fall-through.
- The value resulting from executing the code is the result of the switch-like expression.
- If there is no match,
switch
,match
andcond
returnnull
. - If there is no match,
eswitch
,ematch
andecond
throwSwitchFail
exception.
Switch and switch-like expressions examples.
a = 10 result = switch a { 10 "ten" 20 "twenty" 30 { more_code(); "thirty" } } echo("Switch result for $a is $result") # Output: Switch result for 10 is ten a = "my_string" result = match a { Int "an integer" /^my_/ "special string" Str { more_code(); "a string" } Any "not sure" } echo("Match result for $a is $result") # Output: Match result for my_string is special string a = 12 result = cond { a > 10 "Excellent" a > 5 { more_code(); "Good enough" } a > 3 "so so" } echo("Cond result for $a is $result") # Output: Cond result for 12 is Excellent # SwitchFail exception will be thrown F will_throw_exception1() { a = "bad value" result = eswitch a { 1 "one" 2 "two" } } # SwitchFail exception will be thrown F will_throw_exception2() { a = true result = ematch a { Int "an integer" Str "a string" } } # SwitchFail exception will be thrown F will_throw_exception3() { a = 10 result = econd { a > 15 "one" a > 20 "two" } }
Regular expressions
myregex = /^begin/ echo(myregex) # Output: <RegExp> mymatch = "beginABC" ~ myregex echo(mymatch) # Output: <MatchY matches=['begin'] named={} positions=[[0,5]] whole=begin before= after=ABC> echo(mymatch.matches[0]) # Output: begin echo(mymatch.after) # Output: ABC all_matches = "1a2bcd3efg" ~~ /([0-9])(.)/ each(all_matches, F(match) { echo("The character after the digit ${match.matches[1]} is ${match.matches[2]}") }) # Output: # The character after the digit 1 is a # The character after the digit 2 is b # The character after the digit 3 is e
Collector facility
Collector facility assists in building lists, hashes and other data structures. Collector facility should be used where map
and other functional facilities are not meeting the requirements or when using collector is cleaner. Typically, the code of the following forms is replaced by collector (or some mix or something similar to the following forms):
# --- 1 --- my_arr = [] for ... { my_arr.push(my_something_one) if ... { my_arr.push(my_something_two) } } # --- 2 --- my_arr = [] my_arr.push(my_something_one) for ... { if ... { my_arr.push(my_something_two) } } my_arr.push(my_something_three)
The collector
keyword wraps the following expression (or expressions if they are grouped using {...}
syntax) in a function with single argument, collect
.
collector
keyword can be followed by slash (/
) and initial expression, which defaults to an empty array.
mylist = collector { collect("HEADER") [10,20].each(collect) collect("FOOTER") } echo(mylist) # Output: ['HEADER',10,20,'FOOTER'] myhash = collector/{} { collect("first", -1) {"a": 1, "b": 2, "c":100}.each(F(k, v) { if v < 100 { collect("($k)", v*10) } }) collect("last", -2) } echo(myhash) # Output: {first=-1, (a)=10, (b)=20, last=-2} mysumm = collector/0 [1,10,100].each(collect) echo(mysumm) # Output: 111 # Code from stdlib: F flatten(arr:Arr) collector arr.each(F(subarr) { subarr.each(collect) })
Customizing collector facility
collector
keyword is implemented as a call to collector
method with two arguments: the initial value (which defaults to empty array) and the wrapped code, which is passed as a method.
By defining collector
method, the programmer can customize the behaviour. Here is implementation of collector for Arr
.
# Code from stdlib: F collector(a:Arr, body:Fun) { body(F(elt) a.push(elt)) a }
Here a
gets the initial value, body
is the wrapped code that comes after the collector
keyword, the anonymous function that is passed to body
becomes the collect
function inside the wrapped code.
Here is collector
for Hash
and it’s usage example:
# Code from stdlib: F collector(h:Hash, body:Fun) { body(F(k, v) h[k] = v) h } F mapk(h:Hash, mapper:Fun) collector/{} h.each(F(k, v) collect(mapper(k), v))
Running external programs
t = `echo -n text1` echo("[ $t ]") # Output: [ text1 ] seq = `seq 5`.lines() echo(seq) # Output: ['1','2','3','4','5'] proc = $(seq 3) each(inspect(proc), echo) # Output: # CommandsPipeline # command[0]: <Command options={} redirects=[<Redirect 1 null <CollectingPipeFromChildToParentProcess read_fd=4 write_fd=5>>] argv=[seq,3]> # process[0]: Process # process[0]: command = <Command options={} redirects=[<Redirect 1 null <CollectingPipeFromChildToParentProcess read_fd=4 write_fd=5>>] argv=[seq,3]> # process[0]: pid = 43582 # process[0]: exit_code = 0 # process[0]: exit_signal = 0 # process[0]: output on fd 1, stdout (3 lines): # process[0]: 1 # process[0]: 2 # process[0]: 3 # process[0]: output on fd 2, stderr (0 lines): data = ``echo '{"a": 1}'`` echo("Parsed data: $data, a is ${data.a}") # Output: Parsed data: {a=1}, a is 1
Assignment shortcuts
These are syntactically equivalent expressions:
a = a + 1 a += 1 a = a - 1 a -= 1 a = a * 1 a *= 1 a = a / 1 a /= 1 a = a % 1 a %= 1 a = a.f() a .= f() a = f(a) a = a.f(b) a .= f(b) a = f(a, b)
Language gotchas
This section will be expanded as I get feedback :)
NGS should not be your first language
I do not recommend NGS as your first language. Python, for example, would be a much better choice. Programming experience is assumed prior to using NGS.
Watch the version
NGS is under development. Breaking changes can happen. If you do anything important with NGS, it’s preferable to note the git revision you are using for reproducible installation. The plan is to stop breaking NGS when it reaches version 1.0.0. From that version, the behaviour will be common - patch level increase for fixes, minor for improvements, major for breaking changes.
Keyword arguments gotchas
Keyword arguments implementation is preliminary so:
# Keyword arguments are silently ignored if corresponding positional argument is passed. kwargs = {"a": 10} F f(a, **kw) a; f(1, **kwargs) == 1 # Keyword arguments for existing named parameters cause parameters not to match kwargs = {"a": 10} F f(a) a; f(1, **kwargs) == 1 # **kwargs silently override (consistent with literal hash) previous named arguments kwargs = {"a": 10} F f(a=1) a; f(a=10, **kwargs) == 1 # For speed and implementation simplicity reasons, the **kw parameter has all keys, # even if some of them matched and used for parameters. # This is somewhat likely to change in the future. kwargs = {"a": 10} F f(a=1, **kw) [a, kw]; f(**kwargs) == [10, {"a": 10}]
Namespaces gotchas
Since by default all variables are local, the following example will not add method to some_global_name
multimethod but will create namespace-local some_global_name
multimethod.
myns = ns { F some_global_name() ... } # Only available as myns::some_global_name(...) # Global some_global_name(...) will not be able to access the # above implementation.
The correct version to add method to a global multimethod is
ns { global some_global_name F some_global_name() ... } # Available as global some_global_name(...)
Comments syntax gotchas
Comments syntax is implemented in many places but not everywhere. If you get syntax error regarding comment, move it to somewhere nearby.
Mutable default parameter gotchas
The code below will probably not do what was intended. Note that the default parameter value is only computed once, at method definition time.
F f(x, a:Arr=[]) { a.push(x) a } echo(f(10)) # [10] echo(f(20)) # [10,20]
Same happens in Python and hence already described: http://docs.python-guide.org/en/latest/writing/gotchas/#mutable-default-arguments
Handlers
Handlers are called by NGS when a certain condition occurs.
A handler is a MultiMethod
. Like with any other multimethod, you can override what it does by defining your own method with the same name further down in the code. Since standard handlers are defined in stdlib.ngs which is typically loaded first, your own methods will be "further down".
method_not_found_handler
method_not_found_handler
is called when a multimethod was called but no method matched the arguments. Use F method_not_found_handler(callable:Fun, *args) ...
to add your behaviours.
global_not_found_handler
global_not_found_handler
is called on attempt to read from an undefined global variable. Sample usage from stdlib.ngs
F global_not_found_handler(name:Str) { require("${NGS_DIR}/autoload/${name}.ngs") }
Hooks
A hook is a Hook
object. Some hooks are called by NGS when a certain condition occurs. You are free to create and use your own hooks. When called, it executes all registered methods. The main difference vs calling a MultiMethod
is that using hook you get accumulative behaviour instead of overriding behaviour.
User-defined hook example:
{ h = Hook() h.push({ echo("A") }) h.push({ echo("B") }) h() } # Outputs one per line: A, B
Another way is to add named hook handlers (also a practical example):
exit_hook['cleanup_temp_files'] = F(exit:Exit) { # remove my temp files }
exit_hook
exit_hook
is called when NGS is about to exit. Typical cases are:
- All of the given code was executed.
throw
was invoked and no matchingcatch
was found.
Method signature: exit_hook(exit:Exit)
. Exit
currently has two keys: exit_code
(Int
) and exceptions
(Arr
). stdlib.ngs defines one standard exit hook.
$ ngs -pi 'exit_hook.Hash()' Hash of size 1 [print_exception] = <UserDefinedMethod <anonymous>(exit:Exit) at /usr/local/lib/ngs/stdlib.ngs:5345>
print_exception
prints exception details if an exception occurred.
Predicates
Methods such as filter
, filterk
, filterv
, etc take predicate as one of the arguments.
Traditionally, such methods (functions) took a method (function) as the predicate argument.
A predicate function is a function with one parameter and that returns a boolean.
NGS allows more powerful and convenient usage of predicates in these and other methods.
All kinds of objects can be passed as predicates.
NGS higher-order methods (functions) that have predicate parameters convert the predicates into predicate methods (functions) using the Pred
multimethod:
# Note that "predicate" can be of any type F filter(e:Eachable1, predicate) { pred = Pred(predicate) # <-- Converting predicate to a predicate function. t = typeof(e) ret = t() e.each() do F(elt) { if pred(elt) ret.push(elt) } ret }
This behaviour allows defining how any given type behaves as a predicate. Several built-in types have Pred
method defined for them.
Here are some examples of how predicate conversion described above can shorten the code:
# Would be {...}.filterk(F(k) k ~ /^b/) {"a1": 1, "a2": 2, "b1": 10}.filterk(/^b/) # {"b1": 10} # Would be {...}.filter(F(elt) elt.x == 7) [{"x": 10, "y": 20}, {"x": 7, "y": 30}].filter({"x": 7}) # [{x=7, y=30}] # Would be {...}.filter(F(t) t is Str) ["abc", 1, "def"].filter(Str) # [abc,def]
One can easily define how a type behaves as a predicate: define appropriate Pred
method.
Here is the Pred
method that allows .filter(SomeType)
such as .filter(Str)
example above:
F Pred(t:Type) F is_pred(x) x is t
That’s a method (Pred
) that returns a predicate method (function) which is called is_pred
.
Naming the returned method helps a bit when looking at backtrace. is_pred
, the predicate method, checks whether it’s argument is of type t
.
Note that using a method as predicate is still possible:
[1,2,11].filter(F(x) x > 10) # [11]
That is because Pred
method leaves it’s argument as is, if it is callable (a method for example):
F Pred(f:Fun) f
Iterators
Iterators give more flexibility and control as opposed to each
iteration.
The iterator protocol
Iterators must implement the following methods:
Bool(x:YOUR_ITER_TYPE)
which tells whether there are more valuesnext(x:YOUR_ITER_TYPE)
which returns the next value (or throwsNoNext
exception)
When for x in EXPR { BODY }
syntax is used, it is equivalent to the following:
_hidden_iter = Iter(EXPR) while _hidden_iter { x = _hidden_iter.next() BODY }
Referencing the iterator in the BODY
Since Iter(x:Iter)
is defined as x
, you can use the following solution if you want to use the for
syntax and have access to the iterator:
for x in my_iter=Iter(EXPR) { BODY # can manipulate my_iter for advanced control }
The above works as follows: my_iter=Iter(EXPR)
is an assignment which evaluates to an Iter
object. for
uses Iter
on that value to get an iterator but it is the same iterator that my_iter
references.
Built-in iterators
See Iter
type documentation to see which iterators are available in NGS.
Executing external programs
When a command (more precisely commands pipeline) is parsed, an CommandsPipeline
object is created. Then depending on the syntax, appropriate method is called with the CommandsPipeline
argument. The methods are: $()
, ``
, and ````
.
The syntactic options for executing external programs are:
- Being in top level syntax of an NGS script, which is the commands syntax
- In code syntax:
$(...)
- run the process and get the reference to it - In code syntax:
%(...)
- construct theCommandsPipeline
but do not run it. Useful for passing ready-to-run commands around:t=%(ls /); ...; $($t)
- In code syntax:
`...`
- capture the output - In code syntax:
``...``
- capture and decode the output
Note that if you want just to construct the command and not run it
External commands are represented with type CommandsPipeline
. In the simple case, there is exactly one command. In more complex cases CommandsPipeline
references several programs and pipes between them, for example constructed by $(my_prog_1 | my_prog_2 | my_prog_3)
.
$ ngs -pi '$(ls | wc -l)' CommandsPipeline command[0]: <Command options={} redirects=[<Redirect 1 null <WritingPipeBetweenChildren read_fd=4 write_fd=5>>] argv=[ls]> process[0]: Process process[0]: command = <Command options={} redirects=[<Redirect 1 null <WritingPipeBetweenChildren read_fd=4 write_fd=5>>] argv=[ls]> process[0]: pid = 87711 process[0]: exit_code = 0 process[0]: exit_signal = 0 process[0]: output on fd 1, stdout (0 lines): process[0]: output on fd 2, stderr (0 lines): pipe[1]: <CommandsPipe options={} name=|> command[1]: <Command options={} redirects=[<Redirect 1 null <CollectingPipeFromChildToParentProcess read_fd=6 write_fd=7>>,<Redirect 0 null <ReadingPipeBetweenChildren read_fd=4 write_fd=5>>] argv=[wc,-l]> process[1]: Process process[1]: command = <Command options={} redirects=[<Redirect 1 null <CollectingPipeFromChildToParentProcess read_fd=6 write_fd=7>>,<Redirect 0 null <ReadingPipeBetweenChildren read_fd=4 write_fd=5>>] argv=[wc,-l]> process[1]: pid = 87712 process[1]: exit_code = 0 process[1]: exit_signal = 0 process[1]: output on fd 1, stdout (1 lines): process[1]: 20 process[1]: output on fd 2, stderr (0 lines):
Substitution of command line arguments
$(ls $fname) # $var_name - expands to exactly one argument $(ls ${ EXPR }) # ${ EXPR } - expands to exactly one argument $(ls $*fnames) # $*var_name - expands to zero or more arguments $(ls $*{ EXPR }) # $*{ EXPR } - expands to zero or more arguments
EXPR
above can be a single expression or any number expressions, new-line or ;
separated, value of the last one is used for the substitution.
The examples above show $()
syntax but the substitution works the same for all syntaxes mentioned in syntactic options for executing external programs.
Options
Rationale: it is sometimes desirable to alter the behaviour of execution of an external program.
NGS provides syntax for passing arbitrary key-value pairs to augment CommandsPipeline
or Command
.
Syntax:
- Options for one command:
option1:value1 option2:value2 optionN:valueN command and arguments
inside$()
and friends and at the top level. - Options for one the whole pipeline:
option1::value1 option2::value2 optionN::valueN command_1 | command_2
inside$()
and friends and at the top level.
In both cases, the values must follow the colon immediately, no spaces allowed. Missing value defaults to true
.
Options:
$(nofail: my_command)
- (deprecated, use$(ok: my_command)
) no exception will be thrown asfinished_ok
always returnstrue
for commands with option.$(ok:CODES my_command)
-finished_ok
will returntrue
for the listedCODES
.CODES
can be a boolean (Bool
), single integer (Int
), an array of integers (Arr
ofInt
), or aNumRange
.$(top_level:: my_command)
- do not capture the output of the command. This is the default behaviour of the top-level commands syntax: the stdout connected to the stdout of the NGS process.Internally the
&
in$(my_command &)
sets the&
option totrue
. This mechanism can be used to modify the behaviour ofCommandsPipeline
prepared by%(...)
.ngs -pi '%(ls / &)' CommandsPipeline options: Hash of size 1 options: [&] = true command[0]: <Command options={} redirects=[] argv=[ls,/]>
`line: my_prog`
- return the first line of the output, without the new-line characters
WARNING: unrecognized options are silently ignored.
Input/Output Redirections
Redirections are represented by the Redir
type. When a command is parsed and converted to CommandsPipeline
, each Command
gets a (possibly empty) list of redirections (in the .redirects
property)
Redirections syntax (based on bash syntax):
- Overwrite :
my_command >myfile
- Append:
my_command >>myfile
- Read from:
my_command <myfile
- Specify file descriptor, overwrite:
my_command 2>myfile
- Specify file descriptor, append:
my_command 2>>myfile
- Capture:
my_command 2>${true}
. The default is to not capture stderr but rather output it immediately to NGS’ process stderr.
Running external programs examples:
$ ngs -pi '$(ok: ls xxx)' ls: xxx: No such file or directory CommandsPipeline command[0]: <Command options={ok=true} redirects=[<Redirect 1 null <CollectingPipeFromChildToParentProcess read_fd=4 write_fd=5>>] argv=[ls,xxx]> process[0]: Process process[0]: command = <Command options={ok=true} redirects=[<Redirect 1 null <CollectingPipeFromChildToParentProcess read_fd=4 write_fd=5>>] argv=[ls,xxx]> process[0]: pid = 26157 process[0]: exit_code = 1 process[0]: exit_signal = 0 process[0]: output on fd 1, stdout (0 lines): process[0]: output on fd 2, stderr (0 lines): $ ngs -pi '$(ok: ls xxx 2>${true})' CommandsPipeline command[0]: <Command options={ok=true} redirects=[<Redirect 1 null <CollectingPipeFromChildToParentProcess read_fd=6 write_fd=7>>,<Redirect 2 > <CollectingPipeFromChildToParentProcess read_fd=4 write_fd=5>>] argv=[ls,xxx]> process[0]: Process process[0]: command = <Command options={ok=true} redirects=[<Redirect 1 null <CollectingPipeFromChildToParentProcess read_fd=6 write_fd=7>>,<Redirect 2 > <CollectingPipeFromChildToParentProcess read_fd=4 write_fd=5>>] argv=[ls,xxx]> process[0]: pid = 26179 process[0]: exit_code = 1 process[0]: exit_signal = 0 process[0]: output on fd 1, stdout (0 lines): process[0]: output on fd 2, stderr (1 lines): process[0]: ls: xxx: No such file or directory
Exit codes handling when running external programs
Immediately after an external program finishes, it’s exit code (and in future, possibly other aspects) is checked using finished_ok
multimethod. The multimethod returns a Bool
. If it’s false
, an exception is thrown.
For unknown programs, finished_ok
returns false
for non-zero exit code and hence an exception is thrown. This behaviour can be modified with the ok:
option of the command.
NGS knows about some of external programs (false
, test
, fuser
, ping
) which might return non-zero exit codes which do not indicate an error. These commands do not cause exceptions for exit code one.
If there is a particular program that you are using which also returns non-zero exit codes which should not cause an exception, you should add your own method finished_ok
which handles that program. Please consider contributing this addition back to NGS.
External program boolean value
When converting to Bool
, for example in the expression if $(test -f my_file) { ... }
, all programs in the CommandsPipeline
must exit with code zero in order to get true
, otherwise the result of converting to Bool
is false
.
Constructing command line arguments
While it’s feasible to construct an array with command line arguments for a program, Argv
makes this task much easier by supporting common cases such as omitting a command line switch which has no corresponding value to use.
See Argv
multimethod.
Globbing
Globbing is not implemented yet.
Accessing environment variables
On NGS startup, the global variable ENV
is populated with the environment variables. ENV
is a Hash
object.
echo(ENV.HOME) echo(ENV["HO" + "ME"])
ENV
is also passed to external programs.
{ ENV.HOME = "/home/fake" $(my_prog) }
NGS exit codes
Both regular results of running a program and exceptions are converted to exit code using ExitCode
MultiMethod.
Built in methods of ExitCode
do the following conversion:
true
- 0false
- 1Int
object - the number itselfException
objects - 240FullBox
object - 0EmptyBox
object - 1Success
object - 0Failure
object - 1FatalError
objects - using theexit_code
field of the exception, which if not given toFatalError
constructor defaults to 1.CommandsPipeline
/Process
objects - the exit code of the external commandMatchY
object - 0MatchN
object - 1- All others - 0
Special cases:
- An exception and additional exception when converting the first exception using
ExitCode
- 241 - An exception and
exit_hook
failure - 242 - An exception, additional exception when converting the first exception using
ExitCode
, andexit_hook
failure - 243 - Non-Exception was thrown and then caught by C code - 245
- NGS could not find the bootstrap file - 246
- NGS could not open bootstrap file - 247
- NGS could not parse bootstrap file - 248
Customizing translation of your types/objects to exit code
Define F ExitCode(t:MyType) {...}
method that returns Int
to customize the exit code.
Declarative primitives
Some of NGS libraries (currently only AWS) implement declarative primitives approach. Declarative primitives approach is to expose functions to manipulate resources in an idempotent way. Similar to configuration management systems, exposed resources are manipulated by declaring their desired state. Unlike configuration management systems, this approach does not deal with dependencies and order resolution. Declarative primitives approach provides convenient scripting facility, not a configuration management system.
Related methods
Constructors
Resources references are created using constructors, such as AWS::Vpc
. Resources references are of type which is a subtype of ResDef
.
my_vpc = AWS::Vpc() # Reference to all VPCs that can be fetched using AWS CLI echo(my_vpc) # Output: <Aws::Vpc anchor={regions=null, Tags={}}> echo(AWS::Vpc().typeof()) # Output: <Type Vpc> echo(AWS::Vpc().typeof().parents) # Output: [<Type ResDef>]
find
find
- find the specified resources. Almost never should be called manually. expect
, converge
, each
and other methods run find
automatically if needed (if it did not run previously on the reference).
my_vpc = AWS::Vpc().find() echo(my_vpc) # Output: # <Aws::Vpc vpc-ef...,vpc-ef...,vpc-bf... anchor={regions=null, Tags={}}>
Empty result is not an exception for find
.
expect
expect
- check that referenced resources exist. Throw an exception otherwise. Used to assert assumptions about the system. If you write a script that for example counts on exactly one VPC that matches your description to exist, I strongly recommend to be explicit and state your expectation as .expect(1)
.
my_vpc = AWS::Vpc().expect() # If no VPCs exist - throws exception echo(my_vpc) # Output: # <Aws::Vpc vpc-ef...,vpc-ef...,vpc-bf... anchor={regions=null, Tags={}}> # Find VPCs with name tag "XX" my_vpc = AWS::Vpc({"Name": "XX"}).expect() # If no VPCs exist - throws exception # ... Exception of type AssertFail occurred ... # Find VPCs with the given tag my_vpc = AWS::Vpc({"aws:cloudformation:stack-name": "edge-vpc"}).expect(1) # Would throw exception if there was not exactly one resources matching the description echo(my_vpc) # Output: # <Aws::Vpc vpc-ef... anchor={regions=null, Tags={aws:cloudformation:stack-name=edge-vpc}}>
converge
converge
- Creates or updates resources if needed in order to match given properties.
Example from production code, shortened for brevity:
# Library that describes ELB creation. The system has several ELBs which are similarly configured. # The library contains this common configuration. ns(c=converge) { F converge(env, role, dns=null) { edge_vpc_anchor = {'aws:cloudformation:stack-name': 'edge-vpc'} vpc = AWS::Vpc(edge_vpc_anchor).expect(1) tags = {'env': env, 'role': role, 'creator': 'system/images/elb.ngs'} # Anchor: name + vpc_id elb_sg = AWS::SecGroup("https-server", vpc) elb = AWS::Elb("${env}-${role}") # ---> Converge happens here <--- # Load balancer is either created or updated here elb.c( Tags = tags + if dns { {'DNS': dns} } else { {} }, ListenerDescriptions = ..., Subnets = AWS::Subnet(edge_vpc_anchor).expect(2), SecurityGroups = elb_sg, HealthCheck = ..., Instances = AWS::Instance({'env': env, 'role': role}).expect() ) } }
delete
delete
deletes referenced resources.
# Delete referenced resources, in this case one specific load balancer AWS::Elb("prod-mobile-redirect").delete() # Delete all unused load balancers AWS::Elb().reject(X.Instances).delete()
Anchor
Anchors describe which resources are referenced.
As you probably have noticed, resource references, which are created using constructors, get some parameters. These parameters are called "Anchor". Note that all parameters passed to the constructors are called "Anchor" collectively.
Examples of Anchors:
- Tags hash:
AWS::Instance({'env': env, 'role': role})
- Explicit tags:
AWS::Instance(Tags={'env': 'prod', 'role': 'ssh-sandbox'})
- Resources specific properties:
AWS::Vpc(IsDefault=true).expect(1)
- Tags and resource-specific properties:
AWS::Vpc(IsDefault=false, Tags=...).expect(1)
Any combination of explicit tags and resource-specific properties is possible.
Properties
Properties describe the desired state of resources. The parameters to converge
are properties. See the converge
example above.
Planned future changes
regions
in AWS library will probably be renamed to_Region
or something else starting with underscore to emphasize it’s a syntactic property, not something that correlates to AWS CLI output.- Currently there is no way to know whether
converge
has created resources or updated them. There will be a way to tell.
Debugging
This section is planned to grow over time.
MethodNotFound exception
$ ngs -pi 'map(1,2,3)' ... Exception of type MethodNotFound occurred ... # Means the parameters did not match any of the existing methods. # See which methods are defined: ngs -pi map MultiMethod Methods Array of size 7 [0] = <UserDefinedMethod map(e:Eachable, mapper:Fun) at /usr/local/lib/ngs/stdlib.ngs:242> [1] = <UserDefinedMethod map(arr:Arr, n:Int, mapper:Fun) at /usr/local/lib/ngs/stdlib.ngs:1600> [2] = <UserDefinedMethod map(h:Hash, mapper:Fun) at /usr/local/lib/ngs/stdlib.ngs:2209> [3] = <UserDefinedMethod map(fb:FullBox, mapper:Fun) at /usr/local/lib/ngs/stdlib.ngs:2550> [4] = <UserDefinedMethod map(eb:EmptyBox, mapper:Fun) at /usr/local/lib/ngs/stdlib.ngs:2555> [5] = <UserDefinedMethod map(s:Success, fun:Fun) at /usr/local/lib/ngs/stdlib.ngs:2762> [6] = <UserDefinedMethod map(f:Failure, fun:Fun) at /usr/local/lib/ngs/stdlib.ngs:2771>
AUTHORS
- Ilya Sher