pamplemousse’s blog

Solving LinkedIn’s Queens game with CodeQL

2024-08-14T00:00:00+00:00

CodeQL is a technology to do static analysis of software program sources. Although aimed to extract information from codebases, its implementation relies on logic programming, which allows to ab use it to solve ~~video games~~ logic puzzles.

Our target of the day, seemingly heavily inspired from star battle, will be LinkedIn’s Queens game. Our goal is to find a way to solve any grid of the game leveraging CodeQL.

Introduction

CodeQL

CodeQL usually loosely refers to a set of components: an engine, a query language, libraries, etc.

From a high level, it operates by hooking into the compilation steps building a program to gather facts about its source code that are consolidated into a database. One can then write queries in “QL”¹, to extract information out of this database. Usually, information about where vulnerabilities are lurking in one’s code.

The syntax of QL is similar to SQL, but the semantics of QL are based on Datalog, a declarative logic programming language often used as a query language. This makes QL primarily a logic language, and all operations in QL are logical operations. ²

So, CodeQL inherits from Datalog, a logic programming language.

Logic Programming

For those bred to imperative programming languages, that are used to write programs expressing how to solve a particular problem, logic programming is a bit of a surprising beast that is confusing to tame.

With logic programming,

you specify what you want to achieve rather than how to achieve it. In other words, you define the rules and constraints that govern the data, and the system automatically derives the answers to your queries. ³

Now we have our secret recipe: instead of trying to compute the solution ourselves, we “just need” to express facts and constraints about the game we will be trying to solve, and let CodeQL deduce the solution for us.

Queens

Queens is a puzzle game where the goal is to place a set of “queens” on a small chess-like board with colored cells, respecting the following rules:

Each row, column, and colored region must contain exactly one queen.
Queens cannot be placed in adjacent cells, including diagonally.

Here is an example of an empty grid:

On paper

So, how can we “encode” the game of Queens as facts and constraints that we will have CodeQL reason about?

Encoding the game setup:
1. The board’s cells can be uniquely represented by their cartesian coordinates on the grid;
2. The board is partitioned into zones of different colors, which gives that each zone can be represented as a set of coordinates;
3. All queens end on cells, therefore the endgame solution can also be also represented as a set of coordinates.
Encoding the game rules:
1. As per rule 1., any pair of queens cannot on the same row, column, nor zone;
2. As per rule 2., any pair of queens cannot be adjacent, including diagonally.

… and that’s it.

We can know reformulate what we want to do:

After encoding the different zones, we will let CodeQL find a set of values for coordinates of cells to put queens on (i.e. 1.3.) respecting the rules (i.e. 2.);
If CodeQL finds values that fit our constraints, we place our queens on the represented cells and call it a day.

Implementation

Top level query

At a high level: we want to be given the set of cells to places our queens on. These cells must respect the predicates representing the constraints 2.1. and 2.2. we deduced from the rules.

Here is the query we get:

from
  Cell queen_1, Cell queen_2, Cell queen_3, Cell queen_4, Cell queen_5, Cell queen_6, Cell queen_7,
where (
  allDifferentRowsColumnsAndNotAdjacent(
    queen_1, queen_2, queen_3, queen_4, queen_5, queen_6, queen_7
  )
  and
  inEachZone(queen_1, queen_2, queen_3, queen_4, queen_5, queen_6, queen_7)
)
select queen_1, queen_2, queen_3, queen_4, queen_5, queen_6, queen_7

Let’s implement the elements that we are missing for this query to work: a Cell class, and the two predicates allDifferentRowsColumnsAndNotAdjacent, and inEachZone.

Cell representation

As we discussed earlier, we see a Cell as a pair of Coordinate representing respectively the row and column we can find the cell at. The grid we are solving is \(7*7\) cells big, so coordinates are between 0 and 6. We use QL classes to implement these representations.

class Coordinate extends int {
  Coordinate() {
    this in [0..6]
  }
}

class Cell extends string {
  Coordinate x;
  Coordinate y;

  Cell() { this = x.toString() + y.toString() }
}

Rules predicate

We use predicates to encode the rules of the game.

All queens are on different rows, columns, and not adjacent

Constraint 2.1. states that any two pair of queens can’t be on the same row, same column, nor be adjacent to each other. Therefore, we can restrict all the cells we should be given, saying that for any pair of different cells a and b among them, a and b must not be on the same row, same column, nor be adjacent.

predicate allDifferentRowsColumnsAndNotAdjacent(Cell c1, Cell c2, Cell c3, Cell c4, Cell c5, Cell c6, Cell c7) {
  forall(
    Cell a, Cell b
    | a in [ c1, c2, c3, c4, c5, c6, c7 ] and
      b in [ c1, c2, c3, c4, c5, c6, c7 ] and
      a != b
    | not a.sameRow(b) and
      not a.sameColumn(b) and
      not a.adjacent(b)
  )
}

We see that we need to enrich our Cell class with the member predicates sameRow, sameColumn, and adjacent:

class Cell extends string {
  // ...

  Coordinate getX() { result = x }
  Coordinate getY() { result = y }

  predicate sameRow(Cell c) {
    x = c.getX()
  }

  predicate sameColumn(Cell c) {
    y = c.getY()
  }

  predicate adjacent(Cell c) {
    exists(
        int i, int j
        | i in [-1..1] and j in [-1..1]
        | x + i = c.getX() and y + j = c.getY()
    )
  }
}

One queen per colored zone

As mentioned earlier, we need to encode the specific color zones of the board we are trying to solve. This happens through the QlBuiltins::InternSets parameterized module, that has a Set class exposing a contains predicate that tells if a given element belongs to a selected set.

Cell getAValue(string zone) {
  zone = "purple" and result = ["00",]
  or
  zone = "orange" and result = ["01", "10", "11", "12", "21",]
  or
  zone = "blue" and result = ["02", "03", "13", "20", "22", "23", "30", "31", "32",]
  or
  zone = "green" and result = ["04", "05", "06", "14", "16", "24", "26", "36", "46",]
  or
  zone = "pink" and result = ["33", "34", "43",]
  or
  zone = "grey" and result = ["15", "25", "35", "44", "45", "51", "52", "53", "54",]
  or
  zone = "cay" and result = ["40", "41", "42", "50", "55", "56", "60", "61", "62", "63", "64", "65", "66",]
}

module Z = QlBuiltins::InternSets;

predicate inEachZone(Cell c1, Cell c2, Cell c3, Cell c4, Cell c5, Cell c6, Cell c7) {
  Z::getSet("purple").contains(c1) and
  Z::getSet("orange").contains(c2) and
  Z::getSet("blue").contains(c3) and
  Z::getSet("green").contains(c4) and
  Z::getSet("pink").contains(c5) and
  Z::getSet("grey").contains(c6) and
  Z::getSet("cay").contains(c7)
}

And with that, we have completed the query we need to solve the example instance of the game we want to crack.

A word on inefficiency

I was tempted at first to break down the logic of allDifferentRowsColumnsAndNotAdjacent into three separate predicates that follow the same structure and logic, except they apply the constraints “one by one”.

predicate allDifferentRows(Cell c1, ... , Cell c7) { ... }
predicate allDifferentColumns(Cell c1, ...,  Cell c7) { ... }
predicate allNotAdjacent(Cell c1, ... , Cell c7) { ... }

Doing so lead to queries that were being solved extremely slowly by the engine (I cut the process short after 60+ minutes without a result).

Inspired by this behaviour, I also thought that I should inversely get rid of the inEachZone predicate, to move its inner logic into the other predicate.

predicate allDifferentRowsColumnsZonesAndNotAdjacent(Cell c1, Cell c2, Cell c3, Cell c4, Cell c5, Cell c6, Cell c7) {
  forall(
    Cell a, Cell b
    | a in [ c1, c2, c3, c4, c5, c6, c7 ] and
      b in [ c1, c2, c3, c4, c5, c6, c7 ] and
      a != b
    | not a.sameRow(b) and
      not a.sameColumn(b) and
      not a.adjacent(b) and
      not a.sameZone(b)
  )
}

But that also led to an explosion of the query run time.

The CodeQL documentation⁴ mentions a couple of usual suspects when having performance issues. After investigating, and trying to tweak my solution, I still failed to fully make sense of what I experienced.

The code I shared yields a correct solution, and is fast (enough), as we will see. But I am not able to explain why it does so, and why the trials presented in this section don’t, without a lot of hand waving. Diving further into the troubleshooting of CodeQL query performance appeared to be yet another rabbit hole, and will therefore either be left as an exercise to the reader, or be covered in a future post.

That being said, if you are knowledgeable about CodeQL inner workings, and are kind enough to be willing to explain to me why my alternative pieces of code have been inefficient, I would be very happy to hear from you⁵.

Plumbing

As previously presented, CodeQL is a set of tools for analyzing codebases. Therefore, we need a little bit of extra work to bootstrap an environment in which we can run QL queries for solving our puzzle:

We initialize a pack for the database to be generated well⁶;
And we create an empty database (we randomly pick a language to target - which doesn’t matter for solving our logic puzzle).

codeql pack init pamplemousse/queens
codeql database create --language=javascript-typescript empty.db

And we are finally ready to run our query⁷ to solve the game:

$ codeql query run Queens.ql --database=empty.db
[...]
Starting evaluation of queens/Queens.ql.
Evaluation completed (299ms).
| queen_1 | queen_2 | queen_3 | queen_4 | queen_5 | queen_6 | queen_7 |
+---------+---------+---------+---------+---------+---------+---------+
| 00      | 21      | 13      | 46      | 34      | 52      | 65      |
Shutting down query evaluator.

Under half a second, we get our answer. Remember, from how we declared our Cell class, each queen_i then represents a pair of coordinates: first digit is the row number, and second is the column number. Placing the queens on the board according to the values CodeQL returned will give us:

Et voilà !

All sources shared in this post are available on SourceHut, in the git.sr.ht/~pamplemousse/Queens repository.

Conclusion

The idea of solving a puzzle using CodeQL has probably partly been inspired by the QL tutorials, which I highly recommend if you want to practice your query writing skills.

This motivated me to look into more details about how logic programming languages are actually implemented. Maybe investigating the performance questions would be a great mean to learn more about how the CodeQL engine is implemented… someday.

Thanks for following this epic match, where we threw two almost unrelated things at each other, and took pleasure in it.

“QL” stands for Query Language. CodeQL often casually refers to any, or multiple, part(s) of the whole shebang, but documentation uses “QL” to lift the ambiguity and talk about the programming language the queries are written in. ↩
c.f. https://codeql.github.com/docs/ql-language-reference/about-the-ql-language/#properties-of-ql . ↩
c.f. https://datalog.dev/article/Introduction_to_Datalog_programming_language.html ↩
c.f. https://codeql.github.com/docs/writing-codeql-queries/troubleshooting-query-performance/ ↩
I am reachable via Matrix, Mastodon, or via email. ↩
Without a qlpack.yaml, CodeQL is able to create a database, but that one seems then to be lacking folders to be able to run our query later. ↩
I ran the query on a Dell XPS 15 9560, with an Intel Core i7-7700HQ CPU (quad cores, 2.80GHz base frequency), and 32GB of RAM, running on NixOS 24.05. ↩

Handle function calls during static analysis in angr

2021-02-25T00:00:00+00:00

On the research project I work on at SEFCOM, I use angr to statically analyse binary programs.

Incidentally, I was invited to give a presentation as part of CSE545 during the Fall semester of 2020 at ASU. This talk was meant to be a hands-on introduction on data-flow analysis, using angr, to find “taint-style” vulnerabilities ¹ in binaries. Thanks to the one of the class’s TA, the video recording is available. Furthermore, I published the slides of the presentation, as well as the illustrating code examples.

Some of the examples do not work “as is”. It means that for the people trying to reproduce it, extra elbow grease is necessary. Sadly, it can be somewhat of a tedious (and painful) process: angr is not really stable (its API evolves wildly depending on the needs of people working on it), and documentation is helpful, but not self-sufficient.

This post is aiming to bridge the gap for who would like to get similar examples working. In particular, by answering the question:

How to write a function handler to simulate the effect of a function on the state of the analysis?

This post is divided in four sections:

Context: Presentation of the analysis, and problems encountered;
Usage and description: Runthrough of the documentation, implementation requirements and first thoughts;
Examples: Examples of handlers for a local and an external function;
One step beyond: Inter-procedural analysis: Discussion and high-level overview of turning ReachingDefinitionsAnalysis inter-procedural using function handlers;
Conclusion: A closing proclamation.

If you already know what “function handler” means, you can skip the Context section, and start reading from the Usage and description section.

Context

At a high level, we can use a static analysis to gather data-flow facts about the variables of programs without executing them. To do so, such analysis somewhat sequentially interprets the effects of program’s statements on the state it keeps track of ².

But what if such a statement is a function call?

Well, the analysis could continue on the statements of the targeted function, and then jump back to where it was once the function returns.

… And what if this function is an external function? For example provided by a dynamically linked library?

Ah! In such case, the statements that make up the content of the targeted function (its implementation in the binary) are not directly available for analysis.

One thing to note is that we don’t really want to analyse a external library as part of the process: We want to focus on the binary at hand, and prefer to avoid spending resources (computing time and memory) tracking what happens “outside” of it…

Most of the time though, we “know” what a library function does. Here are examples of what we “know” about a couple of libc functions:

printf: Uses several parameters to deterministically compose a string, and write it to stdout;
malloc: Allocates a chunk of memory of size determined from its first parameter, and return a pointer to it;
strcpy: Copies the content of its second parameter into the memory area pointed to by its first parameter.

From the program perspective, and thus the analysis perspective, these functions are black boxes: their implementation details remain hidden. However, we are only interested in the effect such functions have on the state of the system when the program is running; From the analysis perspective, the effect they have on the representation of this state.

So?

What we need in both cases is a mechanism to produce the effect of a function on the state representation managed by the analysis. This is achieved using function handlers.

In the first case (local function), a function handler should drive the analysis to the function called, and return adequately; In the second case (external function), a function handler should update the analysis state respecting the “known” function behavior.

Usage and description

We are implementing our analysis using angr’s ReachingDefinitionsAnalysis. As described in the documentation, it takes an optional function_handler parameter.

To work, what is passed via function_handler needs to inherit from the FunctionHandler astract base class: As you can see in the documentation of FunctionHandler, it means that the given function_handler must have the following methods:

hook: A mean for the handler to have a reference to an analysis, to be able access to information about its context (architecture, facts gathered in the knowledge base, etc.). In particular, ReachingDefinitionsAnalysis calls it at initialisation;
handle_local_function: That the analysis will run when it encounters a call to a local function.

Those are the minimal requirements for a function_handler to have.

Then, for ReachingDefinitionsAnalysis to be able to deal with say printf, malloc, or strcpy, we would add the corresponding methods: handle_printf, handle_malloc, and handle_strcpy to the concrete class inheriting from FunctionHandler. For example, such a concrete class MyHandlers, would produce instances exposing handle_printf, that will be called during the analysis when a call to printf is encountered in the binary (and respectively handle_malloc, handle_strcpy for calls to malloc, strcpy) ³.

To recap, and because the terminology is somewhat confusing:

A “function handler” is a (Python) method that will be called by the analysis when encountering a call instruction;
FunctionHandler is an ABC class that describe what a concrete class (say MyHandlers) to have to work with angr;
function_handler is the name of the parameter to pass the ReachingDefinitionsAnalysis; It’s a kind of MyHandlers, and thus of FunctionHandler, exposing “function handlerS”;

Examples

Let’s see what it looks like in practice.

Binary to analyse

We will analyse the binary produced by command_line_injection.c . Here is how to download and compile the code:

git clone git@github.com:Pamplemousse/bits_of_static_binary_analysis.git
cd bits_of_static_binary_analysis
make

If everything went fine, running ./build/command_line_injection ~/ should list your home directory.

The simplest analysis

The most straightforward analysis starting from the function main looks like the following analysis.py:

from angr import Project

project = Project('./build/command_line_injection', auto_load_libs=False)
cfg = project.analyses.CFGFast(normalize=True, data_references=True)

main_function = project.kb.functions.function(name='main')
program_rda = project.analyses.ReachingDefinitions(
    subject=main_function,
)

# Do domething with `program_rda`
...

However, as is, the analysis is intra-procedural: it only runs on the function main. Pleasantly, when executing python analysis.py, angr warns us with the following "Please implement the local function handler with your own logic."; So we know it encountered a call to a local function, and he felt helpless. Poor angr.

Handle local functions

We can improve analysis.py to give the ReachingDefinitionsAnalysis the necessary handle_local_function that will get triggered when analysing main, precisely on the instruction calling check.

from angr import Project
from angr.analyses.reaching_definitions.function_handler import FunctionHandler


class MyHandler(FunctionHandler):
    def __init__(self):
        self._analysis = None

    def hook(self, rda):
        self._analysis = rda
        return self

    def handle_local_function(self, state, function_address, call_stack, maximum_local_call_depth, visited_blocks,
                              dependency_graph, src_ins_addr=None, codeloc=None):
        function = self._analysis.project.kb.functions.function(function_address)

        # Break point so you can play around with what you have access to here.
        import ipdb; ipdb.set_trace()
        pass

        return True, state, visited_blocks, dependency_graph

project = Project('./build/command_line_injection', auto_load_libs=False)
cfg = project.analyses.CFGFast(normalize=True, data_references=True)

handler = MyHandler()

main_function = project.kb.functions.function(name='main')
program_rda = project.analyses.ReachingDefinitions(
    function_handler=handler,
    observe_all=True,
    subject=main_function
)

# Do domething with `program_rda`
...

Running python analysis.py, we now get a shell thanks to the breakpoint placed in the handle_local_function. From there, I invite you to investigate and play around with what you can do; And remember: you have access to a lot of facts gathered by angr through self._analysis.project whether it be .arch, .kb, etc.

Handling external functions

As presented earlier, handlers can also be triggered on calls to library functions, and used to model the effects of code that cannot be directly analysed. In our example, we can see that the function check calls the libc function sprintf.

Here is a new analysis.py that showcases how to have the analysis to consider this call; With a richer MyHandler, containing a handle_sprintf method.

from angr import Project
from angr.analyses.reaching_definitions.function_handler import FunctionHandler


class MyHandler(FunctionHandler):
    def __init__(self):
        self._analysis = None

    def hook(self, rda):
        self._analysis = rda
        return self

    def handle_local_function(self, state, function_address, call_stack, maximum_local_call_depth, visited_blocks,
                              dependency_graph, src_ins_addr=None, codeloc=None):
        function = self._analysis.project.kb.functions.function(function_address)
        return True, state, visited_blocks, dependency_graph

    def handle_sprintf(self, state, codeloc):
        # Break point so you can play around with what you have access to here.
        import ipdb; ipdb.set_trace()
        pass

        return True, state

project = Project('./build/command_line_injection', auto_load_libs=False)
cfg = project.analyses.CFGFast(normalize=True, data_references=True)

handler = MyHandler()

sprintf_plt_stub = project.kb.functions.function(name='sprintf', plt=True)
program_rda = project.analyses.ReachingDefinitions(
    function_handler=handler,
    observe_all=True,
    subject=sprintf_plt_stub
)

# Do domething with `program_rda`
...

Notice that for the sake of example simplicity, the analysis gets started on the sprintf PLT stub reconstituted by angr. If it was not, this example would be hitting the handle_local_function first, because check has a call instruction pointing to a PLT location, which is not at an external address! In other words, handling external functions that are called using the PLT mechanics, requires to start a ReachingDefinitionsAnalysis on the targeted PLT stub, with the proper handler.

Ideally, we would like to start the analysis on the function check, and expect the handle_sprintf to be called sometime: In particular, the analysis should use handle_local_function to point the analysis at the PLT stub, which in turn should end up triggering the handle_sprintf.

Coincidentally, this is a special case of a more generic problem: How to perform inter-procedural analysis?

One step beyond: Inter-procedural analysis

With real world programs, it is very unlikely that all the responses to analysts’ questions are waiting at a shallow level. Most of the time, we want to start the analysis from the entrypoint of the binary, and expect it to carry on across function calls until we get the information we were looking for. In our example, this means starting the ReachingDefinitionsAnalysis on the main function, and expecting it to analyse check, as well as calling handle_sprintf.

Because we want an analysis to run over multiple functions, we need an inter-procedural analysis. Sadly, this is currently not implemented in angr main repository!

In the presentation, and the corresponding video segment I however presented at a “high level” how we can turn angr’s ReachingDefinitionsAnalysis into an inter-procedural analysis.

The idea is to run it recursively: every time a call to a local function is encountered, a “child” ReachingDefinitionsAnalysis is started on the targeted function, and, once finished, the analysis state at its end is “copied” back to the parent, for it to continue from (after the call instruction).

Its implementation relies on function handlers. In particular, handle_local_function is where the “recursiveness” happens:

It starts the child ReachingDefinitionsAnalysis on the targeted function, with proper parameters (passing the current kb, initialising the child with the parent state using the init_state parameter, forwarding the function_handler);
It updates the parent’s .observed_results when the child returns, for the parent to be aware of what was captured during the child’s run;
It returns the state (which contains the current live_definitions) for the parent to continue from, as well as other structures the analysis records (visited_blocks, dep_graph).

Some functions can have several exits (in the case of multiple return statements in the source for example), and thus several output states from the analysis perspective! In such case, the handle_local_function must merge those states together to create a unique one for the parent analysis to resume from.

Conclusion

Function handlers are a handy tool for angr’s static analysis using ReachingDefinitionsAnalysis: they can be leveraged to apply the effect of external function to the state without having access to their implementation.

By applying the same principle on local functions, they even bring us one step beyond: inter-procedural analysis is nothing more than customization of the analysis behavior (recursiveness, state management, and internal bookkeeping) on call instructions.

Hoping you found those examples enlightening, happy hacking!

By “tainting” a variable taking a value from a user input, and propagating this taint on use, one can find other variables that can be influenced by a user input. Tainted variables being used for sensitive operations (arguments to execve, or system, affectation to a buffer of fixed size, etc.) points to potential security vulnerabilities. ↩
If you want to learn more details about how the analysis works, and a more concrete example of such analysis, I strongly encourage you to go look at the presentation mentioned above, available on YouTube. ↩
For those interested in the underlying mechanics on the angr side of things, the handler’s instance method is called in angr/analyses/reaching_definitions/engine_vex.py. ↩

Use SMT Solvers to generate crossword grids (3)

2019-11-13T00:00:00+00:00

This post is part of a series on using SMT Solvers to generate crossword grids.

Introduction to SMT, and programming with SMT Solvers;
Definitions and first formulas;
Plumbing everything together, complete formula, and results (currently reading).

Thanks @geistindersh for his feedback, and corrections!

In the two previous posts, we covered how to represent:

Valid words;
The potential values they can have, taking their length into consideration;
And their “crossing points”, i.e., the characters some of them must have in common.

We presented how the formulas derived from these representations are fed into a Solver, that would lead us to a set of values that we then “mapped back” into the grid to have it completed.

Although the savant part of the job is done, a couple of points that are left to discuss to end the series:

Automate the formula generation, from potentially different grid frames;
Present measurements we made and results we had;
~~Cry over the lack of efficiency~~ Discuss some potential improvements.

Formula generation

In the previous post, we presented how to write formulas to encode crossword grids constraints.

So far, the process has been very manual: declaring a variable for each word, and explicitly adding the “intersection” constraint. One can easily see how this can become arduous as we will want to generate bigger grids.

Ambitiously, our ultimate goal is to be able to generate a grid such as this real world one, which dimension is 17x12, counts 64 words and 162 intersections in total.

Because we interact with Z3 using its Python API, it makes it easy for us to write a program in this language to do all the fancy plumbing we need to:

Create the variables from a grid frame;
Formulate the constraints using these variables: values words can take, and the common letters they must respect on intersections;
Deal with a large wordlist: 200 000+ words.

Variables

Unlike in the previous post, where we had to deal with a small number of words to represent, we expect here to deal with a consequent number of variables.

Naming them \(\text{horizontal}\), or \(\text{vertical}\) would be very limiting; Still, using a naming scheme to help us locate words in the grid from the name of the variable use to represent them is an helpful idea.

Hence, let’s arbitrarily decide that our variable names will follow the pattern: \(direction\_x\_y\), where \(x\) and \(y\) are respectively the line and column components of the coordinate of the first letter of the word, and \(direction\) takes the value h or v if the word is either horizontal or vertical. We count coordinates respecting the French reading direction. So, the top left corner has coordinates \((0, 0)\).

Here is an example of a portion of a grid:

\(h\_1\_0\) represents the horizontal word which first letter is on the cell \((1, 0)\), in light blue;
\(v\_0\_1\) represents the vertical word which first letter is on the cell \((0, 1)\), in light orange.

Grid

The biggest grid we are aiming to represent counts 64 words to determine. Hence, we need to create the same number of variables, respecting the naming convention we just presented.

Doing so manually would be time consuming, especially if we expect to represent different grid frames (I did). So, I wrote a Python program to generate the variables out of a grid represented as an array of 0s and 1s, as shown in the following picture:

What you see, What I see.

From this representation, a complete formula can be generated (the different variables, with the values they can take, and the intersection constraints).

Stop waving your hands. Where is the code?

The complete code is available in a GitHub repository, and too long to completely expose here. Don’t let the amount of files intimidate you, the principles exposed in this series are the one implemented. Let’s briefly present its content:

francais.txt: The wordlist where the words are chosen from;
generate_dictionary.py: A script to generate this “normalised” wordlist (remove diacritics, deduplicate) out of a French wordlist found online;
dictionary.py: The representation of the wordlist as a Python structure, with the logic of “splitting” it into several pieces per the word size (as detailed in the previous post).
grid.py: The scanning of a grid from 0s and 1s, and interface to query grid related content, such as: list of words and intersections, with their coordinates, following the convention exposed above.
test_grid.py: Some unit test for the above logic;
solve.py: The central piece, making use of the above components to generate the formula as exposed in this series, call the solver, and print a solution ( ~~when~~ if found).

Results

I ran the solve.py program on a Lenovo x220, with an Intel Core i5-2520M CPU (dual core, 2.50GHz base frequency), and 8GB of RAM, running on NixOS 20.03 ¹. I used different parameters, varying the size of the grid and wordlists, at first to ensure that it worked as expected and produced valid solutions, then to measure its efficiency.

The wordlists’ sized has been reduced by shuffling the original (to keep a certain diversity among the options), then selecting only the first elements from it. I took three different grid sizes: small, medium, and large (respectively 6x3, 12x6, and 17x12); With the two smallest truncated from the original 17x12 frame.

Here are the results obtained with our solution, implemented with the code presented earlier:

On a 6x3 grid, with wordlists each reduced to 200 words, SAT with a solution, in ~6 seconds;
On a 6x3 grid, with complete wordlists, SAT with a solution, in ~1.5 hours;
On a 12x6 grid, with wordlists each reduced to 200 words, UNSAT, in ~5 minutes;
On a 12x6 grid, with wordlists each reduced to 500 words, UNSAT, in ~6 hours;
On a 12x6 grid, with complete wordlists, Unknown, timed out after ~100 hours;
~~On a 17x12 grid, with complete wordlists …~~

Generation is working fairly quickly on small grids, using a reduced number of words to pick from. However, although the production of the formula increases linearly (in the size of the grid, and number of words per wordlists), the time it takes for the Solver to solve a given query grows at least quadratically (if not exponentially).

In the end, I did not get the patience to run the experiment for the targeted 17x12 grid.

Improvements

During the development of this idea, the evolution of the code to support it, and the writing of this series, some points of interest regarding future improvements arose.

First, we point out that the support for String theory is very recent in Z3. Maybe our results could be improved by using a Solver with a more efficient support it. With the same objective, it could be interesting to have a look at Z3’s internals, and get a better understanding of the practical limitations of using it in our context.

Second, I started wondering if one could come up with efficient strategies to reduce the size of the queries without losing too much accuracy, for example:

Take guesses for words coming from lists containing a lot of words;
Clean the wordlists from words that are containing letters with low occurrences in the dictionary. For example, isolated words (farther from the others), defined using the Levenshtein distance;
“Divide and conquer”: isolate portions of the grid that could be solved separately, adapting the wordlists accordingly. For example solving the bottom right corner using wordlists of truncated words, where only their “end” portion is left. This approach would reduce the size of the wordlist too, as verbs with the same “ending” in their conjugation would be mapped to a single entry in the truncated wordlist.

Last words

All in all, using Solver to generate crossword grids is not the most efficient way to do it, making its use impractical: I will never be able to start my crossword editor startup…

However, this idea allowed us to explore several concepts around the use of SMT Solvers, to help us find solution to algorithmic problems.

In particular, we discussed in details how we can model (encode) crossword grids, to get a program give us, although very slowly, valid solutions!

Hope you found this journey very cool; At least it was from my side.

Clearly, this setup is rubbish from the computing power perspective, considering the task at hand. But that’s my laptop, and I love it. ↩

Use SMT Solvers to generate crossword grids (2)

2019-11-12T00:00:00+00:00

This post is part of a series on using SMT Solvers to generate crossword grids.

Introduction to SMT, and programming with SMT Solvers;
Definitions and first formulas (currently reading);
Plumbing everything together, complete formula, and results.

Thanks @geistindersh for his feedback, and corrections!

In a previous blog post we presented SMT Solvers, and mentioned that we can use them to solve problems; More explicitly, for our problem at hand, we plan to:

Construct a formula to represent (or encode) “generic” crossword grids;
Ask the Solver to give us a solution (a set of values);
Interpret this solution (or decode) to obtain a valid, completed, crossword grid.

\[\text{grid} \xrightarrow[]{\text{encode}} \text{formula} \xrightarrow[]{\text{solve}} \text{values} \xrightarrow[]{\text{decode}} \text{completed grid} \\\]

In this post, we will first define some vocabulary and definitions we use while constructing our solution. Then we are going to state more clearly what we are aiming to achieve. At the end, we will present some formulas, the building blocks of how we can represent any crossword grid.

Crosswords

Definitions

Let’s clarify some concept that we are going to use to describe crossword grids:

A grid is composed of: definition slots, and (empty) cells;
Definitions do not take more room than a cell, and there can be up to two definitions per slot;
Following the definitions, words can be written vertically (from top to bottom), and horizontally (from left to right).

Here is a basic example of what an empty crossword grid could look like:

The aim of a game of crosswords is to fill a grid with words taken from the dictionary, (preferably) respecting the definitions, with each cell containing one (and only one) letter.

Here is a valid solution of the previous example, using French words (by chauvinism):

So, back to the problem: What are we trying to do?

Let’s not care about definitions. If we can generate a grid of intersecting words respecting the rules:

One and only one letter per cell;
Words are coming from a list of valid words;
Words are written left to right or top to bottom;

Then adding definitions is relatively straightforward, using a mapping of words to definitions (a.k.a., a dictionary).

From grid to formula back to grid

Now that we specified what we talk about, let’s write some formulas; Gradually, from simple to complex.

As we mentioned earlier, we will use Z3, because it supports a String theory ¹. Furthermore, we will make use of the Python bindings for Z3, to make the code friendly to beginners’ eyes.

A single valid word

Let’s start with the simplest case we can think of: a grid composed of a single word. It would look like this:

Essentially, we declare a variable named \(horizontal\) and constrain it to take any value from a finite wordlist (our dictionary).

# Using a Python REPL
python> horizontal = String('horizontal')

python> formula = Or(
  horizontal == StringVal("abat"),
  horizontal == StringVal("abbe"),
  horizontal == StringVal("abri"),
  # [...]
)

In the above example, we see that \(horizontal\) can take the value abat, or the value abbe, or the value abri, etc. As you might guess, we truncated the display: dictionaries are get pretty big, so the formula is effectively thousands of lines long.

Note that if we consider \(horizontal\) to be potentially any word from the dictionary, we would end with a massive formula, involving ten of thousands of disjunctions. The French wordlist we will use counts 200 224 entries! And that would only get worse for a complete grid, counting between 50 and 100 words (ballpark estimate)… Intuitively, we want to keep the “size of the queries” we send to the Solver relatively small for them to be able to handle the task.

There is something that we already know about \(horizontal\) that we do not need the help of a Solver for: its size! Indeed, we know from the grid that \(horizontal\) should be of length 4, hence, we don’t need to pass the Solver anything that involves words of size different than that.

In practice, instead of having a single wordlist, we use many: each one of them referencing only words of the same size. At most, that means we have 36 wordlists for the French language ².

And so, asking a Solver:

python> solver = Solver()
python> solver.add(formula)
python> solver.check()
sat
python> solver.model()
[horizontal = "abbe"]

Which we can map back into the grid frame we had:

Yay! We generated a first single word grid!

Two valid words

Let’s up the game, because crossword grids would either get boring or very frustrating with a single word to guess. Consider the following:

In the same spirit than earlier: we now use \(horizontal\) and \(vertical\), two variables that can take any values from respectively two different wordlists, the first being the list of French words of length 4, the latter the list of French words of length 2 (again, we omit possible values for brevity):

python> horizontal = String('horizontal')
python> vertical = String('vertical')

python> formula = And(
  Or(
    horizontal == StringVal("abat"),
    horizontal == StringVal("abbe"),
    horizontal == StringVal("abri"),
    # [...]
  ),
  Or(
    vertical == StringVal("ah"),
    vertical == StringVal("an"),
    vertical == StringVal("ru"),
    # [...]
  )
)

python> solver = Solver()
python> solver.add(formula)
python> solver.check()
sat
python> solver.model()
[vertical = "ru", horizontal = "abbe"]

Again, the Solver was able to find a solution, that we interpret as the following filled grid:

Two valid intersecting words

Disconnected words do not represent very well the content of real world crossword grids. Next step is to start assembling:

From the formula perspective, we start exactly as earlier, and then we add a constraint saying that the character at index \(2\) of \(horizontal\) must equal the character at index \(0\) of \(vertical\).

python> horizontal = String('horizontal')
python> vertical = String('vertical')

python> formula = And(
  Or(
    horizontal == StringVal("abat"),
    horizontal == StringVal("abbe"),
    horizontal == StringVal("abri"),
    # [...]
  ),
  Or(
    vertical == StringVal("ah"),
    vertical == StringVal("an"),
    vertical == StringVal("ru"),
    # [...]
  ),
  horizontal[2] == vertical[0]
)

python> solver = Solver()
python> solver.add(formula)
python> solver.check()
sat
python> solver.model()
[vertical = "ru", horizontal = "abri"]

And the given solution is translated to the following grid:

Note that the result is (as in the previous examples) one of many valid combinations: there are other four letter words where the character at index \(2\) is the same as the character at index \(0\) of a well chosen two letter word.

Boom, we now have everything we need to represent complete crossword grids!

In the next (and last) blog post, we will present the complete formula we build, and the results we obtain trying to generate grids this way.

Under this theory, variables take their values among all the possible sequences of characters, and operations can be substring comparison, concatenation, etc. ↩
According to the relevant Wikipedia page, the longest French word is “hippopotomonstrosesquippedaliophobie”, counting 36 letters! ↩

Use SMT Solvers to generate crossword grids (1)

2019-11-11T00:00:00+00:00

This post is part of a series on using SMT Solvers to generate crossword grids.

Introduction to SMT, and programming with SMT Solvers (currently reading);
Definitions and first formulas;
Plumbing everything together, complete formula, and results.

Thanks @geistindersh for his feedback, and corrections!

SMT solvers are tools that are used in several fields. By modeling complex problems into logical formulas, and then leveraging the power of a Solver hoping to find values satisfying these formulas, it is possible to obtain solutions for the targeted problems.

When I first encountered this approach in a class on program analysis ¹, the whole concept of encoding problems with mathematics was not very straightforward, and required a bit of mental gymnastics.

However, after some practice, I became more accustomed to this idea, and recently had the opportunity to exercise and study these approaches more in depth. Now that it feels familiar, I believe it’s the perfect time to write what I wish I could have read earlier.

Hence, in the following blog posts, we will explore the use of SMT Solvers in a recreational way, making use of them to solve an absolutely unimportant problem: generation of crossword grids!

Introduction

SMT: Satisfiability Modulo Theory

What is SMT by the way?

SMT problem is a decision problem for logical formulas with respect to combinations of background theories ²

Huh? Let’s break that down:

A decision problem is a question that can be answered by Yes, No, or Don't Know ³;
Logical formulas are “mathematical” formulas using variables and operations; Such formulas can be evaluated to True, or False (depending on the values that the variables take, and the considered operations);
Background theories can be thought as the “universe in which our formula lives”. Some examples:
- Booleans: variables can take the values True or False, and operations being \(\land, \lor, \lnot\) (respectively logical and, or, not) …
- Integers: variables can take integers values \(\{ -\infty, ..., -2, -1, 0, 1, 2, 3, ... \infty \}\), and operations being arithmetic operations: \(+, -, *, /, \%, ...\)
- Strings: variables are sequences of characters, and operations can be substring comparison, concatenation, …

In our context, we will consider the latest, because crosswords involve finding words respecting certain constraints.

But before explaining how we will use this particular theory, we need to explicit one can generally use constraints Solvers to help solving problems.

Solvers

SMT Solvers (also known as constraints Solvers, or theorem Provers) are computer programs, that take formulas expressed under a specific theory as input, and answers one of the following:

SATisfiable, if there exists a set of values for (a valuation of) the variables that make the given formula True;
UNSATisfiable, if there are no values for which the formula is True; In other terms, the formula will always evaluate to False no matter how hard we try;
Don't Know, if the Solver did not manage to give one of the previous result under a specific time bound.

We will assimilate SMT Solvers as magical ⁴ black boxes, their inner working remaining mysterious. We can feed formulas to them, and expect one of the above response.

\[formula \xrightarrow[]{\text{SMT Solver}} \begin{cases} \text{SAT} \\ \text{UNSAT} \\ \text{Don't Know} \end{cases}\]

And if the formula is SAT, the Solver will return a proof alongside its answer: a set of values for the variables appearing in the formula. To verify the satisfiability using the proof, we evaluate the formula with the given values, and ensures that the computed result is True.

Z3, the theorem Prover

Z3 is an open source SMT Solver developed by Microsoft Research, and available on GitHub.

Well-know, well-documented, and relatively famous, it even comes with bindings for many popular programming languages, and notably Python (in other words, Python code can interact with Z3).

It is for this practical reasons that we will generate our crossword grids using Z3, but note that it could be done with any other Solver supporting a theory for Strings.

Constraint programming

Now that we know what SMT Solvers are, we can make use of them to create programs to help us find solutions to our problems.

However, that is where the conceptual difficulty arise: As educated humans, we have been taught and trained to solve problems that we are given.

Yet, here, we want to avoid doing so: Instead, we want to let a Solver do the hard work of “understanding” the problem and finding a solution to it. In other words: We don’t try to construct an algorithm to solve the problem, but rather, we express the constraints of the problem in Mathematical terms, with logical expressions using variables taking values in an adequate theory.

Example

Let’s consider the following puzzle:

Find \(x\) a String such that: \(x\)’s first letter is an ‘H’ and its last letter is a ‘S’.

Natural “bad” approach, thinking out loud:

Educated you: I need to construct \(x\) a string.
Educated you: First letter is an ‘H’, so \(x\) should look like ‘H…’ .
Educated you: Then, last letter is an ‘S’. Educated you: So \(x\) is anything of the form ‘H…S’ .
Educated you: … *thinking really hard* …
Educated you: Hey! ‘HS’ works!

“Good” approach, from the constraint programming point of view, thinking out loud again:

You: I have the following constraints: \(x\) is a String, \(x\) starts with an ‘H’, and \(x\) ends with an ‘S’.
You: Hey Solver! Can you give me a value for the following formula: \(x \text{ is a String } \land x \text{ starts with an 'H' } \land x \text{ ends with an 'S'}\)?
Solver: Let me see…
Solver: … *thinking* …
Solver: It’s SAT, and \(x = \text{'HYPOTHALAMUS'}\) works!
You: Thank you Solver.
Solver: No worries.

Solvers are very powerful: We can use them to avoid dealing with tedious logic and building complicated algorithms.

In the next blog post, we are going to explicit how we model the crossword grids, to get a Solver to help generate them.

Software Verification, of the CSI Masters . ↩
Source: https://en.wikipedia.org/wiki/Satisfiability_Modulo_Theories . ↩
One of the biggest upset in Computer Science is that not all problems can be “solved” by algorithms: https://en.wikipedia.org/wiki/Undecidable_problem , but we don’t know which ones for sure. ↩
Because their implementation is way out of the scope of this post, it can be easier to imagine them as transcendental entities, or at least being beyond our comprehension. ↩

NixOS on a Dell XPS15 9560

2019-02-28T00:00:00+00:00

Foreword

From a several months now, I have been using NixOS, as much for personal stuff than for work.

In particular, its declarative and reproducible system configuration allows me to have a GitHub repository, Pamplemousse/laptop, “representing” what the software setup of my machine is.

The idea with this is being able to automate the installation of my computer, easing the pain of installing and setting up a new machine.

I am not at all an expert in system administration, and this “project” is far from being mature, but this had done the job so far.

Until…

One day, I received a Dell XPS15 9560 which I needed to setup, thus I wanted to install NixOS on it.

It was painful ; but I realized I was not the only one running into trouble installing Linux on this machine: see this comparative result on the installation of several distributions.

Among the bit that caused me much trouble:

Move from MBR/BIOS to GPT/UEFI ¹;
LUKS encryption;
Machine freezing, very likely due to the nouveau, and bbswitch modules for Nvidia graphic card.

Miraculously, I discovered two blog posts that literally saved my day (or shall I say my week):

NixOS on a Dell 9560 (by Graham Christensen)
Installing NixOS on a XPS 9560 (by Julien Tanguy)

Thus, this post stands on their shoulders and comes essentially as a wrap up of the work they have provided.

Full disk encryption

I don’t think we can make these devices harder to lose; that’s a human problem and not a technological one. But we can make the loss just cost money, not privacy. ²

Solution: use Luks to encrypt partitions ; Here is what the partitioning looks like.

/dev/sda1: BIOS
/dev/sda2: EFI
/dev/sda3, /dev/mapper/cryptkey: LUKS key
/dev/sda4, /dev/mapper/cryptswap: swap partition
/dev/sda5, /dev/mapper/cryptroot: root filesystem

Why this partitioning?

We want the swap partition to be encrypted not to leak the RAM content on hibernation.

For user-friendliness, we create a partition /dev/mapper/cryptkey that will be used as a keyfile to unlock both the swap (/dev/mapper/cryptswap) and the root (/dev/mapper/cryptroot) partitions. This keyfile will then be encrypted using a user passphrase.

Hence, a passphrase will be asked only once, to decrypt the keyfile, which will then be used to decrypt the swap and root partitions.

And if `/dev/mapper/cryptkey` gets corrupted?

As is, that would mean that the swap and root partitions would be lost. For the latter one, that is very bad: all data on the root partition (system and user data) would then be inaccessible.

One solution is to create a random passphrase (not meant to be remembered by yourself, that you store securely store elsewhere), and then allow it to decrypt the root partition.

Here is how we proceeded:

Note that some space is left at the beginning of the disk for the GPT to take place. ³

# partitioning
DISK=/dev/sda
sgdisk -og "$DISK"
sgdisk -n 1:2048:4095 -c 1:"BIOS boot partition" -t 1:ef02 "$DISK"
sgdisk -n 2:0:+550MiB -c 2:"EFI system partition" -t 2:ef00 "$DISK"
sgdisk -n 3:0:+3MiB -c 3:"cryptsetup luks key" -t 3:8300 "$DISK"
sgdisk -n 4:0:+"${RAM}"GiB -c 4:"swap space (hibernation)" -t 4:8300 "$DISK"
sgdisk -n 5:0:"$(sgdisk -E "$DISK")" -c 5:"root filesystem" -t 5:8300 "$DISK"

# encrypting
cryptsetup luksFormat "${DISK}3"
cryptsetup luksOpen "${DISK}3" cryptkey

dd if=/dev/random of=/dev/mapper/cryptkey bs=1024 count=14000

cryptsetup luksFormat --key-file=/dev/mapper/cryptkey "${DISK}4"

cryptsetup luksFormat "${DISK}5"
cryptsetup luksAddKey "${DISK}5" /dev/mapper/cryptkey

# labeling, mounting and generating the base config
cryptsetup luksOpen --key-file=/dev/mapper/cryptkey "${DISK}4" cryptswap
mkswap -L swap /dev/mapper/cryptswap
swapon /dev/disk/by-label/swap

cryptsetup luksOpen --key-file=/dev/mapper/cryptkey "${DISK}5" cryptroot
mkfs.ext4 -L nixos /dev/mapper/cryptroot
mount /dev/disk/by-label/nixos /mnt

mkfs.vfat -n efi "${DISK}2"
mkdir /mnt/boot
mount /dev/disk/by-label/efi /mnt/boot

nixos-generate-config --root /mnt

And what the NixOS configuration looks like:

We created an extra file called /etc/nixos/luks-devices-configuration.nix, containing the following:

{
  boot.initrd.luks.devices = {
    cryptkey = {
      device = "/dev/sda3";
    };

    cryptroot = {
      device = "/dev/sda5";
      keyFile = "/dev/mapper/cryptkey";
    };

    cryptswap = {
      device = "/dev/sda4";
      keyFile = "/dev/mapper/cryptkey";
    };
  };
}

And then, included it in the general /etc/nixos/configuration.nix file:

{ config, pkgs, ... }:

{
  imports =
    [ # Include the results of the hardware scan.
      ./hardware-configuration.nix
      ./luks-devices-configuration.nix
    ];

  # ...
}

Machine freezing

As mentioned earlier in the post, I experienced many freezes of the laptop, and adopted the solution proposed in jtanguy.cleverapps.io/installing-nixos-on-a-xps-9560 by adding the following to my /etc/nixos/configuration.nix:

boot.blacklistedKernelModules = [ "nouveau" "bbswitch" ];
boot.extraModulePackages = [ pkgs.linuxPackages.nvidia_x11 ];

hardware.bumblebee.enable = true;
hardware.bumblebee.pmMethod = "none";

Conclusion

My GitHub repository Pamplemousse/laptop should contain the most up-to-date state of my configuration.

However, it does not concern specifically the Dell XPS15 9560, and not all that I have presented here is merged into the master branch (in particular the Kernel modules blacklisting or the bumblebee configuration). Despite, missing pieces can be found in the test branch of the same repository.

Warning / Need to be improved

It is worth nothing as I do not run these script as is.

So far, my work on this repository is actually more about having handful “templates” and/or bits of configuration to speed-up my laptop’s installation rather than having “production ready” autonomous scripts that have been thoroughly tested.

Some areas of improvements that are worth mentioning:

Install script, pay attention to what is generated in /etc/nixos/hardware-configuration.nix and what might overwrite stuff from the /etc/nixos/luks-devices-configuration.nix during the configuration generation;
/boot being located on /dev/sda2 is not encrypted.

Aside from that, I am happy now that my laptop is functional! (Pun intended.)

MBR, BIOS, GPT, UEFI definitions: https://wiki.manjaro.org/index.php?title=Some_basics_of_MBR_v/s_GPT_and_BIOS_v/s_UEFI . ↩
Source: https://www.wired.com/2006/01/big-risks-come-in-small-packages/ . ↩
GUID Partition Table: https://en.wikipedia.org/wiki/GUID_Partition_Table . ↩

Scanning “modern” web applications with OWASP ZAP

2018-10-01T00:00:00+00:00

During the summer of 2018, I was an intern in the FoxSec team at Mozilla, where I contributed to ZAP (for Zed Attack proxy), an open-source web application security scanner.

The subject of my internship was Scanning modern web applications with OWASP ZAP, and the report I wrote about it is available online at xaviermaso.com/internship_report_2018.pdf .

I do not intend to delve too much into the details of what have been implemented in this post, especially because the report should contain all the informations needed for whom is interested by the subject.

However, this is a good opportunity for me to talk a bit more loosely about what is inside. In a sense, this post is more of an “Abstract” (if you are from the academia) or a “TL;DR” (if you are from Reddit) to present the key ideas and motivations for those who still hesitate to read twenty pages of my lame prose.

Thus, let’s follow the plan of the report:

ZAP and “modern” web applications
The FrontEndScanner
The front-end-tracker

ZAP and “modern” web applications

Some ZAP concepts

ZAP is a web proxy: sitting between a web browser and the server serving the application that one wishes to test, it can monitor web traffic between the two entities, interrupt it, modify it and even record and replay it.

Because of that, it is a great tool to perform security testing of an application; either by passively looking for vulnerabilities in the requests and responses (such as missing headers, plaintext secrets, or else), or by actively crafting requests or tampering with the content aiming to trigger interesting behavior on the server, or in the browser (in the case of XSS vulnerabilities for example).

One of the most interesting feature of ZAP is for the user to be able to write their own scripts that will run under specific circumstances, to perform custom tasks. For examples, here are a couple of scripts that have been written by the community and published under github.com/zaproxy/community-scripts .

“Modern” web applications

Arguably, we called “modern” web applications the ones relying heavily on JavaScript.

In nowadays web, almost every page contains JavaScript to be executed by the client (aka, the web browser). This is even truer, especially with the rise of JavaScript framework such as React, AngularJS, Vue.js, Ember.js, and so on, encouraging developers to embed a whole applications into single pages, the so-called “SPA”.

The problem is that it somewhat breaks the approach taken by ZAP: by only scanning HTTP responses, our favorite proxy statically analyze the transferred content, without taking into consideration the transformations that might happen when the browser will interpret the embedded JavaScript.

For example, let’s say that to detect XSS or SQLi, you have a script to look for fields in a webpage. If the is present in the HTML of the page, ZAP will be able to find it. However, if there is a piece of JavaScript that modifies the DOM to add such an element, such as:

window.onload(function () {
  var inputElement = document.createElement('input');
  inputElement.type = 'text';
  document.body.appendChild(inputElement);
});

ZAP would be unable to understand the implications and detect the fact that a potential source of vulnerability will be added to the page “at run time” i.e when the browser will interpret the JavaScript.

Through this basic example, one can see the limits of the static analysis of an HTTP response: it is difficult to be confident in the fact that an application is vulnerability free just by looking at its source code, as complex chain of events in the browser (user or network interactions for example) can lead to the modification of the scanned content, making the process irrelevant.

To answer this problematic, we broke our solution down into two “components”:

the FrontEndScanner add-on,
the front-end-tracker JavaScript library

The FrontEndScanner

Add-ons add additional functionality to ZAP. They have full access to all of the ZAP internals, and so can provide very powerful new features. ¹

We wrote the FrontEndScanner add-on to provide a way for ZAP users to look for front-end vulnerabilities by executing scripts where they can make sense out of the dynamic nature of JavaScript: in the web browser, alongside the application that is being tested.

When turned on, our add-on will tamper with all HTTP responses coming back from the server to inject a piece of JavaScript code into the tested application, directly into the of the HTML document. By doing so, we ensure that our code will be run before anything else (especially before front-end frameworks and libraries) when loaded by the web browser. This is really important as we want to keep track of modifications to the DOM and to the WebAPI that those external scripts might be doing.

The piece of JavaScript code is made of the following:

the FrontEndScanner object, itself containing:
- ZAP constants to help scripts create alerts,
- the “mailbox”: a “publish-subscribe” mechanism to help ZAP users’ scripts react to events happening in the browser (such as user interactions, Storage accesses, etc.),
- a helper function to report findings back to ZAP
a list of user defined scripts, for which each of them will be encapsulated in a function

When in the browser, these functions are executed, taking the FrontEndScanner object as parameter. Thus, user scripts can make use of the content defined above to perform meaningful security checks and raise alerts in ZAP when finding vulnerabilities.

The front-end-tracker

As the WebAPI is mostly intended for application developers rather than for security testers, it does not expose all the features that we would hope to have for debugging and testing a web page.

To answer this lack of features considering our use case, we wrote the front-end-tracker, a JavaScript library meant to provide an extension of the API available in the browser that would be more pertinent for one wishing to write security checks.

How does it work?

When loaded into a web page, the front-end-tracker wraps behaviors that we are interested in tracking into our own functions that will perform some kind of reporting before running the expected code.

Here is a simplified example of what it could look like:

const oldGetItem = Storage.prototype.getItem;

Storage.prototype.getItem = function (...args) {
  mailbox.publish(
    'storage',
    {action: 'get', args: args}
  );
  return oldGetItem(args);
}

We call such a mechanism a “hook”, as it hooks a custom function to a standard behavior. So far, the following hooks have been implemented:

DOM events: catch when a user interacts with a webpage (by clicking, scrolling, hovering, or else), when resources are loaded, etc. ²,
Storage: catch when values in the storages in the browser are read, written or removed

If the front-end-tracker ever runs after one of this behavior get triggered in the page, this one would not be reported. Hence, if we want to monitor everything that we are interested in in a web page, the front-end-tracker needs to be the very first thing to be interpreted here.

Another key concept of the front-end-tracker is the mailbox: a topic based publish-subscribe object, on which the functions from our hooks publish to, and for which scripts in the page can subscribe to.

// example of subscription to log messages related to 'dom-events'
const topic = 'dom-events';
mailbox.subscribe(topic, (_, data) => {
  console.log(data);
});

Written to be a standalone component, the front-end-tracker can be used to help debugging any application. That is why it has been released on npm under @zaproxy/front-end-tracker.

Conclusion

After these twelve weeks of internship, we ended up having an interesting proof-of-concept of our approach and tools for scanning modern web applications.

Not only we implemented the basics FrontEndScanner add-on and the front-end-tracker it relies on, but we wrote the very first client-side passive script to detect when JWT tokens are written in an application ³.

Unfortunately, all the work presented here has not yet been released: indeed, the FrontEndScanner still lacks features and documentation to be made available on ZAP’s marketplace: see issue #4939 for more details.

On the other hand, the front-end-tracker is already published on npm, but could become even more useful with a couple more hooks added to it, such as ones for DOM mutations, XMLHttpRequest, or postMessage.

Unfortunately, as I am back to university, I do not have much time to invest on this, and as the ZAP core team members have already an awful lot of things to deal with, it does not seem that these features will be brought to ZAP users in a near future.

If you are interested to help and contribute, you can take a look at the related issues opened on GitHub, or come talk to the (very welcoming) team members on irc.freenode.net, in channel #zaproxy.

I am always happy to receive constructive feedback, so do not hesitate to ping me, on twitter @pamplemouss_ or elsewhere.

Source: https://github.com/zaproxy/zap-core-help/wiki/HelpStartConceptsAddons . ↩
The complete list of events to track: https://github.com/zaproxy/front-end-tracker/blob/master/src/events.js . ↩
This “scan-jwt-tokens” script is installed with the FrontEndScanner add-on, and thus available as an example for ZAP users. Here is what it looks like: https://github.com/zaproxy/zap-extensions/blob/master/addOns/frontendscanner/src/main/zapHomeFiles/scripts/scripts/client-side-passive/scan-jwt-tokens.js ↩

Patch option in Git

2018-06-29T00:00:00+00:00

Patch

I have (almost) always been using git add -p or git add --patch.

This option allows you to interactively select which pieces of your changes to be added to the index. (Before writing this article, I was even convinced that -p stood for “partial”…)

This is very convenient to 1) make sure that you will not commit unwanted code, 2) partially save your changes without losing your work in progress.

I recently learned that this -p/--patch option is available for the checkout and the reset commands as well!

With these, we can respectively get rid of only part of our changes and remove pieces of code from the index.

Examples

Here is the example setup: I created a git repository in which I have committed a single file.

$ ls
example.txt

$ git status
On branch master
nothing to commit, working tree clean

$ cat example.txt
This is an exmple fil.
Containing multiple lines.

Very interest.

git add -p

Let’s say we edited our example.txt to add some content that we would like to commit.

$ cat example.txt
File

This is an example file.
Containing multiple lines.

Very interesting.

We did multiple things here: added a “title” an corrected several words. To keep things clean, we would like to make one commit for each one of these changes.

Here is how I would use -p:

use “split” and “edit” to keep only the changes related to correct the words
commit those changes
verify that only the title is added
commit this change

git checkout -p

Similarly, we can use git checkout -p to discard part of the changes that we have performed on a file. Let’s say we have edited our example.txt to add a line in the middle and modify the last one.

$ cat example.txt
File

This is an example file.
Bwaaaaaaaaaah!
Containing multiple lines.

Some very interesting changes.

Then, we can use git checkout -p to get rid of the rubbish line that has been introduced:

use split
get rid of the first part
but not of the second

git reset -p

At last, we added our previous change to the index, as well as our edit of the third line, containing a grammar error…

$ cat example.txt
File

This is a example marvelous file.
Containing multiple lines.

Some very interesting changes.

$ git add example.txt

We changed our mind: this is not OK to commit broken English.

Let’s use git reset -p to remove the unwanted content from the index so we can commit peacefully:

split the content
reset the first part
keep the second one
commit

Et voilà!

Google Hangouts with Irssi on Nixos

2018-05-31T00:00:00+00:00

As I wanted to have access to Google Hangouts chats with Irssi on NixOS, here is a write-up of how I got it working.

The protagonists

After a quick research using my favorite search engine DuckDuckGo, it turns out that we will need to add two piece of software to Irssi.

BitlBee: an IRC gateway that act as a server your client connects to, using the IRC protocol ; and “translates” what you send and receive to another protocol (depending on whom your gateway connects to)
purple-hangouts: a library “to support the proprietary protocol that Google uses for its Hangouts service”

Add them to the system

We are pretty lucky as packages for BitlBee and purple-hangout are available on NixOS.

However, purple is not a plugin installed by default in the bitlbee package: we need to declare that we want it enabled.

Having a look at the declaration of bitlbee, we can find out the name of the relevant build option.

Let’s edit /etc/nixos/configuration.nix:

environment.systemPackages = with pkgs; [
  [...]
  bitlbee
  purple-hangouts
  [...]
];

nixpkgs.config.bitlbee.enableLibPurple = true;

services.bitlbee = {
  enable = true;
  libpurple_plugins = [ pkgs.purple-hangout ];
};

You can see how this fit my whole configuration on my GitHub repo.

Then rebuild the system:

sudo nixos-rebuild switch

Try it out

See how it goes: start irssi, then type the following commands:

<@pamplemousse> /connect localhost
<@pamplemousse> /join &bitlbee

At this point, I need to create an account to identify myself to the BitlBee server.

<@pamplemousse> register StrongPasswordGeneratedWithKeepassXC

And verify our the plugin to communicate with Google Hangouts is present:

<@pamplemousse> plugins
[...]
<@root> Enabled Protocols: aim, bonjour, gg, hangouts, icq, identica, irc, jabber, novell, oscar, simple, twitter, zephysr

All good!

Setting up BitlBee to access your Google Hangouts account

<@pamplemousse> account add hangouts MyAddress@Email.Com
<@pamplemousse> acc hangouts on

The next step is one of the most unreliable thing I have ever done to configure an account.

In fact, the previous command created another Irssi window to interact with the lib (that is, a private conversation with purple_request_0).

Follow the instruction that appeared there, and reply the oauth code that you obtain in the conversation (took me 30 minutes to figure this out).

Once that’s done, you should see all your contacts appearing in the &bitlbee window.

Try it out

We can now start 1-on-1 conversations, for example with JohnDoe:

<@pamplemousse> /msg JohnDoe hello

And even join group chats (that exists):

<@pamplemousse> help chat list
<@pamplemousse> chat list hangouts
<@pamplemousse> chat add hangouts !1 #chatname

So later on, we can use the shortcut #chatname:

<@pamplemousse> /j #chatname

That’s it! We now can chat on Google Hangouts from Irssi!

Setup a dev environment to contribute to ZAP

2018-04-15T00:00:00+00:00

Foreword

ZAP, or Zed Attack Proxy, is an OWASP project to make a free security tool to help developers and security experts test and find vulnerabilities in web applications.

I have been given the opportunity to contribute to it, and, being an open-source project, I feel like it would be a good idea to share my tribulations. I ambitiously hope it will reduce the time and effort potential future contributors would have to invest diving into it.

The subject is so wide I will not be able to cover it entirely in this single blog post. There is enough content to start a series, and I should stay motivated to write about it: stay tuned.

For now, let’s start with the basics.

How to get the code running on my machine?

First steps

Documentation about ZAP development can be found on the Zap’s repository wiki. Anything I will present here have been found roaming around the docs.

Let’s start by cloning the main repository, build ZAP and start it from the command line.

mkdir zap && cd zap
git clone git@github.com:zaproxy/zaproxy.git

ant -f zaproxy/build/build.xml dist
./zaproxy/build/zap/zap.sh

And then we have ZAP running. Smooth.

Extensions, Add-ons

Lots of ZAP’s “logic” has been extracted from the core repo into so-called add-ons, which are located into the zap-extensions repo.

Here you can find a general overview about Add-ons: github.com/zaproxy/zap-core-help/wiki/HelpStartConceptsAddons.

As we are going to work on these as well, let’s clone the repository alongside the zaproxy one.

git clone git@github.com:zaproxy/zap-extensions.git

Again, its related wiki might be of good help.

There are (as far as I know), two ways to get add-ons in ZAP: via the “marketplace” (located in “Manage Add-ons” in the “Top Level Toolbar”) or load them from a file.

In our case, as we want to edit add-ons and watch their brand new behaviour, we will use the latter.

In an upcoming post, we will talk about how to bring changes to an Add-on, but before that, let’s ensure we can build them normally.

Build and use the Add-on “as is”

As we are going to have a look at a specific issue related to Zest, this is the add-on we are going to look at.

Depending on the extension we want to work on, we gonna have to checkout the related branch. In our case, we want to work on Zest, so let’s checkout the beta branch, and build this Add-on specifically.

cd zap-extensions
git checkout beta
ant -f build/build.xml deploy-zest

Then, in ZAP, press Ctrl+l (or go to “File > Load Add-on File”), then select the brand new plugin file (usually, it has been deployed to the “zap/zaproxy/src/plugin/” folder).

At this point, you should have the Zest extension working fine in ZAP, and that’s enough for today.

pamplemousse’s blog

Solving LinkedIn’s Queens game with CodeQL

Introduction

CodeQL

Logic Programming

Queens

On paper

Implementation

Top level query

Cell representation

Rules predicate

All queens are on different rows, columns, and not adjacent

One queen per colored zone

A word on inefficiency

Plumbing

Conclusion

Handle function calls during static analysis in angr

Context

But what if such a statement is a function call?

… And what if this function is an external function? For example provided by a dynamically linked library?

So?

Usage and description

Examples

Binary to analyse

The simplest analysis

Handle local functions

Handling external functions

One step beyond: Inter-procedural analysis

Conclusion

Use SMT Solvers to generate crossword grids (3)

Formula generation

Variables

Grid

Stop waving your hands. Where is the code?

Results

Improvements

Last words

Use SMT Solvers to generate crossword grids (2)

Crosswords

Definitions

So, back to the problem: What are we trying to do?

From grid to formula back to grid

A single valid word

Two valid words

Two valid intersecting words

Use SMT Solvers to generate crossword grids (1)

Introduction

SMT: Satisfiability Modulo Theory

Solvers

Z3, the theorem Prover

Constraint programming

Example

NixOS on a Dell XPS15 9560

Foreword

Until…

Full disk encryption

Why this partitioning?

And if /dev/mapper/cryptkey gets corrupted?

Here is how we proceeded:

And what the NixOS configuration looks like:

Machine freezing

Conclusion

Warning / Need to be improved

Scanning “modern” web applications with OWASP ZAP

ZAP and “modern” web applications

Some ZAP concepts

“Modern” web applications

The FrontEndScanner

The front-end-tracker

How does it work?

Conclusion

Patch option in Git

Patch

Examples

git add -p

git checkout -p

git reset -p

Google Hangouts with Irssi on Nixos

The protagonists

Add them to the system

And if `/dev/mapper/cryptkey` gets corrupted?