Memory Model Specification and Verification for Distributed Applications

About this Page

On this page we describe our current work on Memory Model specification and verification. This work is part of a GOA project, with title Embedded Systems for Multimedia Applications Toward an Efficient Design Methodology.

Additional funding and support from the Prof. Dr. Wuytack Fund and the Department of Pure Mathematics and Computer Algebra.

Benjamin De Leeuw

prof. dr. Albert Hoogewijs

Gent University

Belgium

Motivation

Distributed programming is a model for installing software units on physically separated computers that are remotely accessed through a network. Inherent difficulties arise when programming in a distributed way. Execution replay is impossible because of high degrees of non-determinism in the shared information. Verification is unfeasible without a complex formalism on the model of execution. Two kinds of distributed systems have been proposed over the years: distributed systems with message-based communication between processes and distributed systems based on shared memory access. We focus on shared memory distributed processing. The architectural setting is one of a multithreaded or multiprocessor configuration. Every thread has exclusive access to its own local memory and processors or threads communicate through shared memory accesses. Using the classification of Flynn, we research a language and a model to describe Multiple Instruction and Multiple Data stream systems (MIMD systems).

In the shared memory models, we see two approaches. On the one hand, there are processor centric models, where the main issue consists of the cache coherency of every processor in a distributed system. On the other hand we have shared memory centric models. By this the main issue is the constraining effect on the state space of the memory locations.

A memory model for a multithreaded or distributed system is a means to reason about memory actions (e.g. reads and writes) as they appear in a program execution. The main question that is tackled is which value each read of a memory location may return at a specific point in the program. This information makes it possible to reason about which compiler or hardware transformations are allowed without changing the meaning of the program.

On this page we provide a Prolog implementation to understand and verify memory configurations in an intuitive way. Prolog turns out to be an excellent choice of language to describe the shared memory behavior of distributed systems in.

Memory Model Background Information

Memory models have a long history in research as well as in industry. Distributed processing is the dominating effort in the evolution towards future computing. Parallel computing becomes ever more present, even more so now with the availability of dual core architectures and multithreaded programming languages for the average computer user. A very informative and well maintained webpage on the subject of the Java memory model can be found here. We took this model as a starting point for the implementation we provide on this page. For more information on the matter, refer to that webpage.

Downloads

To download the WIN32 program memmod.exe v1.00 and its required resources (SWI-prolog stand alone libraries), click here.

We also provide some example memory configurations. These are the examples, used in JSR-133, an official Java service request, which remedies the flawed memory model and thread specification of the Java 2 specification. The examples are used to illustrate certain classes of problems. Look up the source document for more information here. To download all example configurations, click here. A detailed list of examples can be found here.

A user guide to the program memmod.exe v1.00 is available here.

Source

The program memmod.exe v1.00 is written in SWI-prolog, and has been compiled as a stand alone release for WIN32 systems. The necessary prolog libraries are included in our distribution, and fall under the GPL. See the SWI-prolog homepage for more details.

We plan on releasing the prolog code of the program, as soon as it has been cleared of bugs and other compromising material that might make us look stupid. We hope this will be pretty soon.

We want to make the workings of memmod.exe v1.00 available as a pre-compilation processor for distributed specifications in Java. When a Java program is compiled during developement, we want to observe this process, and log the memory behavior in a memory configuration file. This file can be consulted by our engine and be translated in a report file. The information in this report can be coupled back to the developement environment, and be made available to the distributed programmer. The information in the report file can also be used to guide different program refactorings. We will tend to provide an Eclipse plugin which supports and implements these operations.

User Guide of memmod.exe v1.00

This user guide intends to outline the basic mechanism of defining and consulting memory configurations.

Tutorial

The basic operation of our program goes as follows:

Compose a text file < filename1 > .txt which follows the grammar which is specified here
Put the text file in the directory of memmod.exe
Start the program memmod.exe
When prompted, type < filename1 >
Remarks on the value calculation are displayed in the console
When prompted, type < filename2 >
Compile the produced file < filename2 > .tex with Latex to < filename2 > .pdf
Use < filename2 > .pdf as desired

We instantiate this template with an example configuration:

Download the memory configuration file PughFig01a.txt
Put the file PughFig01a.txt in the directory of memmod.exe
Start the program memmod.exe
When prompted, type PughFig01a, as displayed in Fig. 1
There are no remarks on the calculation for this configuation
When prompted, type PughFig01aReport, as shown in Fig.2
Compile the produced file PughFig01aReport.tex with Latex to PughFig01aReport.pdf
PughFig01aReport.pdf can now be used as desired

Figure 1

Figure 2

Composing the Memory Configuation File

We devised a new language for memory configuration specification. It has a couple of advantages in comparison to other memory configuration specification languages. It also displays a few drawbacks (see Discussion).

In memory models, we would like to talk about some kind of working value for each read and write instruction. This value, is the value seen from some viewpoint. The predominant ability of calculation units in distributed systems, is that they can see different values, at some measurable point in the execution of the whole system. If they couldn't, it would not be a distributed system. Every memory instruction in our new language, has such a working value. This value can only be influenced by instructions (reads for example) appearing in the same thread and vice versa. We model this working value as a local register (lr), which denotes the current working value, for each thread, at every measurable point in the execution. Figure 3 displays this architecture, with lr depicted as the "eye" of every thread or process. Local and global memory locations are also shown.

Figure 3

Every memory instruction has its own id. These ids are grouped and ordered in threads, using the directive

program_order([i1,i2,i3,i4,...]).
program_order([i3,i21,i22,i23,...]).

in the configuration file. This order denotes the so called program order (po). For a distributed system, this is always a partial order. In the example above, i1 comes before i2,i2 before i3,i3 before i4, and also i3 before i21, i22 and i4 are not ordered to eachother, and so on.

Our action language is specified by a context free grammar, which can be found here. The action semantics are all related to corresponding constructs in contemporary object-oriented programming languages like Java. The meaning of the actions is straightforward, and is displayed here.

The fact that locks and unlocks, and if-fi constructs are properly paired and nested, is not something we are concerned with. A standard compiler checks these things, so why would we repeat them. We therefore do no additional testing on following two ordering constructs in the configuration file:

lock_pair([(i1,i3),(i2,i4),...]).
if_pair([(i6,i8),(i10,i12),...]).

The correctness of these constraints is taken for granted. Both can be derived from the compilation process, or by a manual effort.

For every lock action, one or more most recent unlock (mru) actions are specified by the directive

synchro_order([(i3,i20),...]).

This is a form of synchronization, which can be determined at execution time only, when the different threads are scheduled. The po combined with the synchronization order (so) must also be a valid partial order relation. Our program reports it, when this is not the case. For more information on the usage of the configuration file, see the examples.

Grammar Specification

We use a CFG to formally define the content of a memory configuration textfile:

< memory_instruction > := mem(< id >,< thread >,< type >,< data >).

< type > := re | wr | lo | un | cr | rc | if | fi | du

< variable > := < variable_name > | a(< variable >,< offset >)

< data > := < variable > | < value > | < guard > | null

< value > := v(< content >) | c(< object >,< default_list >)

< guard > := < data > | n(< data >)

< default_list > := LIST of (< offset >,< content >)

< id >,< thread >,< variable_name >,< object >,< offset >,< filename > := SET of ATOM

< content > := SET



< module_name > := :- module(extend, [mem/4,program_order/1,if_pair/1,lock_pair/1,synchro_order/1]).

< program_order > := program_order(< id_list >).

< if_pair > := if_pair(< id_pair_list >).

< lup > := lock_pair(< id_pair_list >).

< synchronization_order > := synchro_order(< id_pair_list >).

< id_list > := LIST of < id > 

< id_pair_list > := LIST of < id_pair >

< id_pair > := (< id >,< id >)



< module > := < module_name > 
	      < memory_instruction > * 
	      < program_order > * 
	      < if_pair > 
	      < lup > 
	      < synchronization_order >

A legal configuration file, is a textfile, which contains one < module >.

Not all combinations of (< type >,< data >) in < memory_instruction > are allowed. In this table we sum up the different possibilities and their semantics.

(< type >,< data >) Semantics

(re,< variable >) read a local or global variable in lr

(wr,< variable >) write the value of lr to a local or global memory location

(lo,< variable_name >) lock access for a variable

(un,< variable_name >) unlock access for a variable

(cr,< value >) create the given value in lr

(rc,c(< object >,< default_list >)) refactor an object but leave lr unchanged

(if,< guard >) is fulfilled if at least one value of lr matches the guard

(if,n(< guard >)) is fulfilled if at least one value of lr doesn't match the guard

(fi,null) closes an if construct

(du,null) nop, does nothing

(< type >,< data >)	Semantics
(re,< variable >)	read a local or global variable in lr
(wr,< variable >)	write the value of lr to a local or global memory location
(lo,< variable_name >)	lock access for a variable
(un,< variable_name >)	unlock access for a variable
(cr,< value >)	create the given value in lr
(rc,c(< object >,< default_list >))	refactor an object but leave lr unchanged
(if,< guard >)	is fulfilled if at least one value of lr matches the guard
(if,n(< guard >))	is fulfilled if at least one value of lr doesn't match the guard
(fi,null)	closes an if construct
(du,null)	nop, does nothing

The Report

After the program has executed, a memory report file is generated. This is a LaTeX document, which can be compiled to a .pdf file with any standard LaTeX compiler. .pdf files can be viewed with Adobe Acrobat Reader.

1. Grammar Sets

Displays the different grammar sets for the memory configuration concordant with the given grammar.

2. Configuration

Displays the configuration of the memory in a readable format. This information is present in the configuration files, and composed by the user, or gathered automatically in a pre-compile process.

Thread's start and stop synchronization is displayed by outgoing and incoming arrows respectively, next to the < memory_instruction >.

3. Properties

Displays the values seen by every memory instruction as sets next to the instruction < id > 's.

Most recent writes are displayed by sets next to incoming arrows right of the < memory_instruction >.

Examples

Following examples are from JSR-133. They all illustrate a specific problem with naive memory models. We translated these examples to our system, and produced the different reports. We named them, using the expression PughFigNNx, where NN stands for the number of the respective figures of the JSR-133 document, and x is an optional letter. They are useful to learn what kinds of problems are present in naive memory models, and show that our system can correctly handle the different memory constraints. At some points our developed language allows for a different flavour of expressivity in comparison to the language used in JSR-133. See for example PughFig04a.txt, PughFig04b.txt, PughFig19a.txt and PughFig19b.txt.

There is also a class of complex inference and security problems discussed in JSR-133, which we discarded in favour of model simplicity. See for example PughFig16a.txt, PughFig16b.txt, PughFig16c.txt, PughFig17.txt and PughFig18.txt.

Here is the complete list:

Configuration Report Problem Type

PughFig01a.txt PughFig01a.pdf statement reordering

PughFig01b.txt PughFig01b.pdf statement reordering

PughFig02a.txt PughFig02a.pdf forward substitution

PughFig02b.txt PughFig02b.pdf forward substitution

PughFig03a.txt PughFig03a.pdf happens-before order

PughFig03b.txt PughFig03b.pdf happens-before order

PughFig04a.txt PughFig04a.pdf final fields

PughFig04b.txt PughFig04b.pdf final fields

PughFig06.txt PughFig06.pdf cyclic if-if dependency

PughFig07.txt PughFig07.pdf cyclic read-write dependency

PughFig08a.txt PughFig08a.pdf read elimination

PughFig08b.txt PughFig08b.pdf read elimination

PughFig09.txt PughFig09.pdf global analysis

PughFig10.txt PughFig10.pdf happens-before order

PughFig12.txt PughFig12.pdf happens-before order

PughFig14a.txt PughFig14a.pdf if order

PughFig14b.txt PughFig14b.pdf if order

PughFig16a.txt PughFig16a.pdf if order hack

PughFig16b.txt PughFig16b.pdf if order hack

PughFig16c.txt PughFig16c.pdf if order hack

PughFig17.txt PughFig17.pdf security issues

PughFig18.txt PughFig18.pdf security issues

PughFig19a.txt PughFig19a.pdf refactoring objects

PughFig19b.txt PughFig19b.pdf refactoring objects

Configuration	Report	Problem Type
PughFig01a.txt	PughFig01a.pdf	statement reordering
PughFig01b.txt	PughFig01b.pdf	statement reordering
PughFig02a.txt	PughFig02a.pdf	forward substitution
PughFig02b.txt	PughFig02b.pdf	forward substitution
PughFig03a.txt	PughFig03a.pdf	happens-before order
PughFig03b.txt	PughFig03b.pdf	happens-before order
PughFig04a.txt	PughFig04a.pdf	final fields
PughFig04b.txt	PughFig04b.pdf	final fields
PughFig06.txt	PughFig06.pdf	cyclic if-if dependency
PughFig07.txt	PughFig07.pdf	cyclic read-write dependency
PughFig08a.txt	PughFig08a.pdf	read elimination
PughFig08b.txt	PughFig08b.pdf	read elimination
PughFig09.txt	PughFig09.pdf	global analysis
PughFig10.txt	PughFig10.pdf	happens-before order
PughFig12.txt	PughFig12.pdf	happens-before order
PughFig14a.txt	PughFig14a.pdf	if order
PughFig14b.txt	PughFig14b.pdf	if order
PughFig16a.txt	PughFig16a.pdf	if order hack
PughFig16b.txt	PughFig16b.pdf	if order hack
PughFig16c.txt	PughFig16c.pdf	if order hack
PughFig17.txt	PughFig17.pdf	security issues
PughFig18.txt	PughFig18.pdf	security issues
PughFig19a.txt	PughFig19a.pdf	refactoring objects
PughFig19b.txt	PughFig19b.pdf	refactoring objects