TDF Token Register

January 1998

1 - Introduction

1.1 - Background
1.2 - Token Register Objectives

2 - Naming scheme

3 - Target dependency tokens

3.1 - Integer variety representations
3.2 - Floating variety representations
3.3 - Non-numeric representations
3.4 - Common conversion routines

4 - Basic mapping tokens

4.1 - C mapping tokens
4.2 - Fortran mapping tokens

5 - TDF Interface tokens

5.1 - Exception handling
5.2 - TDF Diagnostic Specification
5.3 - Accessing variable parameter lists

6 - Language Programming Interfaces

6.1 - The DRA C LPI
6.2 - The DRA C++ LPI
6.3 - The Etnoteam Fortran LPI

7 - Application Programming Interfaces

7.1 - ANSI C standard functions
7.2 - Common exceptional cases

TDF is an interface used for architecture neutral and programming language neutral representation of program. It is used both within portable language specific compilation systems, and for architecture neutral distribution of compiled programs. For full details see TDF Specification, Issue 4.0 (Revision 1).

TDF tokens offer a general encapsulation and expansion mechanism which allows any implementation detail to be delayed to the most appropriate stage of program translation. This provides a means for encapsulating any target dependencies in a neutral form, with specific implementations defined through standard TDF features. This raises a natural opportunity for well understood sets of TDF tokens to be included along with TDF itself as interface between TDF tools.

This first revision includes additional tokens for accessing variable parameter lists (see section 5.3), and a C mapping token to support the optional type long long int.

1.2. Token Register Objectives

As TDF tokens may be used to represent any piece of TDF, they may be used to supplement any TDF interface between software tools. However, that raises the issue of control authority for such an interface. In many cases, the interfaces may be considered to `belong' to a particular tool. In other cases, the names and specifications of tokens need to be recorded for common use.

This token register is used to record the names and specifications of tokens which may need to be assumed by more than one software tool. It also defines a naming scheme which should be used consistently to avoid ambiguity between tokens.

Five classes of tokens are identified:

target dependency tokens, which are concerned with describing target architecture or translator detail;
basic mapping tokens, which relate general language features to architecture detail;
TDF interface tokens, which may be required to complete the specification of some TDF constructs;
language programming interfaces (LPI) which may be specific to a particular producer;
application programming interfaces (API).

These classes are discussed separately, in sections 3 to 7 below.

2. Naming scheme

A flat name space will suffice for TDF token names if producer writers adopt the simple constraints described here. TDF has separate provision for a hierarchic unique naming scheme, but that was intended for a specific purpose that has not yet been realised.

External names for program or application specific tokens should be confined to `simple names', which we define to mean that they consist only of letters, digits and underscore, the characters allowed in C identifiers. Normally there will be very few such external names, as tokens internal to a single capsule do not require to be named. All other token names will consist of some controlled prefix followed by a simple name, with the prefix identifying the control authority.

For API tokens, the prefix will consist of a sequence of simple names, each followed by a dot, where the first simple name is the name of the API as listed or referred to in section 7.

The prefix for producer specific and target dependency tokens will begin and end with characters that distinguish them from the above cases. However, common tools such as DISP, TNC and PL-TDF assume that token names contain only letters, digits, underscore, dot, and/or twiddle.

The following prefixes are currently reserved:

~: TDF interface tokens as specified in section 5 below, and also LPI tokens specific to DRA's C producer.
.~: Registered target dependency tokens as specified in section 3 below, and basic mapping tokens specified in section 4.
~cpp.: LPI tokens specific to DRA's C++ producer, other than those it shares with the C producer.
.Et~: LPI tokens specific to Etnoteam's Fortran77 producer.

3. Target dependency tokens

Target dependency tokens provide a common interface to simple constructs where the required detail for any specific architecture can be expressed within TDF, but the detail will be architecture specific. Every installer should have associated with it, a capsule containing the installer specific definitions of all the tokens specificed within this section 3.

Some of these tokens provide information about the integer and floating point variety representations supported by an installer, in a form that may be used by TDF analysis tools for architecture specific analysis, or by library generation tools when generating an architecture specific version of a library. Other target dependency tokens provide commonly required conversion routines.

It is recommended that these tokens should not be used directly within application programs. They are designed for use within LPI definitions, which can provide a more appropriate interface for applications.

3.1. Integer variety representations

Since TDF specifies integer representations to be twos-complement, the number of bits required to store an integer variety representation fully specifies that representation. The minimum or maximum signed or unsigned integer that can be represented within any variety representation can easily be determined from the number of bits.

3.1.1. .~rep_var_width

	w:	NAT
		-> NAT

If w lies within the range of VARIETY sizes supported by the associated installer, rep_var_width(w) will be the number of bits required to store values of VARIETY var_width( b,w), for any BOOL b.

If w is outside the range of VARIETY sizes supported by the associated installer, rep_var_width(w) will be 0.

3.1.2. .~rep_atomic_width

		-> NAT

.~rep_atomic_width will be the number of bits required to store values of some VARIETY v such that assign and assign_with_mode are atomic operations if the value assigned has SHAPE integer(v). The TDF specification guarantees existence of such a number.

3.2. Floating variety representations

Floating point representations are much more diverse than integers, but we may assume that each installer will support a finite set of distinct representations. For convenience in distinguishing between these representations within architecture specific TDF, the set of distinct representations supported by any specific installer are stated to be ordered into a sequence of non-decreasing memory size. An analysis tool can easily count through this sequence to determine the properties of all supported representations, starting at 1 and using .~rep_fv_width to test for the sequence end.

3.2.1. .~rep_fv

	n:	NAT
		-> FLOATING_VARIETY

.~rep_fv(n) will be the FLOATING_VARIETY whose representation is the nth of the sequence of supported floating point representations. n will lie within this range.

3.2.2. .~rep_fv_width

	n:	NAT
		-> NAT

If n lies within the sequence range of supported floating point representations, .~rep_fv_width(n) will be the number of bits required to store values of FLOATING_VARIETY .~rep_fv(n).

If n is outside the sequence range of supported floating point representations, .~rep_fv_width(n) will be 0.

3.2.3. .~rep_fv_radix

	n:	NAT
		-> NAT

.~rep_fv_radix(n) will be the radix used in the representation of values of FLOATING_VARIETY .~rep_fv(n).