Language Translation Compilation vs. interpretation Compilation diagram Step 1: compile Step 2: run...

Language Translation

Compilation vs. interpretation

Compilation diagram

Step 1: compile

Step 2: run

program Compiled programcompiler

input outputCompiled program

• compilation is translation from one language to another, where the translated form is typically easier to execute; a pure compiler produces language that will be directly executed by hardware

• compilation allows one translation and then multiple executions of the executable file (sometimes called a binary file, or load module); thus a fairly large amount of time can be spent by the compiler doing analysis and optimization once, in order to produce an executable that runs quickly each time it is run

• a compiled program typically runs fast but is harder to debug

• compiler example: gcc

Interpretation diagram

single step program

interpreter

output

• interpretation skips the intermediate step of producing a form of the program in another language and combines translation and execution

• interpretation starts from the source code each time you want to run the program; it performs the same analysis as a compiler but on a source-line-by-source-line basis;

• a pure interpreter keeps no results from this analysis even when encountering the same source line repeatedly within the body of a loop (this means an interpreted program will run faster if you make all the variable and function names only one or two characters in length and remove all the comments -- but I don't recommend doing this!)

• an interpreted program typically runs slow but is easier to debug because of better run-time error diagnostics

• interpreted languages easily support dynamic typing and dynamic scoping of variables

• interpreter examples: shells, m4 or python on the command line; also, formatted I/O (e.g., printf) relies on interpretation

hybrid approach diagram

Step 1:

Step 2:

program byte codecompiler

byte code

outputJ VM

• Java compiler and JVM interpreter - a hybrid translation model

− "javac" produces byte code, which is easy to interpret

− "java" interprets byte code

• provides for portability of byte code files across numerous systems

• Perl also has a hybrid translation model

• other hybrid translation models include just-in-time (JIT) compilers, which compile functions/procedures at run-time, on the first call

• terminology - source code that needs to be compiled is typically

− called a "program" while source code that is interpreted may be

− called a "script" (but may be called a "program" also)

Major translators in the compilation model

1. language preprocessor - textual substitution and conditional compilation (direct execution of special statements)

2. compiler - lexical analysis, parsing, code generation, optimization

3. macro processor - textual substitution and conditional assembly

4. assembler - translate symbols into addresses and machine code

Major translators in the compilation model

5. linker - external symbol resolution plus relocation, produces executable

6. loader - relocation according to load address, produces memory image

(note many compilers generate object code directly - without calling a separate assembler)

Compile steps

assemblylanguage

(.s)(.asm)

source(.c)

expandedsourcecode

object code(.o)

(.obj)

executableload module

(a.out)(.exe)

assemblysource

w/ macros (.m)

library routine

languagepreprocessor

compiler(ccom).

compiletime

assembler(as).

linker(ld).

macroexpansion and

conditionalcompilation

assemblytime

linktime.

macro processor(m4)

macro expansion and conditional assembly

staticlinking

Load and run steps

search for file name

executable(load module)

(a.out)(.exe)

library files(Microsoft

shared objects(.so)

command interpreter(shell) loader

fetch/decode/execute in CPU

load-time linking(early Windows)

dynamic linking

run-time linking(most systems)

memory . . . . . (. . . machine langguage. . . . .). .image. . . . . . . (. . . instructions and data . . . ). . . .

Translators (language preprocessor, e.g, for C)

− special syntax for preprocessor statements, e.g., #include

− macro facility, #define - trivially used for constant substitution

− conditional compilation, #ifdef - used for versioning

#ifdef VERBOSE

printf( "value of a is %d\n", a );

#endif

where "#define VERBOSE" is included in the program source or where you compile with "gcc -DVERBOSE"

Translators (compiler)

− lexical analysis: extracting lexical items ("tokens") from the input

− syntactic analysis: parsing statements according to the grammar rules of the language, generates a parse tree

− semantic analysis: determining the meaning of operations according to the datatypes of the variables in the parse tree, may involve adding conversion operators to the parse tree

− intermediate code generation

− machine-independent optimizations, e.g., loop transformations

− machine-specific code generation and register allocation

− machine-dependent optimizations, e.g., branch delay slot scheduling

consider the statement a = b + 2*c; in the following code

float a,b; extern float c; ... a = b + 2*c; ...

lexical analysis extracts eight tokens and assigns symbolic identifiers to entries in the symbol table

`a' `=' `b' `+' `2' `*' `c' `;'

symtab[0] `= ' symtab[1] `+' `2' `*' symtab[2] `;'

syntactic analysis builds a parse tree

symtab[0] +

symtab[1] *

`2' symtab[2]

semantic analysis determines meaning

=:float

symtab[0]:float +:float

symtab[1]:float *:float

convert_to_float symtab[2]:float

intermediate code generation yields something like

convert_to_float( 2 , temp_float_0 )

multiply_float( temp_float_0 , symtab[2] , temp_float_1 )

add_float( symtab[1] , temp_float_1 , temp_float_2 )

store_float( temp_float_2 , symtab[0] )

machine-independent optimization goes ahead and either does the conversion at compile time or strength reduces the multiply by 2 to an add

add_float( symtab[2] , symtab[2] , temp_float_1 )

add_float( symtab[1] , temp_float_1 , temp_float_2 )

store_float( temp_float_2 , symtab[0] )

from this registers would be assigned and ARM code would be generated (including storage allocation and addressing for variables)

Translators (macro processor)

− simple abstraction through textual substitution ("open" subroutines)

− provides either keyword or positional parameter substitution

− extends instruction set by synthesizing instructions using macro definitions

− cost occurs at assembly time of expanding macro definition, not at run

− time of procedure call, register save/restore, and procedure return

− conditional assembly is same idea as #ifdef facility of C preprocessor

comparison of macro with run-time functions

macro function

invocation in-line substitution run-time call and return

parameters untyped typed

evaluated at each evaluated once at time appearance of call

trade-offs fast but one copy of more overhead per call but code at each call site only one copy of code

Translators (assembler)

• translates program written in assembly language to binary machine code

• resolving local symbolic addresses; typically this is 1-to-1 translation

Translators (assembler)

• forward references generally require 2-pass assemblers

pass 1: find symbolic labels and assign them addresses

run location counter (virtual instruction pointer)

determine instruction size

record addresses in symbol table

pass 2: use symbol table information to construct instructions

symbolic -> binary

alternative to 2-pass approach is 1-pass with fixup (i.e., backpatching)

other assembler facilities include data layout directives (pseudo-ops)

Translators (linker)

separate assembly or compilation means the assembler does not know all the addresses, thus the assembler produces only partially-resolved object files

linker combines separate object files into a single executable

− layout pieces of code & data (storage allocation based on sizes)

− resolve external references

− perform relocation of absolute addresses

two pass:

1. assign code and data to memory addresses and build symbol table from public symbols

2. use table to resolve external addresses and produce load module

• object module file format (this is early UNIX; ELF is more complex)

- header (includes sizes of text, data, and bss sections)

- text section (read only)

- data section (read/write)

- relocation/external symbol entries for text section

- relocation/external symbol entries for data section

- symbol table

- string table (symbol table entries index into string table)

Translators (command interpreter)

• command interpreter (shell) - a program that reads command lines from the keyboard (or from a script file) and either directly executes the command or searches for an executable file having that command name and then loadsand branches to that loaded program

Translators (loader)

• bring a program into memory in preparation for execution

• read file header to find size of pieces

• allocate memory area(s)

• read instructions and data from file into memory

• relocation - adjusting absolute addresses relative to load point

• jump to startup code

Binding times

The assembler, linker, and loader are all programs taking input files and producing output.

Decisions and translations made by these programs are said to be done at "assembly time", at "link time", and at "run time", respectively.

Actual execution (i.e., instruction interpretation by the hardware, such as performing adds, branches, etc.) takes place at "run time".

Binding times

• During execution, you can also talk of things happening at specific times, such as register saving at procedure call time.

• Dynamic linking is an example of a late decision, or "late binding".

− It is the linking of separate procedures at either load time or run time,

− and it typically requires that the normal (static) linker include a simple table that names the needed routines (for load-time linking) or include simple "stub" routines that find and link to the shared library routines on their first calls (for run-time linking).

Binding times

• Another form of delayed binding is "just-in-time" (JIT). This is used in several Java compilers, where methods are not compiled until the first call.

− Many storage allocation decisions are made at each step. For example, offsets are assigned to labels at assembly time, under the assumption that

− any absolute addresses will be updated by the linker and loader later.

(When we later study virtual memory, we will see that it is also an example of late binding - specifically one where physical memory allocation decisions that might be made by a traditional loader are instead deferred to run time and made by the operating system.)

other programming tools

other programming tools / components of a program development environment

editors (e.g., vim, gedit, emacs)

beautifiers (e.g., indent)

project control (e.g., make)

version control (e.g., sccs)

GUI toolkit (e.g., widget library)

test coverage (e.g., gcov)

debuggers (e.g., gdb, dbx, ddd)

other programming tools

debugging tools (e.g., Purify)

reading or writing beyond the bounds of an array

reading or writing freed memory

freeing memory multiple times

reading uninitialized memory

reading or writing through null pointers

overflowing the stack by recursive function calls

reading or writing memory addresses on which a watch-point has been set

portability advisors (e.g., lint)

style checkers (e.g., CodeCheck)

exceeding a given input line length

exceeding a given nesting depth of if-else stmts.

not aligning open and close curly braces (Horstmann)

performance profilers (e.g., gprof)

Language Translation Compilation vs. interpretation Compilation diagram Step 1: compile Step 2: run...

Documents

Self Compiled Plot - Step to Understanding - Ana Rink - Ronny Reichmann

What is Culture? - University of WarwickCore Concepts _ What is Culture? _ A Compilation of Quotations Compiled by Helen Spencer-Oatey Reference for this compilation Spencer-Oatey,

Kermeta in compiled mode€¦ · Kermeta in compiled mode Compilation process • A compilation process is executed – in Eclipse – by a right-click on the main Kmt of the Kermeta

Compilation of NYSERDA sponsored marketplace & · PDF fileCompilation of NYSERDA sponsored marketplace & technology transfer ... Market Potential (overview) ... compiled from unit

USER MANUAL TABLE OF CONTENTS · Step 2: Disable Compilation • Log into Magento Admin Panel and go to System → Tools → ompilation and disable the compilation. After step 5 you

GF2016-9 - British Columbia€¦ · During data compilation ... data compilation step. These include errors such as unrealistic ... each source publication are checked against the

Full Proof Cryptography: Verifiable Compilation of Efficient ...of each compilation step except code generation. The top level compiler is independent of the bottom level verification

DNL eBook Printing - AbundantHope.orgkrishnamurti.abundanthope.org/index_htm_files/H.P... · H. P. Blavatsky on Occultism Compilation of texts by Helena Petrovna alavatsky Compiled

Compiled AASB 112 (Oct 2009)€¦ · AASB 112-compiled 3 CONTENTS CONTENTS COMPILATION DETAILS COMPARISON WITH IAS 12 ACCOUNTING STANDARD AASB 112 INCOME TAXES Paragraphs …

Compiled AASB 112 (Oct 2009) · AASB 112-compiled 6 COMPILATION DETAILS (e) Entities may elect to apply this Standard to annual reporting periods beginning on or after 1 January 2005

Step By Step Writing Compiled by Karadean Grayson from Step Up To Writing by Maureen E. Auman

Compiled AASB 117 (June 2009) · 2013. 11. 22. · AASB 117-compiled 3 CONTENTS CONTENTS COMPILATION DETAILS COMPARISON WITH IAS 17 ACCOUNTING STANDARD AASB 117 LEASES Paragraphs

Compiled AASB 2 (Jul 2009) - Australian Accounting ...€¦ · aasb 2-compiled 3 contents contents compilation details comparison with ifrs 2 accounting standard aasb 2 share-based

Unofficial Compilation - hawaii.govfiles.hawaii.gov/dlnr/dobor/rules/compiled/HAR235...Unofficial Compilation 235-6 the necessity for a hearing for any activity which does or may endanger

Impairment of Assets - aasb.gov.au · aasb 136-compiled 3 contents contents compilation details comparison with international pronouncements accounting standard aasb 136 impairment

Southampton: Oct 99Asynchronous Circuit Compilation- 1 AMULET3-H n Asynchronous macrocell ARM compatible processor core Full custom RAM Compiled ROM Balsa

A COMPILATION OF NEWSPAPER CUTTINGS CONCERNING: ST … · 2019-02-05 · a compilation of newspaper cuttings concerning: st anns church school – founded 4th october 1865 compiled

Dataset compilation and statistical analysis · Box S1 | Dataset compilation and statistical analysis Data on development compounds were compiled independently by the four companies

Compiled Auditing Standard...This compilation was prepared on 8 July 2020 taking into account amendments made by ASA 2013- 2, ASA 2015-1, ASA 2018-1 and ASA 2020-2. Compilation Number:

Montana Compilation of School Discipline Laws and Regulationssafesupportivelearning.ed.gov/sites/default/files... · regulations were compiled through exhaustive searches of legislative