PureBASIC Archives — TAJGA FASM Tutorial

This is a reprint of the TAJGA FASM Tutorial by «Vid», from the TAJGA Team. License: unrestricted. Last edited by the author in 2004-12-27.

Ported to Markdown, edited and reprinted by Tristano Ajmone @tajmone (2017-01-24). Original tutorial files downloaded from:

CHANGES — Some changes made to original text:

English text polished to improve readability.
Some paragraphs rewritten to improve clarity.
Fixed a few code typos/errors.
Added or corrected links to mentioned references.
Styled notes, tips and warnings with CSS Alert Boxes.
Applied highlighter style to key passages.

The original website of the TAJGA Team is now defunct (tajga.kallimagarden.com), but the team has now an organization profile and a website on GitHub.

Introduction

For whom?

It is meant for beginners, so almost no programing knowledge is required. Altough learning assembly as first programing language is quite hard, it is possible. You only need to know how to use the command line (command.com in DOS/Win95/98, cmd in WinNT/XP). Some programming knowledge is very helpful, but not nescessary.

What OS?

I decided to write tutorial for DOS, because it allows you to use whole your machine however you wish, unlike Windoze. I just want this tutorial to cover more assembly topics than Windows allows, so DOS is my only choice. Porting from DOS to Windows isn’t very difficult, if you already know protected mode. IczLion’s tutorial covers this, and is also translated to FASM. I personally don’t like starting learning assembly under Windows, because there are too many things abstracted from you.

Development status of tutorial

This tutorial is very far from being complete, but it already contains enough informations to be worth reading for beginners and intermediates.

Additional articles

There are also some other articles connected with assembly and FASM included in this tutorial package.

Legal stuff

This tutorial is without any warranty (you are using it at your own risk :) ). That also means you can use it however you like, and/or do whatever you like with it. If you would like to include tutorial or some part of it in a project you are working on, or you want to translate it to some other language, or something like that, then it would be nice if you wrote me an e-mail and mention me somewhere in your work.

Translating this tutorial

There were several requests for permission to translate this tutorial. Of course, I’ve granted it; but I realized that it is hard to maintain such a translation: if I edit the already-writen text, the maintainer doesn’t always know about it. So I decided to create change log, inside which all changes will be logged. The log will be provided on request.

Also, if you find that something in this tutorial isn’t explained comprehensibly enough , please tell what is it, where in tutorial is it mentioned, and what you don’t understand about it, so I can add it for future readers.

This tutorial was translated to HTML (from original plain-text version) by Decard. He also created a utility to convert FASM source to HTML text with syntax highlighting used for code blocks in this tutorial. There is also online version of this tutorial hosted at his site.

1. Getting Started

I assume you have some basic knowledge about what bytes are, and an idea on what ASCII code is. Maybe I’ll describe ASCII in a next versions of tutorial.

First, try to compile and empty source file. Just create an empty “empty.asm” file and type in the command line:

You should see that a “empty.bin” file is created, and it’s length is zero.

Now we will create binary file containing some data. Create a text file containing the follwoing line:

and compile it (I hope you already know how). When you look at the created file you should see that it’s 1 byte long and it contains character “a”.

Now let’s analyze the source: db is a “directive” (a directive is command to the compiler, remember this!) which means “define byte”. So this directive will put a byte into the destination file. Value of byte should follow this directive. For example, db 0 will insert a byte with value 0 into the destination file. But if you wanted to insert a character, you would have to remember its ASCII value. In this case, you can enter the character enclosed in straight single quotes (') and the compiler will “get” its value for you. This is how the above code works.

directive

A command to the compiler.

I think it should be clear how this works: it stores three bytes into the destination file, which should now contain a simple line with 123. By the way you can’t write:

because every directive must be on separate line. But if you want to define more bytes, you can use a simple db directive followed by multiple values, sperated by commas (,):

But what if you wanted to define something longer, for example a file containing This is my first long string in FASM? You could write:

but this isn’t very nice. For this reason, if you want define more consecutive characters using db , you can use this form:

string, quoted string

Text enclosed in quotes is called a “string”. In general, a “string” is an array of characters. The term for denoting a string inside source code is “quoted string”.

2. First Program

You may wonder why I’m fooling about creating text files when you want to learn assembly. But text files are just some “arrays” of bytes. You haven’t learnt just how to create a text file: you learnt how to define a file containing any data you want! And this is what a runnable program is — a special “data” file, an array of numeric values, called “machine code”. You only have to know the meaning of these values :). Of course, it’s very hard to remember all the values and their’s meanings, and this is what an assembler is for: It translates programs from a human acceptable language to machine code. Therefore, you only have to learn this human acceptable language :).

machine code

An array of numeric values that represents instructions to the processor (CPU).

Now we’ll look into DOS .COM (COM files) programs (occasionaly called “memory image” — you will learn why later on, when you get into it). These are the simplest executable (runnable) files under DOS and Windows.

Compile this to a .COM file and run it. Nothing should happen. Now let’s look at what those two lines of code mean. (This is going to be funny…)

Right now, I won’t explain what this directive does. Just put this line in the begginning of every .COM file! It doesn’t define any data, it doesn’t even do anything that you can notice. We’ll get back to this later on.

This is an “instruction”. An instruction is a command for the processor, which is stored in the created file as one or more bytes. When you run a .COM executable file, the processor walks through it and decodes its instructions from machine code and does what these instructions instruct it to. Instruction int 20h says that this is the end of execution of the file. So, the first instruction in this code tells the processor to stop execution, therefore the executable file does nothing, as you saw.

instruction

A single command to the processor.

BY THE WAY: int 20h is NOT the processor instruction which ends the execution of the .COM program. It instructs the processor to call a system procedure. The system procedure to be called is chosen by the number following int — in our case: number 20h (it IS a sort of number), which means calling the procedure to end a .COM file.

int could be followed by a different number, and a different system procedure would be called. But right now, we can abstract away from this, forget about it and take int 20h as the instruction to stop a program.

So, “machine code” is a set of “instructions”. There is a difference between directives and instructions. Directives are commands for the compiler — how it should define data, and what data it should define. Instructions are defined data which encodes what the processor will do when you execute the program. For example, db 0,0 is a directive which defines two zero bytes, but it is an instruction in case it is executed, because two zero bytes have special meaning for the processor (don’t bother about their meaning right now). org 256 is a directive, not an instruction, because it doesn’t define any data. You will get into this by practice.

Instruction int 20h is simple, it doesn’t need any arguments (=parameters, or values which changes its effect). But what if some instruction DOES need arguments? For this reason the processor has it’s own “variables” (variable is a general term for a space which stores some value). These variables are called “registers”. The first registers we’ll learn are al, ah, bl, bh, cl, ch, dl, and dh, which are byte-sized (they contain a value within the range 0 to 255).

register

An “internal” processor’s variable.

BY THE WAY: int 20h takes its argument in the AL register, but, again, we can abstract from this. And, in fact, value 20h is an instruction argument too, but we abstracted from this before. This is what I was talking about when I mentioned that this was “going to be funny”.

Now, how do we set the value of a register? There is a instruction which does this, for example:

this instruction sets the value of the al register to 10. mov stands for “move”. The destination of “moving” follows mov (separated with spaces) — in our case, it’s the al register. Then comes the source of “moving”, separated by a comma (,) — in our case, it’s number 10. So this instruction “moves” value 10 to register al. Another example of moving:

This copies the value in the bl register to the al register. It won’t change the value in the bl register. The source of mov always remains unchanged.

NOTE: You will often find people talking about the mov instruction. But mov is not instruction, and int is not an instruction either. mov al,bl and int 20h are instructions, for example. mov and int are called “instruction mnemonics”. But just accept it: everyone calls it “instruction”; and probably you will too, after some time — and I probably will too, sorry :).

Arguments of an instruction (the part of the instruction without instruction mnemonics, like al and 10 in mov al,10) are called “instruction operands” (or “instruction arguments”)

instruction mnemonics (this term is not so improtant right now)

instruction operand

Now let’s get to how registers are used. We will use the int 21h instruction which can do MANY things depending on the value in ah register. We won’t learn the meaning of every value, right now we will talk only about value 2. If value 2 is in the ah register when instruction int 21h is executed, then the character in dl (more precisely: the character whose ASCII code is in dl) is printed to screen (console).

NOTE: if you are using a Windows file manager (like Total Commander), you’ll see a window appear for a very short time and then disappear. Our character is being displayed in this window and you probably can’t notice it. You must run a shell (cmd on XP, command on older Windozes) and run your program from it. Anyway, if you can’t handle this, forget about assembly for a while and learn using your operating system first — and then, don’t forget to return to assembly!

this moves character “a” into dl register. (In fact, there is nothing like “character a” in assembly. You might have noticed that I wrote that registers can contain numeric values. Nothing about characters. The way this works is that the compiler translates a character enclosed in quotes into its numeric (ASCII) code, which is then recognized by int 21h as the code for this character. In assembly, character “a” means ASCII code for character “a”)

In this case, when “ah” contains value 2, this prints the character in “dl”

And we musn’t forget to stop execution. Otherwise, the program will most probably crash.

NOTE: In assembly a character enclosed in quotes is the same as the ASCII code of that character.

we don’t have to set ah to 2 again, for the second int 21h, because ah will retrain the value of 2 previously set. Also dl will retain its value, therefore the following code:

3. Labels, Addresses and Variables

Okay, let’s get to variables. In the previous chapter I wrote that variable is general term for a space which stores some value. Registers, for example, are variables. But there is a limited number of registers (VERY limited: around 8 + a few special ones), and their number is rarely sufficient. For this reason memory (RAM — random access memory) is used.

NOTE: When someone says “variable” he almost always means memory variable.

3.1. Labels

The problem is that you have to know WHERE in memory some value is being stored. A position in memory (called “address”) is given by number. But it’s quite hard to remember this number (address) for every variable.

term: address

A number which gives a position in memory.

Another problem with addresses is that when you change your program, addresses might changed too, so you would have to correct their number everywhere they are used. For this reason addresses are represented by “labels”. A label is just some word (not a string, it is not enclosed in quotes) which, in your program, represents an address in memory. When you compile your program, the compiler will replace the label with the proper address. Labels consists of alphabet characters (“a” to “z”, “A” to “Z”), numbers (“0” to “9”), underscores (“_”) and dots (“.”). But the first character of a label can’t be a number or a dot. Also, a label can’t have the same name as a directive or an instruction (instruction mnemonics). Labels are case sensitive in FASM (“a” is NOT same as “A”).

LABEL	STATUS
`name`	a label
`a`	a label
`A`	a label, different from “`a`”
`name2`	a label
`name.NAME2`	a label
`name._NAME2`	a label
`_name`	a label
`_`	a label
`.name`	not a label, because is starts with a dot
`1`	not a label, because it starts with a number
`1st_name`	not a label, for the same reason
`name1 name2`	not a label, because it contains a space
`mov`	not a label, because “`mov`” is an instruction mnemonic

NOTE: Labels starting with dot have special meaning in
FASM, which you will learn later.

term:
label

A placeholder for some address; ie: a placeholder for some number, because an address is a number.

In FASM you can use a label the same way as any other number (not really, but it doesn’t really matter for you right now).

You can define labels using the “label” directive. This directive should be followed by the label itself (the label name). For example:

This directive defines a label that will then represent the address of the data defined behind it.

DIRECTIVE	LABEL STATUS
`label name`	a label definition, it defines label “`name`”
`label _name`	a label definition, it defines label “`_name`”
`label label`	not a label definition, because a label can’t be named “`label`”

directive: label

label definition

The label directive followed by a label-name.

A shorter way to define a label is to write just the label name followed by colon (:), like this:

3.2. Defining Variables

Now let’s go back to the problem with variables: how to define a variable in memory. The program you create (a compiled program, in machine code) is loaded to memory at execution time, where the processor executes it, instruction by instruction. Look at this program:

This program will probably crash, because after the processor executes mov al,10 it reaches a string. Inside a program there is no difference between strings and instructions in machine code. Both are translated into an array of numeric values (bytes). There is no way the processor can distinguish whether a numeric value is the translation of a string or the translation of an instruction. In this example, the processor will execute instructions whose numeric representation (in machine code) is the same as the ASCII representation of the string “this is a string”. Now look at this:

This program will not crash, because before reaching the bytes defined by the string the processor reaches instruction int 20h, which ends the program’s execution. Therefore the bytes defined with a string will not be executed, they will just take up some space. This is how you can define a variable — define some data in a place where the processor won’t try to execute it (beyond int 20h, in our case).

Now, how can we access a variable? First, we must know the address of the variable. For this purpose we can use a label (described above, re-read it if you have forgotten):

So we already know the address of variable: it’s represented by the label my_first_variable. Now, how do we access it? You might think that you could do this:

but you can’t! Remember, I told that a label (my_first_variable in our case) stands for the address of the variable. So this instruction will move the address of the variable to the al register, not the variable’s contents. To access the contents of a variable (or the contents of any memory location) you must enclose its address in square brackets ([ and ]). Therefore, to access the contents of our variable, and copy it’s value to al, we use:

To set the value of variable1 (exactly: to set the value of a variable which is stored at the address represented by variable1) to 10 we could try:

but this will cause an error (try it yourself if you wish). The problem here is that you know that you are changing the variable at address variable1 to 10. But what is the size of the variable? In the previous two cases a byte-size could be determined because you used the al register which is byte sized, so the compiler decided that the variable at variable1 is byte sized too, because you can’t move between operands with different sizes. But in this case, value 10 could be of any size, so it can’t decide the memory size of the variable. To solve this we use “size operators”. We will talk about two size operators for now: byte and word. You can put the size operator before the instruction operand when accessing it, to let the compiler know what the variable size is:

in this case the compiler knows that value 10 being moved is byte sized, so it decides that the variable is byte-sized too (because we can move a byte sized value only to a byte sized variable).

But it would be hard to always remember and always write the size of a variable when you access it. For this reason you can assign the size of the variable to its label when you define it. Just write the size operator after the label’s name in its definition:

now, every time you use [variable1] it will have the same meaning as byte [variable1] (or word [variable1] in the second example). So mov [variable1],10 will work — in the first case, it will store value 10 into the byte at address variable1; in the second case, it will store it into a word.

NOTE: You can’t move values between variables of different size:

mov byte [variable1], word 10

mov [variable1],al
...
label variable1 word
dw 0

NOTE: You can’t access two memory locations in one instruction (except for some special instructions). This is wrong, and it won’t compile:

mov [variable1],[variable2]

use this instead:

mov al,[variable2]
mov [variable1],al

This will cause you some problems in the beginning, but it will force you to write faster code — which is the main reason for coding in assembly.

NOTE: The size operator assigned to a label in its definition has lower priority than a size operator within an instruction for accessing a variable; therefore:

mov byte [variable],10
label variable word
dw 0

will access a BYTE, while

mov [variable],10

will access a WORD.

I think you noticed that having two lines to define one variable is too much. There is a shorter way to define variables:

notice that size of variable is defined too. In general, if data definiton (using db or dw directive) is preceded by a label, then it will define this label too, and assign to the label the same size of the defined data. It can be used with words too:

3.3. Addresses and basics of segmentation

Now we will discuss addresses a little further. I told you that an address is number (!) which refers to a position in memory. You’ve learnt how to represent this number with labels, so that their numeric addresses are managed by the compiler. But you still don’t know anything about the format of this number. I will try to explain it a little in this chapter.

As you probably know, data in memory are stored in “bits” which can have value 0 or 1. You can consider memory as a (one dimensional) array of bits. 8 consecutive bits make one byte. An address is the number (index, position in array) of a byte. For example address “0” is the address of the first bit of memory (or address of the first byte), address “1” is the address of the eighth bit (or address of the second byte) of memory, etc. The easiest way to comprehend it is to think of memory as a (one dimensional) array of bytes

is wrong. It may work if var1 is less than 256 (so it fits into a byte sized register), but as a general rule store addresses in word-sized variables — we’ll talk about them later on.

here the address represented by variable1 is 0, whereas variable2 stands for 1, and variable3 for 2.

OK, this looks nice except that it is’nt true, at all! The problem is that usually there are multiple programs loaded in memory at the same time (operating system, mouse driver, you program, etc.). In this context the program would have to know WHERE in memory will it be loaded so it can access it’s variables. For this reason addresses are “relative”. It means that for each loaded program is reserved a region in memory called “segment”. All memory addresses accessed by this program are going to be relative to the begginning of this region. So [0] doesn’t mean the first byte of memory, but the first byte of the segment.

segment

A consecutive region of memory reserved for a program.

How does this work? The processor has a few special registers (segment registers) which hold the address of the segment (ie: the address of the first byte of the segment). Every time you access memory in your program the content of this segment register is added to the address you provided; therfore mov al,[0] accesses the first byte of your segment.

NOTE: I have told that memory addresses in .COM programs are words. That means they can be in the range 0 to 65535. Therefore the maximal size of a segment is 65536 bytes. This can be “tricked” by changing the content of segment registers, but don’t bother with this right now.

NOTE: A segment is a region in memory. But the term “segment” is often used to indicate the beginning address of this region. Sad but true.

So an absolute address in memory has two parts: segment (exactly: the address of the segment’s beggining) and, as second part, a word-sized value called “offset” which is the address relative to the segment (ie: address of segment’s beginning).

offset

An address relative to a segment, or address “inside” a segment.
(the first definition is more exact, but the second is easier to comprehend)

IMPORTANT: I stated that labels represent the address of a variable. As a matter of fact, labels in FASM represent the offset of a variable. This is why FASM is called “flat” assembler — you’ll understand this later on (much much later :)).

I won’t get deeper into segment registers — on how a segment’s beginning address is stored in them (there IS difference). Right now, take segment registers as some kind of black box that works even if we ignore how.

3.4. The ‘org’ directive explained

As your program is loaded, it often needs some external info from the program that launched it. The best example is command line arguments; or it may need know WHO launched it; etc. This value must, of course, be stored in the same segment of the program. In .COM files this data (passed to your program by the program that you launched it from) is stored in the first 256 bytes of the segment. Therefore, your program is loaded from offset 256 onward.

NOTE: The 256-bytes structure at the beginning of a .COM file is called “PSP”, which stands for “program segment prefix”

(notice: no org 256 directive). Instruction mov al, [variable1] takes up 3 bytes, int 20h takes up 2 bytes, therefore variable1 will stand for offset 5. Therefore instruction mov al,[variable1] is mov al,[5]. So this instruction accesses the 6th byte of the segment (first byte is at offset 0). But I already told you that the first 256 bytes of the segment store some informations, and that your program is loaded beyond them, from offset 256 onward. So you don’t want variable1 to be 5, you want it to be 256+5. And this is what the org directive does: It sets the “origin” of the file’s addresses. org 256 tells FASM to add 256 to the offset held by every label defined beyond this directive (before the next org directive). And this is exactly what we want in .COM files.

Therefore the previous code example won’t access the variable you want, it will access something in PSP (first 256 bytes of segment). To make it work properly use:

NOTE: org affects labels at defintion-time (for example at label variable byte or variable db 0), not when they are used (like at mov ax,[variable]). That means, that if you change addresses’ “origin” via the org directive after defining some label, that label will still hold the same value before and beyond the org directive.

I won’t tell you anything about the data contained in the PSP, you dont have to worry about it for now.

4. Endian Encodings and Word Registers

4.1. Endian encodings

We should already have a precise idea about byte variables. You already know they are 8 bit wide (not so important now) and that they can contain a numeric value ranging from 0 to 255. Regarding word variables, you know that they are 16 bits wide and they contain a value ranging from 0 to 65535.

Whether you can see it or not, a word has the same size as two bytes. Now let’s deal with how values are stored in two bytes. Both bytes can contain a value ranging from 0 to 255. From their combination we get 256*256, that is 65536. But how is this value actually stored in two bytes?

Let’s say one of the bytes (byte#1) holds value 0. The other byte (byte#2) can hold a value from 0 to 255. In this case we can store numbers ranging from 0 to 255 in our word. Now let’s suppose that byte#1 holds 1; we can store in the other byte a value 0-255, which gives us numbers 256 to 511. When byte#1 contains 2, we can store 256 other possible values in the other byte, which gives us numbers 512 to 767; and so on. In total, we have 256*256 combinations which, as I said, amounts to 65536.

It is like with decimal numbers: every digit is a value 0 to 9, and the “true” value of a digit depends on it’s position. The last digit holds value 0 to 9, the previous one holds 10*(0 to 9), the next one 100*(0 to 9), and so on.

It’s the same with words: One of the two bytes hold value 0 to 255, the other one holds value 256*(0 to 255). The byte holding 0..255 is called “low order byte”, the other one (holding 256*(0..255)) is called “high order byte”.

terms: low order byte, high order byte

On last problem remains: The order of these bytes. (ie: which comes first, low order byte or high order byte?). This is handled differently on different computers. On IBM PCs (and compatible) low order byte comes first, and high order byte second. For example, with:

then byte [variable] is the low order byte, and byte [variable + 1] is the high order byte. (The + 1 addition to variable’s offset is carried out by the compiler, the value of variable is constant, so variable + 1 is constant as well. It means the next byte beyond variable’s offset. I think this should be clear enough to need no further explaination).

NOTE: When the low order byte comes first, then it’s called “little endian encoding”; when it’s the high order byte that comes first, then it’s called “big endian encoding”. But these terms are not important, especially for a beginner ASM coder.

4.2. Word registers

Beside the byte registers (like al,ah, dl…) the processor has also some word registrs, of course. As you know, a word is combination of two bytes, and it’s the same with registers. Word registers are a combination of byte registers. The first word registers we’ll learn are ax, bx, cx and dx.

ax is the combination of al and ah, where al is the low order byte, and ah the high order byte. The same goes for the rest: bx = bh:bl, cx = ch:cl, dx = dh:dl.

terms: word register

word registers: ax, bx, cx, dx

NOTE: The letters a, b, c and d stand for “accumulator”, “base”, “counter” and “data”, it has nothing to do with alphabetical order. The real order of these registers is ax, cx, dx, bx; but it is not important until you want to generate/change machine code yourself.

but it could also be done (no reason to do it this way in real coding, this is just to demonstrate word to byte:byte relations):

because 48 is equal to “12345 / 256”, and 57 is “12345 modulo 57” (modulo is the remainder after division).

NOTE: You know that the instruction operand can be a number (numeric constant), like “0”, “256”, “12345” etc. But every assembler I know of allows you to place an expression as operand. During compilation, the value of the expression is evaluated and the expression is “replaced” by it’s result. So mov dx,(1 + 5) is same as mov dx,6. Therefore the previous code example could be better written as

mov dh,12345/ 256
mov dl,12345 mod 256

(“/” is the division operator, mod (modulo) is the operator which returns the remainder of a division. You don’t have to know these operators right now, but you should already know something about expressions).

The processor has also other word registers: sp, bp, si, di. But you can’t access directly the byte parts of these registers, you must access the whole word. This is a limitation of the processor, so there’s nothing you can do about it. For example, if you want set the high order byte of si to 17 you must do it this way:

So first you copy the value of si to ax. The high order byte of ax can be accessed dirctly (it’s the ah register), so you set it to 17. The low order byte of the word remains unchanged. Then you copy back the value from ax to si. Now the word’s high order byte has been changed to 17, while its low order byte remains unchanged.

NOTE: Register sp always has a special function; bp usually has a special function (in code generated by most (all?) non-assembly compilers). Registers si and di can be used whenever you want. This means you shouldn’t change sp and bp unless you know what you are doing.

4.3. String output using int 21h/ah=9

This should belong to Chapter 3, about addresses, but you need to know the dx register which is explained here.

Here we will talk about another usage of int 21h. You should already know that when ah contains 2 then int 21h prints the character stored in dl. But if we wanted to display a long text we would have to set dl for every char, and this would be a bad method. Wouldn’t it be better if we just stored the string we want to display somewhere in a file (like we did in Chapter 1) and then just display it from here?

For this we can use int 21h with value 9 in ah and the string’s address in the dx register. Something like:

But another problems pops up: how to determine the length of the string, ie: the number of characters to display from the given address. There are different methods to achieve this, we will talk about the simplest one, the one used by int 21h/ah=9. It relies on a special character, which is reserved as end-of-string marker. With int 21h/ah=9, it’s the “$” character. So, to store the string “Hello World”, you define “Hello World$”, where “$” means end of string. Example of displaying a string:

This method of marking the end of a string has a limitation: you can’t print the “$” character. For example:

will of course print only “It costed 50”. This can be worked around this way:

The first part (first int 21h) will print “It costed 50”, then int 21h/ah=2, will print “$” and the second int 21h/ah=9 will print “, maybe more”. We won’t deal any further with this limitation, for now — this was just to improve on the explanation.

Let’s now take a closer look at int 21h/ah=9. As you maight have realized already, it will print every character (exactly: every character whose ASCII code is in byte form) from the address contained in dx until the first “$” character after the address in dx.

NOTE: ASCII codes 0 to 31 (I think) have a special meaning for int 21h/ah=9. These codes have characters assigned to them (smiling faces, diamonds etc.) but int 21h/ah=9 doesn’t print them, but does something else. For example, the character with ASCII code 7 will produce a short beep. Try this:

org 256
mov ah,9
mov dx,text
int 21h
int 20h
label text
db 'Beep',7,'$'

It should print “Beep” and then beep.

Other common values are 10 and 13: 10 causes the cursor to return to the first column of the current row; 13 causes cursor to down move one row (if them bottom of screen is reached, then the screen is scrolled). So a combination of these two causes the cursor to move to the first column of the next row. These two should (but don’t always do) work in any order, but you should always place 13 first. These two characters are often called EOL (end of line). Try this example:

NOTE: ASCII code 13 is called “CR” (carriage return) and code 10 is called “LF” (line feed).

Another example on addresses (previous chapter), but with word registers. Check by yourself whether you understood Chapter 3:

Here we load the dx register with the contents of address_of_text variable, which holds value text, and (as we already know) text is a placeholder for the offset of ‘Hello World$’ string. Thus the word-sized variable address_of_text holds the offset of that string. Therefore, loading dx with the contents of address_of_text will load it with the offset of the string we want to print. I hope you got it.

5. Jumps and Branching

You should know a little about how instructions are processed by the processor. It fetches an instruction in machine code, executes it, and then moves to next instruction. This is repeated until instruction int 20h is reached. In this chapter we will learn something about instructions which change this behaviour.

5.1. Instruction pointer

The processor loads the first instruction (it determines the number of bytes the instrution consists of), executes it and then moves to another instruction. But how does this mechanism works? The processor has a special word register “ip” which holds the address of the instruction currently executed. After the instruction is executed, the processor adds its size to “ip” and executes the instruction located at the (new) address in “ip”. This mechanism works like this:

NOTE: As with others pointers, “ip” doesn’t hold the full address of the instruction, just the offset part. Be we shouldn’t worry about this right now.

NOTE: “ip” stands for “instruction pointer”.

5.2. Jumps

The ip register is not like the other registers (ax, ah, bp, …). It’s contents can’t be changed using the mov instruction. mov ip,5 doesn’t work. But there is a special instruction which can change the value of ip register: it’s the jmp instruction (“jmp” = “jump”). This instruction has one operand, the new address for ip register. So jmp 5 has an effect like mov ip,5 would if it was an instruction. Example:

The first instruction sets the value of ip to the address of mov ah,9 instruction (its address is held in label Start). Thus the processor won’t try to execute the bytes defined by “Text to output” string and this program will work.

NOTE: Of course, when ip is changed by the jmp instruction, then the size of this instruction is NOT added to it.

5.3. Comparing and conditonal jumps

If you can write code in any language you should already know about branching, ie: conditional execution of some parts of code. For example, suppose you want a value not greater than 10 in al. If the value in al register is > 10, you will set al to 10. This is branching — if some condition is true then something is executed, otherwise it is not executed. Assembly implementation of this mechanism is that when a condition is false you will jump over the conditional code, when the condition is true you will just continue the execution. It is as if this C code:

The first problem is how to decide whether a condition is true. In assembly, there is an instruction which can compare two operands. It is cmp. Its operands follow the same rules as mov’s operands (almost every instruction follows these or very similar rules). Some examples of comparing:

This instructions checks whether the first operand is the same, greater or lesser compared to the second one.

instruction: cmp

OK, we can compare, but how are the results of comparison stored? The CPU (the processor) has a special register called “flags register” in which it stores results of comparison (and some other things too). This register (like ip) can’t be accessed with mov or similar instructions; its value is set by the cmp instruction. Right now you don’t have to bother HOW the result of comparison is stored in this register — you would need to understand bit arithmetics for that.

OK, we can compare, and we know that the result is stored in flags. The only thing we need now is the conditional jump itself. A conditional jump is a jump which is taken only when a condition you specified is true (in the flags register). It will be best explained on example. We compare ax to bx (cmp ax,bx). A conditional jump can jump if ax < bx, or when ax = bx, or when ax >= bx etc. These jumps are (op1 is first operand of cmp, op2 is second):

Example code (don’t try to compile it, it is not a .COM executable, it’s just a snippet of code):

this piece of code will check whether value in ax is less than or equal to 10, and if not (if the value in ax is greater than 10) it will set ax to 10. The corresponding C code is:

So compare ax to bx, if it is greater or equal then it already contains the greater value, so we dont need to change anything. If ax is less than bx then we must move bx‘s’ value (=greater value) to ax.

here we compare ax to bx, then if ax is less than bx the jump won’t take place and we continue with mov cx,bx, which stores the greater value in cx, as desired, and then jmp done skips the instructions used in case ax is greater. Otherwise, if ax is greater than bx, then jmp AX_bigger takes place, so the next instruction is mov cx,ax which moves the greater value (ie: the one in ax) to cx. As you can see, the code was divided into two “branches”: one for ax>bx, the other for ax<=bx. Finally, both branches reach the instruction beyond done:, and at this point cx always holds the greater value. By the way, there could be jae instead of ja, because for the case when ax=bx both branches have the same effect.

instructions: je, ja, jb, jae, jbe

But what can we do if we want to jump when operands are NOT equal? We could do something like this:

but this is not needed because there are instructions which jump when the condition is false. These are jne, jna, jnb, jnae and jnbe. Instruction jne jumps when operands are not equal, jna when first operand is not greater than second operand, etc. Therefore:

where the ... part is executed only if the value in ax is equal to the value in bx.

NOTE: jna is the same as as jbe, jnb is the same as as jae, jb is the same as as jnae, and ja is the same as as jnbe.

instructions: jne, jna, jnb, jnae, jnbe

IMPORTANT: Many instruction change the flags register, not just cmp, so conditional jumps should come right after cmp, with no instructions between them.

6. Bit Arithmetics

This is what most tutorials usually start with. After reading this you will be confused, it’s normal. You’ll master this through practice. Return to this chapter whenever needed. So let’s go.

6.1. Encoding numbers in bits

You know that computers use “bits”, which are variables that can contain one of two possible values: 0 or 1. When a bit’s value is 0, we say that it’s “clear”, when it’s 1, we say that it’s “set”

terms:

bit — A variable containing 0 or 1.

clear bit — A bit containing 0.

set bit — A bit containing 1.

Now, how can we store a number in these bits? It’s similar to storing a word in two bytes (Chapter 4.2, re-read it). One bit contains a 0 or 1 value, therefore a number that consists in just one bit can only contain values 0 and 1. When we add another bit, we can still store 0 and 1 in the first bit, but we have another bit which now can hold 2*(0 or 1). A further bit holds 4*(0 or 1), and then 8*(0 or 1), etc.

which is value ranging from 0 (when all bits are 0) to 1+2+4+8+16+32+64+128 = 255 (when all bits are 1). Can you see it?

It is similar with a word, except you have 16 bits instead of 8; check it yourself if you wish.

Now some terms: the bit which holds 1*(0 or 1) is bit#0; the next one, which holds 2*(0 or 1) is bit#1; and so on until bit#7, which holds 128*(0 or 1). So bits are enumerated starting from 0, not from 1 — as you would maybe exepect. Bit#0 is called the “low order bit”, the highest bit (which holds the greatest value) is the “high order bit”. For example, the high order bit of a byte value is bit#7, and the high order bit of a word value is bit#15.

terms: low-order bit, high-order bit

IMPORTANT: Bits are enumerated starting from 0, not from 1, so the first bit is bit#0.

6.2. Binary constants

You have been using numeric constants before, probably without realizing you were using them. These numeric constants were just numbers you wrote in a source file which was assembled into a binary file. Examples of numeric constants ar: “0”, “50”, “-100”, “123456”.

These numbers were decimal numbers, the type which is normally used by people. The assembler then translated them to binary form. But sometimes you want to specify numbers directly in binary format. Of course you don’t have to manually translate them to decimal, you can specify them directly in binary. Here are some examples of binary numbers: 0, 101011, 1101011, 11111, etc. To distinguish them from decimal numbers, every binary number must end with the “b” character, therefore: “0b”, “101011b”, “1101011b”, “11111b” etc. Here the first binary digit (the first bit, the first 0 or 1) is the high-order bit, and the last one is the low-order bit. So if you write “1101”, then bit#0 = 1, bit#1 = 0, bit#2 =1, bit#3 = 1.

I could teach you a way to translate numbers between decimal and binary forms, but you won’t need it just now anyway, and plenty of other tutorials are full of such information.

decimal	binary
0	`0b`
1	`1b`
2	`10b`
3	`11b`
4	`100b`
5	`101b`
6	`110b`
7	`111b`
10	`1010b`
15	`1111b`
16	`10000b`

Binary numeric constants are just another way to express some number. Writing “5” is the same as writing “101b”. For example, this will work too:

org 100000000b is the same as org 256, and mov ah,1001b is the same as mov ah,9

6.3. Bit operations

Now let’s think about what we can do with a bit (which can hold a 0 or 1 value).

First, we can “set” it (set its value to 1) or “clear” it (set its value to 0). Then we can “flip” its value (from 0 to 1, from 1 to 0). And that is probably all. This operation is also called “bit complement” (0 is the complement of 1, and 1 is the complement of 0).

Now, what can we do with 2 bits? You can think of bits as boolean values, which can be either true (1) or false (0). Now, what operations can we make with boolean values? If you programmed before you’ll probably know the answer.

First of all, there is and (like “a and b” where “a” and “b” are boolean values). When we have two boolean values, the result of anding them is true only when they are both true, otherwise the result is false. (See Table below)

Then comes or. As you know, the result of oring two values is true when at least one of them is true. And finally — and less known — there is xor, which means “exclusive or” (the previous one was “inclusive or”, but everyone calls it just “or”). The result of xoring is 1 when one operand is 1 and the other is 0.

A	B	A and B	A or B	A xor B
0	0	0	0	0
0	1	0	1	1
1	0	0	1	1
1	1	1	1	0

NOTE: There are 16 possible operations on two bits, but we won’t talk about all of them.

Now the interesting part: In late times, processors designers didn’t like having lots of instructions on their processors. But as you saw, we defined 3 operations for a single bit and 3 for two bits. So they found a way to achieve operations on single bit by using operations on two bits. Remember, the operations on a single bit were: setting it to 0, setting it to 1 and flipping its value (0->1, and 1->0). How?

First let’s talk about clearing a bit (setting its value to 0). Note that the result of and is 0 whenever at least one of operands is 0. So if we and any bit (0 or 1) with 0 we always get 0, and when we and with 1 the bit will reamin unchanged. And this is what we wanted. It is similar to setting a bit (to 1). The result of oring is 1 when at least one operand is 1. So oring any bit with 1 will always produce 1, and oring with 0 will leave the bit unchanged.

How can we flip a bit? The result of xoring is 1 when one operand is 1 and the other is 0. So xoring any value with 1 will always produce that value’s complement, and xoring it with 0 will leave the bit unchanged. This last one is not so obvious, so you better look at it in the Table.

6.4. Binary operations instructions

First of all, you know the the smallest registers we have are the 8 bits (byte) registers. Also the smallest part of memory that we can access is one byte (8 bits). For this reason, the instructions used for binary operations will operate on two 8-bit numbers instead of on two bits. What will be the result? Bit#0 of the result will be the result of the operation between bit#0 of the first argument and bit#0 of the second argument. Bit#1 of the result will be the result of the operation on bits#1 of the arguments, etc. You ’ll see it.

first we load al with 00010001b, so it’s bits #0 and #4 contain 1, the remaining bits contain 0. Then we load bl with 00000001b, so it’s bit#0 contains 1, the others contain 0. When we and al with bl (this is how asm coders usually describe it) it works as al = al and bl would – ie: the result of anding al with bl is stored in al.

So what’s the final result (in al)? Bit#0 of al contained 1 and was anded with 1. “1 and 1” is 1, so bit#0 in al will be 1. Bits #1 to #2 and #5 to #7 would be “0 and 0” which is 0. Bit#3 would contain “0 and 1” which is 0 too. Bit#4 will contain “1 and 0” which is 0 again. So the result will be 00000001b.

(I used bl in the previous example only to simplify referencing the second number in the text).

… the result will be 00011001b. (see: or description, in previous section).

… the result will be 00011000b — bits xored with 0 will stay unchanged, bits xored with 1 will be flipped (to their complement).

instructions: and, or, xor

These instructions take the same arguments as mov — ie: the first argument can be a memory variable or a register, the second one can be a memory variable, a register or a constant. Both arguments must be of the same size, and only one of the arguments can be a memory variable.

6.5. Testing bits

If you have programmed before, you probably already know about boolean variables (ocassionaly called “logical”). They can hold two values: TRUE or FALSE. You can see that they can be stored in a bit wuite finely — 1 for TRUE, and 0 for FALSE.

term: boolean variable

The problem here is that the smallest data directly accessible is a byte (8 bits). As you know, you can access a byte register or a byte memory variable, not a bit. It’s truely this way: there are no instruction which can access just one bit. (Of course there are, you just don’t need to know about them right now :))

But when you work with boolean variables you want to access just one single bit, not all 8 bits or more. There are some tricks to achieve this:

Use only one bit of the whole byte and leave the other bits cleared. Thus if you want to verify if the bit is 0, you just check if the whole byte is equal to 0. If it isn’t, then our bit is 1. Example:

where byte_boolean_variable is a byte varaible with only one bit used. When this variable is 0 then its value is FALSE, otherwise its value is TRUE.

byte_boolean_variable_is_*** are labels used for branching, as shown in a previous chapter. By the way, a better “more assembly” way to implement the previous code is:

beacause in the first version the jnz conditonal jump would always take place, because the instruction is executed only when je didn’t take place. If you don’t understand it, read again the previous chapter.

But this approach leaves 7 bits unused, and this is a waste of space. (Not in case of a single variable, but surely so with an array of similar variables). Clearly, we can “pack” 8 boolean variables into a single byte (8 bits). The only problem is setting and reading it.

this would set all variables to zero (clear them). If we want to set some of them to one, we just set the bits in which they are stored.

First, how to clear one bit and leave all others unchanged? We handled this before: we can do it by anding:

… this will clear bit#3 (anded with 0 so the result will be 0), and leave all other bits unchanged (anded with 1 so they will reamain unchanged). This will clear bits #3 and #5:

… this sets bit#3 to 1 (oring with 1 always yelds 1) and leave the others unchanged (oring with 0 leaves unchanged).

These were just a reminder, now let’s deal with how to check the value of bits. Checking the value of bit is called “bit testing”.

term: bit testing

You often need to test the value of some boolean variable and then do something (jump somewhere) if it is (or isn’t) TRUE. We did this with a byte-sized boolean variable using the cmp instruction, but it is impossible to use cmp for testing just a single bit of a byte. For this reason, there is a test instruction. It takes the same arguments as mov, xor, and, cmp, etc.

It ands it’s operands and then sets flags accordingly, so that if the result of anding them was 0 then je will jump, otherwise (if result wasn’t zero) je won’t jump (and jnz will).

… but it doesn’t modify arg1 and you use jz (jump if zero) and jnz (jump if not zero) conditional jumps. jz jumps if the result of virtual anding (testing) is zero. Similary, jnz jumps, if result is not zero (eg. at least one of the tested bits is non-zero)

NOTE: In fact, jz is the same instruction as je, and jnz is the same as jne; therefore, in our and/cmp example, using jz would be the same as using je.

instruction: test

… all bits but the third one of eight_booleans are anded against 0 (but eight_booleans remains unmodified), which means they are cleared, only the value of bit#3 will remain. The result of this operation will be zero (and jz will jump) only if bit#3 is 0. If it’s 1, the result of the operation will be 00001000b, not 0, so jz won’t jump.

… bits #3 and #5 of eight_booleans will remain, so the result of the operation will be 0 (and je will jump) only when both these bits are 0. If at least one of these bits is 1 the result won’t be 0 (it can be 00001000b, 00100000b or 00101000b) and jz won’t jump. But testing two bits at once is not usual practice, at least not for beginners, I gave this example just to provide a better picture of how test works.

7. Arithmetic Instructions: More on Flags

In this chapter you will learn how to perform basic math operations in assembly language. Then we’ll look deeper into how the processor carries them out, and in doing so yoi’ll learn more about flags.

7.1. Addition and substraction

The simplest case of addition is addition by one, called “increment”. For example, if we increment a variable holding value 5, it will contain 6, etc.

The instruction to perform increment is inc (its name should be obvious). It has one operand, which tells what should be incremented (ie: to what will 1 be added). The operand can be a register or a memory variable. It can’t be a constant, obviously, because such an instruction (even if it existed) wouldn’t have any effect. Example of increment:

… I think it should be self-explaining (if it isn’t, you’ll see later why).

Substracting 1 from a value is called “decrement”. Decrement is the opposite of increment. The instruction which performs decrement is dec. Example:

terms: increment (adding 1), decrement (substracting 1)

instructions: inc, dec

If you wan’t to add or substract more than 1, you can use more incs or decs, but that is a rather ugly way to do it, requires more typing, and the code gets big and slow. So there is instruction which can add any value, this instruction is add. It takes two arguments, the first one is the destination (ie: the value being added to), and the second one is the value to be added. Argument types are the same as for mov: the first one can be a register or a memory variable, the second one can be a constant, a register or a memory variable (only if the first one isn’t memory variable! Always remember: a single instruction can’t access two memory locations). Example:

The instruction for substracting is sub. It’s the exact opposite of add, but it’s used the same way:

7.2. Overflows

There are some cases with addition and substraction which I haven’t yet mentioned. For example, if you try to add 10 to a byte-sized variable holding 250 (the biggest number a byte-sized variable can hold is 255). In such cases, we say that an overflow has occured.

But the question is, what happens to the result of an operation that has overflown? When the upper limit of a variable is crossed, the result of the operation will be the rest of the value to be added. We can say that the operation will be “wrapped” from maximum value to minimal value. For example:

There is also another case, when the result of the operation falls below the lower limit (which is 0 for all variable sizes). In this case the result of the operation will be wrapped from the lower limit to the upper limit. This case is called underflow. For example:

NOTE: The word oveflow is usually used for both overflow and underflow.

Terms: Overflow, Underflow

We also need to know how to check if an overflow has occured after performing an operation, to prevent bugs. For this purpose, flags are used. I already mentioned flags in Chapter 5.3. We used flags for checking the results of comparison at conditional jumps, and I also said that there shouldn’t be any instrcutions between a comparison and its jumps because many instructions change the flags (of course, you can place an instruction there if you are sure it won’t change any needed flag). Arithmetic instructions add and sub use a flags’ bit called CF (carry flag). If an overflow occurs, they set it to 1, otherwise they set it to 0. You can test the carry flag with conditional jumps jc and jnc (see Chapter 5.3 about conditional jumps). jc jumps if the carry flag is set, jnc jumps if the carry flag is not set. Here is an example of testing overflows:

carry flag (CF) — One bit (flag) of “flags” register.

conditional jump instructions: jc, jnc

7.3. Zero Flag

Instructions inc and dec don’t set CF, so you can’t test for overflows using CF with them. But there is another rule that can be used to prevent overflows with inc and dec. This rule is that when the result of an operation is zero, the flag called “zero flag” (ZF) is set. This flag is tested with jz (jump if zero flag is set) and jnz (jump if zero flag is clear) conditional jump instructions.

NOTE: You can optimize the previous code example to:

  org 256
  mov cx,5
  mov dl,'a'
  mov ah,2
here:
  int 21h
  dec cx
  jnz here

  int 20h

since the value of dl and ah isn’t changed anywhere in the loop, we don’t need to set them each time the loop repeats.

Not only add and sub instructions set ZF if result is zero (and clear it otherwise). All basic arithmetic instructions do this. So far, you’ve learned these arithmetic instructions: add, sub, and, xor and or. So, after any of these instruction, ZF tells you if destination (first argument) of the operation holds 0. For example, You can use this behavior to check if the value of a register is 0. So far, you’ve learnt to do this with:

… or won’t change ax, because 1 ored with 1 is 1, and 0 ored with 0 is 0. (Read again Chapter 6 if you aren’t following this.) Btw, this was used on older computers because such code is faster and a few bytes smaller than with cmp.

7.4. Carry flag: more binary arithmetic instructions

I mentioned the carry flag a little in connection with overflows. But CF is really a general-purpose flag because it can be tested easily (jc, jnc and a few more), and its value can be easily set. You will find many more uses of CF later on.

How to set CF? There are two instructions for this: stc and clc. stc stands for “SeT Carry”, and it “sets” the carry flag (ie: sets its value to 1) — so jc performs a jump, and jnc doesn’t, etc., etc. (you should understand this aleady). Instruction clc (CLear Carry) clears CF.

Once we know how to work with CF, we can learn the rest of bit arihmetic operations. First, let’s look at shl. It shift the bits of a register to the left, ie: 0th bit becomes 1st, 1st becomes 2nd, and so on. The last bit (7th in a byte, 15th in a word) is moved to CF. The first bit becomes 0. This way (if the highest bit was zero) we have multiplied the shifted register by 2.

Let me explain why the number is multipied by 2. If you remember the beginning of Chapter 6, you know that a number before shifting is:

Therefore, if the highest bit is zero the number is multiplied by two. This way we can easily multiply by powers of two (2, 2^2=4, 2^3=8, 2^4=16, etc.). Furthermore, the highest bit is stored in CF, so we can test with jc and jnc if the multiplication overflowed.

Usually we want to shift more than once (multiply by 4, 8, 16, …), so shl takes a second argument, which tells how many times we want to shift. If we shift by a number greater than 1, CF will contain 1 if ANY of the discarded bits (x highest bits, where x is the number of shifts) contained 1. This way we can still check for overflows. If you are beginner, don’t worry too much about checking for overflows, you probably won’t do it anyway :) (and therefore your program will probably contain bugs).

There is one limitation to shl: its arguments don’t follow the same rules as the other instructions you’ve learned (mov, add, etc.) The fisrt argument can be a register or a memory location, but the second one can only be a numeric constant or the CL register (really, no other!).

NOTE: Orignially, with 8086 (that’s 086, first of 80x86 series known as x86, like 286 or 486), there was only a shl instruction which could shift by one, and so for example shl ax,3 was compiled into 3 shls. There also wasn’t any shifting by register, you had to make a loop for that. Fortunately 80286 had shifting by constant and by CL, so it is OK now.

We’ve dealt with left shifting, but there is also another type of shifting, ie: shifting to the right. I hope you can by now imagine what it does, so I’ll drop just few notes about it. The instruction that performs this is shr (shift right). Its effect is division by two (or powers of two) without remainder. When shifting right by two, the remainder (0 or 1) is then found in CF; apart from this, CF beheaves like shifting left by a number greater than two: If the remainder isn’t 0 (ie: at least one of the discarded bits was 1) then CF is set, otherwise it is clear.

7.5. Some examples

At least we are now able to print the output of a number (print the number on the screen). It’s a pity that we can only write it in binary form. So here is our task: Write a program that outputs any binary number. For now, we will hardcode the number into the program, ie: move it into some register as a constant. Here is the source:

I hope you understand this, it’s quite simple. At each loop we shift the BX register left by one, so the upper bit is moved to CF, then we print ‘0’ or ‘1’ depending on the value of CF (previously the upper bit of the number) and continue to loop until we’ve printed 16 digits (because a word has 16 bits). Example of stepping through the code:

In my opinion, if you made it up to this point, having (generally) understood everything, you can consider yourself more than just a beginner — congratulations!!! There is still much to learn to become a well-armed assembly programmer, but now you have a solid grounding from which to start expanding your knowledge – with or without use of this tutorial. (But there are several parts which will be explained in further detail, which are hard to find in any tutorial).