HW2 Instructions
Exercise 2.3
[5] <§§2.2, 2.3> For the following C statement, write the corresponding RISC-V assembly code. Assume that the variables f, g, h, i
and j
are assigned to registers x5, x6, x7, x28
and x29
, respectively. Assume that the base address of the arrays A
and B
are in registers x10
and x11
, respectively.
1 |
|
Exercise 2.4
[10] <§§2.2, 2.3> For the RISC-V assembly instructions below, what is the corresponding C statement?Assume that the variables f, g, h, i
and j
are assigned to registers x5, x6, x7, x28
and x29
, respectively. Assume that the base address of the arrays A
and B
are in registers x10
and x11
, respectively.
1 |
|
Exercise 2.5
[5] <§2.3> Show how the value 0xabcdef12
would be arranged in memory of a little-endian and a big-endian machine. Assume the data are stored starting at address 0 and that the word size is 4 bytes.
little-endian | big-endian | |
---|---|---|
Address(Byte) | Data | Data |
3 | 0xab |
0x12 |
2 | 0xcd |
0xef |
1 | 0xef |
0xcd |
0 | 0x12 |
0xab |
Exercise 2.11
Assume that x5
holds the value .
2.11.1
[5] <§2.4> For the instruction add x30, x5, x6
, what is the range(s) of values for x6
that would result in overflow?
There will be an overflow if either x5
x6
or x5
x6
. That’s to say,
x6
x6
Given that x6
is between and , x6
.
2.11.2
[5] <§2.4> For the instruction sub x30, x5, x6
, what is the range(s) of values for x6
that would result in overflow?
There will be an overflow if either x5
x6
or x5
x6
. That’s to say,
x6
x6
Given that x6
is between and , x6
.
2.11.3
[5] <§2.4> For the instruction sub x30, x6, x5
, what is the range(s) of values for x6
that would result in overflow?
There will be an overflow if either x6
x5
or x6
x5
. That’s to say,
x6
x6
Given that x6
is between and , x6
.
Exercise 2.12
[5] <§§2.2, 2.5> Provide the instruction type and assembly language instruction for the following binary value:
Hint: Figure 2.20 may be helpful.
The operation code is 0110011
, which tells the instruction type is R-type. The R-type format is
funct7 | rs2 | rs1 | funct3 | rd | opcode |
---|---|---|---|---|---|
7 bits | 5 bits | 5 bits | 3 bits | 5 bits | 7 bits |
0000 000 | 0 0001 | 0000 1 | 000 | 00001 | 0110011 |
The assembly code is
1 |
|
Exercise 2.13
[5] <§§2.2, 2.5> Provide the instruction type and hexadecimal representation of the following instruction:
1 |
|
The instruction type is S-type.
The hexadecimal representation is 0x025F3023
.
Exercise 2.23
Consider a proposed new instruction named rpt
. This instruction combines a loop’s condition check and counter decrement into a single instruction. For example rpt x29, loop
would do the following:
1 |
|
2.23.1
[5] <§2.7, 2.10> If this instruction were to be added to the RISC-V instruction set, what is the most appropriate instruction format?
The most appropriate instruction format is J(UJ)-type.
2.23.2
[5] <§2.7> What is the shortest sequence of RISC-V instructions that performs the same operation?
1 |
|
Exercise 2.24
Consider the following RISC-V loop:
1 |
|
2.24.1
[5] <§2.7> Assume that the register x6
is initialized to the value 10. What is the final value in register x5
assuming the x5
is initially zero?
The final value in register x5
is 20.
2.24.2
[5] <§2.7> For the loop above, write the equivalent C code. Assume that the registers x5
and x6
are integers acc
and i
, respectively.
1 |
|
2.24.3
[5] <§2.7> For the loop written in RISC-V assembly above, assume that the register x6
is initialized to the value N. How many RISC-V instructions are executed?
For every loop there are 4 instructions are executed. There will be N loops if x6
is initialized to the value N. and in the loop there is still an instruction to jump out of the loop. Therefore, there instructions to be executed in total.
2.24.4
[5] <§2.7> For the loop written in RISC-V assembly above, replace the instruction beq x6, x0, DONE
with the instruction blt x6, x0, DONE
and write the equivalent C code.
1 |
|
Exercise 2.25
[10] <§2.7> Translate the following C code to RISC-V assembly code. Use a minimum number of instructions. Assume that the values of a, b, i
and j
are in registers x5, x6, x7
and x29
respectively. Also, assume that register x10
holds the base address of the array D
.
1 |
|
1 |
|
Exercise 2.26
[5] <§2.7> How many RISC-V instructions does it take to implement the C code from Exercise 2.25? If the variables a
and b
are initialized to 10 and 1 and all elements of D
are initially 0, what is the total number of RISC-V instructions executed to complete the loop?
There are 12 instructions. If the variables a
and b
are initialized to 10 and 1 and all elements of D
are initially 0, instructions are to be executed in total.
Exercise 2.31
[20] <§2.8> Translate function f into RISC-V assembly language. Assume the function declaration for g
is int g(int a, int b)
. The code for function f is as follows:
1 |
|
1 |
|
Exercise 2.35
Consider the following code:
1 |
|
Assume that the register x7
contains the address 0×10000000
and the data at address is 0x1122334455667788
.
2.35.1
[5] <§2.3, 2.9> What value is stored in 0x10000008
on a big-endian machine?
2.35.2
[5] <§2.3, 2.9> What value is stored in 0x10000008
on a little-endian machine?
big-endian | little-endian | |
---|---|---|
Address(Byte) | Data | Data |
0x10000008 |
0x11 |
0xFF |
0x10000007 |
0x00 |
0xFF |
0x10000006 |
0x00 |
0xFF |
0x10000005 |
0x00 |
0xFF |
0x10000004 |
0x00 |
0xFF |
0x10000003 |
0x00 |
0xFF |
0x10000002 |
0x00 |
0xFF |
0x10000001 |
0x00 |
0x88 |
0x10000000 |
0x11 |
0x88 |
Exercise 2.36
[5] <§2.10> Write the RISC-V assembly code that creates the 64-bit constant 0x1122334455667788
and stores that value to register x10
.
1 |
|
Exercise 2.40
Assume that for a given program 70% of the executed instructions are arithmetic, 10% are load/store, and 20% are branch.
2.40.1
[5] <§§1.6, 2.13> Given this instruction mix and the assumption that an arithmetic instruction requires two cycles, a load/store instruction takes six cycles, and a branch instruction takes three cycles, find the average CPI.
2.40.2
[5] <§§1.6, 2.13> For a 25% improvement in performance, how many cycles, on average, may an arithmetic instruction take if load/store and branch instructions are not improved at all?
Hence,
2.40.3
[5] <§§1.6, 2.13> For a 50% improvement in performance, how many cycles, on average, may an arithmetic instruction take if load/store and branch instructions are not improved at all?
Hence,
HW3 Arithmetic
Exercise 3.1
[5] <§3.2> What is 5ED4 − 07A4 when these values represent unsigned 16bit hexadecimal numbers? The result should be written in hexadecimal. Show your work.
Exercise 3.5
[5] <§3.2> What is 4365 − 3412 when these values represent signed 12-bit octal numbers stored in sign-magnitude format? The result should be written in octal. Show your work.
Exercise 3.8
[5] <§3.2> Assume 185 and 122 are signed 8-bit decimal integers stored in sign-magnitude format. Calculate 185 − 122. Is there overflow, underflow, or neither?
There is an overflow.
Exercise 3.9
[10] <§3.2> Assume 151 and 214 are signed 8-bit decimal integers stored in two’s complement format. Calculate 151 + 214 using saturating arithmetic. The result should be written in decimal. Show your work.
Exercise 3.13
[20] <§3.3> Using a table similar to that shown in Figure 3.6, calculate the product of the hexadecimal unsigned 8-bit integers 62 and 12 using the hardware described in Figure 3.5. You should show the contents of each register on each step.
iteration | step | multiplicand | multiplier/ product |
---|---|---|---|
0 | initial values | 110 010 |
000 000 001 010 |
1 | no op | 110 010 |
000 000 001 010 |
Right shift Product | 110 010 |
000 000 000 101 |
|
2 | Product += Mcand | 110 010 |
110 010 000 101 |
Right shift Product | 110 010 |
011 001 000 010 |
|
3 | no op | 110 010 |
011 001 000 010 |
Right shift Product | 110 010 |
001 100 100 001 |
|
4 | Product += Mcand | 110 010 |
111 110 100 001 |
Right shift Product | 110 010 |
011 111 010 000 |
|
5 | no op | 110 010 |
011 111 010 000 |
Right shift Product | 110 010 |
001 111 101 000 |
|
6 | no op | 110 010 |
001 111 101 000 |
Right shift Product | 110 010 |
000 111 110 100 |
The answer is 0764.
Exercise 3.19
[30] <§3.4> Using a table similar to that shown in Figure 3.10, calculate 74 divided by 21 using the hardware described in Figure 3.11. You should show the contents of each register on each step. Assume A and B are unsigned 6-bit integers. This algorithm requires a slightly different approach than that shown in Figure 3.9. You will want to think hard about this, do an experiment or two, or else go to the web to figure out how to make this work correctly.
Hint: one possible solution involves using the fact that Figure 3.11 implies the remainder register can be shifted either direction.
iteration | step | Divisor | Remainder/Quotient |
---|---|---|---|
0 | initial values | 010 001 |
000 000 111 100 |
1 | Left shift Remainder(Rem) | 010 001 |
000 001 111 000 |
Rem -= Divisor | 010 001 |
111 000 111 000 |
|
Rem < 0, then add back | 010 001 |
000 001 111 000 |
|
2 | Left shift Rem | 010 001 |
000 011 110 000 |
Rem -= Divisor | 010 001 |
110 010 110 000 |
|
Rem < 0, then add back | 010 001 |
000 011 110 000 |
|
3 | Left shift Rem | 010 001 |
000 111 100 000 |
Rem -= Divisor | 010 001 |
110 110 100 000 |
|
Rem < 0, then add back | 010 001 |
000 111 100 000 |
|
4 | Left shift Rem | 010 001 |
001 111 000 000 |
Rem -= Divisor | 010 001 |
111 110 000 000 |
|
Rem < 0, then add back | 010 001 |
001 111 000 000 |
|
5 | Left shift Rem | 010 001 |
011 110 000 000 |
Rem -= Divisor | 010 001 |
001 101 000 000 |
|
Rem > 0, then R0 = 1 | 010 001 |
001 101 000 001 |
|
6 | Left shift Rem | 010 001 |
011 010 000 010 |
Rem -= Divisor | 010 001 |
001 001 000 010 |
|
Rem > 0, then R0 = 1 | 010 001 |
001 001 000 011 |
Exercise 3.20
[5] <§3.5> What decimal number does the bit pattern 0x0C000000
represent if it is a two’s complement integer? An unsigned integer?
- two’s complement:
- unsigned integer:
Exercise 3.21
[10] <§3.5> If the bit pattern 0x0C000000
is placed into the Instruction Register, what RISC-V instruction will be executed?
sorry, i don’t know.
Exercise 3.22
[10] <§3.5> What decimal number does the bit pattern 0x0C000000
represent if it is a floating point number? Use the IEEE 754 standard.
- exponent:
000 1100 0
- 127 = 24 - 127 = -103 - fraction = 0
- answer =
Exercise 3.23
[10] <§3.5> Write down the binary representation of the decimal number 63.25 assuming the IEEE 754 single precision format.
- 63.23 =
111111.01
= - sign = 0
- exp = 5 + 127 = 132
- answer =
0 1000 0100 1111 1010 0000 0000 0000 000
=0x427D0000
Exercise 3.24
[10] <§3.5> Write down the binary representation of the decimal number 63.25 assuming the IEEE 754 double precision format.
- sign = 0
- exp = 5 + 1023 = 1028
- answer =
0 100 0000 0100 1111 1010 0000 ... 0000
=0x404FA00000000000
Exercise 3.27
[20] <§3.5> IEEE 754-2008 contains a half precision that is only 16 bits wide. The leftmost bit is still the sign bit, the exponent is 5 bits wide and has a bias of 15, and the mantissa is 10 bits long. A hidden 1 is assumed. Write down the bit pattern to represent assuming a version of this format, which uses an excess-16 format to store the exponent. Comment on how the range and accuracy of this 16-bit floating point format compares to the single precision IEEE 754 standard.
- =
-0.00101
= - sign = 1
- exp = -3 + 15 = 12
- answer =
1 01100 0100 0000 00
Exercise 3.29
[20] <§3.5> Calculate the sum of and by hand, assuming A and B are stored in the 16-bit half precision described in Exercise 3.27. Assume 1 guard, 1 round bit, and 1 sticky bit, and round to the nearest even. Show all the steps.