# Department of Electrical and Computer Engineering 

The University of Texas at Austin

EE 460N Fall 2022
Instructor: Yale N. Patt
TAs: Kayvan Mansoorshahi, Michael Chen
Exam 2
November 16, 2022

Name: $\qquad$

Problem 1 (20 points): $\qquad$
Problem 2 (10 points): $\qquad$
Problem 3 (15 points): $\qquad$
Problem 4 (25 points): $\qquad$

Problem 5 (30 points): $\qquad$

Total (100 points): $\qquad$

Note: Please be sure that your answers to all questions (and all supporting work that is required) are contained in the space provided.
Note: Please be sure your name is recorded on each sheet of the exam.

Please read the following sentence, and if you agree, sign where requested:
I have not given nor received any unauthorized help on this exam.

Signature: $\qquad$

Name: $\qquad$

Problem 1 ( 20 points): Answer the following questions.
Note: For each of the four answers below, if you leave the box empty, you will receive one point of the five.

Part a (5 points): Various storage structures are accessed in various ways. One way is the content addressable memory (CAM) access, in which part of the address is contained within the element being accessed. We have studied two storage structures that are accessed in this way. What are they?
$\square$
$\square$
Part b ( 5 points): Consider a cache that is physically indexed, physically tagged. What must be true if the TLB can be accessed at the same time as the tag store? Use 15 words or fewer.

Part c (5 points): A page fault often requires a page of virtual memory to be evicted from the frame of physical memory it is occupying in order to provide space for the page that needs to be brought into physical memory. What should be done with the evicted page. Please be complete but concise. No more than 20 words total.

Part d (5 points): Page mode allows a DRAM access to take less time than not-page-mode. What must be true for the access to be in page mode?
$\square$
How does page mode save time?

Name: $\qquad$

Problem 2 ( 10 points): An 8-bit floating point number, with its BIAS set in the spirit of the IEEE standard has a sign bit, 3 bits of fraction, and 4 bits of exponent.

| Sign | Exponent | Fraction |
| :--- | :--- | :--- |

Answer the following questions for this floating point format.
Part a. (2 points):What's the smallest positive normalized number that can be represented exactly? Answer with a fraction.


Part b. (3 points):What is the smallest positive number that can be represented exactly? Answer with a fraction.


There exists a number N such that every value greater than N will be represented by N or positive infinity.
Part c. (2 points): What is the value N (Express as a number)

Part d. ( $\mathbf{3}$ points): What determines whether values greater than N are represented by N or positive infinity. Explain in fewer than ten words.

Name: $\qquad$

## Problem 3 (15 points):

Consider a 64B, physically addressed, write-back cache for a CPU with byte-addressable memory. Assume LRU replacement. Initially all cache lines are invalid.

## Part a (5 points):

The processor makes the following consecutive memory accesses. Note the table shows physical addresses (i.e., after the translation has been made.)

|  | Physical Address in binary |
| :---: | :---: |
| 1 | 10010000 |
| 2 | 00110000 |
| 3 | 11110000 |
| 4 | 11111000 |
| 5 | 10010010 |
| 6 | 00110100 |
| 7 | 01000000 |

Assume the cache is fully-associative. Five of the accesses require going to main memory. What is line size of the cache?

## Part b (5 points):

If we change the associativity from fully associative to direct-mapped, but leave the line size unchanged, what is the cache hit ratio for processing the seven accesses?

## Part c (5 points):

Increasing associativity will usually reduce the number of conflict misses. If we now change the associativity from direct-mapped to set associative, but leave the line size unchanged, what is the minimum associativity that will minimize the number of conflict misses? Which accesses hit with this associativity?

Minimum associativity: $\square$

Accesses that hit:

Name: $\qquad$

Problem 4 ( 25 points):
A memory system has the following specification:

- Byte addressable memory.
- 1 channel and 1 rank.
- 4 banks per chip.
- 64 rows per bank.
- 4 columns per row.
- 64-bit (8-byte) memory data bus.
- A page mode access takes 1 cycle. A non-page mode access takes 3 cycles.


## Part a (1 points):

Given the above specification, how many bits is the physical address?

Next step: we wish to determine the format of the memory address. The generic structure is shown below.

| Other Bits | Row and Bank Bits | Byte on Bus |
| :---: | :---: | :---: |

We do this by executing five load/store instructions in program order. Each takes one cycle to fetch, one cycle to decode, and at the start of the third cycle, the physical address is sent to the memory controller. There are no dependencies associated with any of the five instructions. The first instruction is fetched in cycle 1 , decoded in cycle 2, accesses the memory controller in cycle 3 , and completes the instruction at the end of cycle 5 . The second instruction is fetched in cycle 2 , decoded in cycle 3 , accesses the memory controller in cycle 4 , and if there is no bank conflict, completes the instruction at the end of cycle 6 . The remaining instructions follow the same process. If there is a bank conflict, the subsequent access starts the cycle after the conflicting access completes.
The table below shows the cycle in which each load/store instruction completes execution.

| Physical Address | Cycle Finished |
| :---: | :---: |
| 0000_0011_0111_0101 | 5 |
| $0000 \_0011 \_0101 \_1111$ | 6 |
| $0000 \_1000 \_0010 \_1011$ | 9 |
| $0000 \_1000 \_1010 \_0011$ | 10 |
| $0000 \_0101 \_1010 \_0001$ | 13 |

## Part b (4 points):

Given the above accesses how many row buffer hits do we observe?

## Part c (10 points):

Your job is to identify with the letters B (bank), C (column), R (row), BoB (Byte on Bus), etc. each bit of the memory address in the diagram below. We have provided you with 16 bits. Use as many as you need. Recall the restrictions in the format diagram above.


## Part d (10 points):

Can this set of accesses be sped up by rearranging the bits of the physical address? If so, explain and write your new address format in the set of boxes below. If not, explain why not.


Name: $\qquad$

## Problem 5 ( 30 points):

We wish to add VAX-like virtual memory support to the LC-3b ISA with the following specifications:

- 16-bit virtual addresses.
- The memory management system uses the two-level page table scheme like VAX.
- Virtual memory is partitioned into two halves. User space starts at x 0000 and system space starts at x 8000 .
- There is 16 KB of physical memory.
- The page size is 256 bytes.
- The user page table starts at the beginning of the page. The system page table starts at the beginning of a frame.
- A PTE contains 16 bits and has the following form.

PTE:

| Valid | Read <br> Permission | Write <br> Permission | 0s | PFN |
| :---: | :---: | :---: | :---: | :---: |
| 1 bit |  |  |  |  |

where a " 1 " in a permission bit means that user programs have permission to make that access.

## Part a (6 points):

How many pages does system space contain?

How many pages does user space contain?

How many frames does main memory contain?

Part b (8 points):
Shown below is a VAX translation process diagram. Fill in the blanks in the table below with what X, Y, Z, and W represent (not actual values).


We wish to execute the instruction LDW R1, R0, \#0 with $\mathbf{R 0}=\mathbf{x 3 F C 8}$ initially.
Assume there is no TLB. Some of the physical memory values are provided below in ascending order. Assume all physical memory accesses of the VAX address translation during the execution of the LDW instruction can be found below. You do not need to worry about instruction fetch. No exceptions occur during the execution of this instruction.

| Physical Address | Word Value |
| :---: | :---: |
| x053F | x802F |
| x057E | x802F |
| x07C8 | x 1126 |
| x 133 F | x 0019 |
| x 17 A 0 | $\mathrm{xC005}$ |
| x 19 C 8 | x 0324 |
| x 245 C | xE 03 C |
| x 2 D 7 E | $\mathrm{xC03A}$ |
| x 2 FC 8 | x 0422 |
| x 3168 | x 6013 |
| x 38 B 2 | xE 02 D |
| x 3 C 7 E | x 6007 |
| $\mathrm{x} 3 \mathrm{AC8}$ | x 0928 |
| x 3 C 3 F | xE 007 |

Physical Memory

## Part c (16 points):

Fill in the blanks in the VAX translation process diagram with actual values in hexadecimal. Recall: The user page table starts at the beginning of the page. The system page table starts at the beginning of a frame.


