# **CORE-V-Docs Documentation**

**Davide Schiavone** 

Mar 29, 2022

# CONTENTS:

| 1 | Changelog                           | 3               |
|---|-------------------------------------|-----------------|
|   | 1.1 0.3.0                           | 3               |
|   | 1.2 0.2.0                           | 3               |
|   | 1.3 0.1.0                           | 3               |
| 2 |                                     | 5               |
|   |                                     | 5               |
|   | 2.2 Standards Compliance            | 6               |
|   | 2.3 Synthesis guidelines            | 8               |
|   | 2.3.1 ASIC Synthesis                | 8               |
|   | 2.3.2 FPGA Synthesis                | 8               |
|   |                                     | 9               |
|   |                                     | 9<br>9          |
|   |                                     | 9               |
|   |                                     | 9<br>10         |
|   |                                     | .0              |
| 3 | 8                                   | 11              |
|   | 3.1 Clock Gating Cell               | 11              |
| 4 | Core Integration                    | 13              |
|   | 4.1 Instantiation Template          | 13              |
|   | 4.2 Parameters                      | 15              |
|   | 4.3 Interfaces                      | 17              |
| 5 | Pipeline Details 1                  | 19              |
|   |                                     | 19              |
|   | 5.2 Hazards                         | 21              |
|   |                                     |                 |
| 6 |                                     | <b>23</b><br>24 |
|   |                                     | 24<br>24        |
|   | 0.2 Flotocol                        | 24              |
| 7 |                                     | 27              |
|   | 0                                   | 28              |
|   |                                     | 28              |
|   | 7.3         Write buffer         2  | 29              |
| 8 | Physical Memory Attribution (PMA) 3 | 31              |
|   |                                     | 31              |
|   | 8.2 Main memory vs I/O              | 31              |

|    | 8.3<br>8.4<br>8.5 | Bufferable and Cacheable       32         Atomic operations       32         Default attribution       32                                                                     |
|----|-------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 9  | Pogie             | ter File 33                                                                                                                                                                   |
| ,  | 9.1               | General Purpose Register File   33                                                                                                                                            |
| 10 | eXter             | asion Interface 35                                                                                                                                                            |
|    | 10.1              | CORE-V-XIF                                                                                                                                                                    |
|    |                   | 10.1.1 Compressed interface                                                                                                                                                   |
|    |                   | 10.1.2 Issue interface                                                                                                                                                        |
|    |                   | 10.1.3 Commit interface                                                                                                                                                       |
|    |                   | 10.1.4 Memory (request/response) interface                                                                                                                                    |
|    |                   | 10.1.5 Memory result interface                                                                                                                                                |
|    |                   | 10.1.6 Result interface                                                                                                                                                       |
|    | 10.2              | Integration                                                                                                                                                                   |
|    | 10.3              | Timing 38                                                                                                                                                                     |
| 11 | Fono              | e.i external handshake 41                                                                                                                                                     |
| 11 | генсо             | 41                                                                                                                                                                            |
| 12 | Sleep             |                                                                                                                                                                               |
|    | 12.1              | Startup behavior                                                                                                                                                              |
|    | 12.2              | WFI 44                                                                                                                                                                        |
| 13 | Cont              | rol and Status Registers 45                                                                                                                                                   |
| 10 |                   | CSR Map                                                                                                                                                                       |
|    |                   | CSR Descriptions                                                                                                                                                              |
|    | 10.2              | 13.2.1         Jump Vector Table (jvt)         47                                                                                                                             |
|    |                   | 13.2.2         Machine Status (mstatus)         48                                                                                                                            |
|    |                   | 13.2.3 Machine ISA (misa)                                                                                                                                                     |
|    |                   | 13.2.4 Machine Interrupt Enable Register (mie) - SMCLIC == $0$                                                                                                                |
|    |                   | 13.2.5 Machine Interrupt Enable Register (mie) - SMCLIC == 1                                                                                                                  |
|    |                   | 13.2.6 Machine Trap-Vector Base Address (mtvec) - SMCLIC == 0                                                                                                                 |
|    |                   | 13.2.7 Machine Trap-Vector Base Address (mtvec) - SMCLIC == 1                                                                                                                 |
|    |                   | 13.2.8 Machine Trap Vector Table Base Address (mtvt) 51                                                                                                                       |
|    |                   | 13.2.9 Machine Status (mstatush) 52                                                                                                                                           |
|    |                   | 13.2.10 Machine Counter-Inhibit Register (mcountinhibit)                                                                                                                      |
|    |                   | 13.2.11 Machine Performance Monitoring Event Selector (mhpmevent3 mhpmevent31) 52                                                                                             |
|    |                   | 13.2.12 Machine Scratch (mscratch)                                                                                                                                            |
|    |                   | 13.2.13 Machine Exception PC (mepc)                                                                                                                                           |
|    |                   | 13.2.14 Machine Cause (mcause) - SMCLIC == $0$                                                                                                                                |
|    |                   | 13.2.15 Machine Cause (mcause) - SMCLIC == $1 \dots $                   |
|    |                   | 13.2.16 Machine Trap Value (mtval)                                                                                                                                            |
|    |                   | 13.2.17 Machine Interrupt Pending Register (mip) - SMCLIC == $0 \dots $ |
|    |                   | 13.2.18 Machine Interrupt Pending Register (mip) - SMCLIC == 1                                                                                                                |
|    |                   | 13.2.19 Machine Next Interrupt Handler Address and Interrupt Enable (mnxti) 55                                                                                                |
|    |                   | 13.2.20 Machine Interrupt Status (mintstatus)                                                                                                                                 |
|    |                   | 13.2.21 Machine Interrupt-Level Threshold (mintthresh) 56                                                                                                                     |
|    |                   | 13.2.22 Machine Scratch Swap for Priv Mode Change (mscratchcsw)                                                                                                               |
|    |                   | 13.2.23 Machine Scratch Swap for Interrupt-Level Change (mscratchcswl)                                                                                                        |
|    |                   | 13.2.24 CLIC Base (mclicbase) 57                                                                                                                                              |
|    |                   | 13.2.25 Trigger Select Register (tselect)                                                                                                                                     |
|    |                   | 13.2.26 Trigger Data 1 (tdata1)                                                                                                                                               |
|    |                   | 13.2.27         Match Control Type 6 (mcontrol6)         58                                                                                                                   |
|    |                   | 13.2.28 Exception Trigger (etrigger)                                                                                                                                          |

|    |       | 13.2.29 Trigger Data Register 2 (tdata2)                                              | 60       |
|----|-------|---------------------------------------------------------------------------------------|----------|
|    |       | 13.2.30 Trigger Data Register 3 (tdata3)                                              | 60       |
|    |       | 13.2.31 Trigger Info (tinfo)                                                          | 61       |
|    |       | 13.2.32 Trigger Control (tcontrol)                                                    | 61       |
|    |       | 13.2.33 Machine Context Register (mcontext)                                           | 61       |
|    |       | 13.2.34 Machine Supervisor Context Register (mscontext)                               | 62       |
|    |       | 13.2.35 Debug Control and Status (dcsr)                                               | 62       |
|    |       | 13.2.36 Debug PC (dpc)                                                                | 63       |
|    |       | 13.2.37 Debug Scratch Register $0/1$ (dscratch $0/1$ )                                | 63       |
|    |       | 13.2.38 Machine Cycle Counter (mcycle)                                                | 63       |
|    |       | 13.2.39 Machine Instructions-Retired Counter (minstret)                               | 63       |
|    |       | 13.2.40 Machine Performance Monitoring Counter (mpmcounter3 mpmcounter31)             | 64       |
|    |       | 13.2.41 Upper 32 Machine Cycle Counter (mcycleh)                                      | 64       |
|    |       | 13.2.42 Upper 32 Machine Instructions-Retired Counter (minstreth)                     | 64       |
|    |       | 13.2.43 Upper 32 Machine Performance Monitoring Counter (mhmcounter3h mhpmcounter31h) |          |
|    |       | 13.2.44 Machine Vendor ID (mvendorid)                                                 | 65       |
|    |       | 13.2.45 Machine Architecture ID (marchid)                                             | 65       |
|    |       |                                                                                       | 65       |
|    |       | 13.2.46 Machine Implementation ID (mimpid)                                            | 66       |
|    |       | 13.2.47 Hardware Thread ID (mhartid)                                                  | 66       |
|    |       | 13.2.48 Machine Configuration Pointer (mconfigptr)                                    |          |
|    |       | 13.2.49 Cycle Counter (cycle)                                                         | 66       |
|    |       | 13.2.50 Instructions-Retired Counter (instret)                                        | 66       |
|    |       | 13.2.51 Performance Monitoring Counter (hpmcounter3 hpmcounter31)                     | 67       |
|    |       | 13.2.52 Upper 32 Cycle Counter (cycleh)                                               | 67       |
|    |       | 13.2.53 Upper 32 Instructions-Retired Counter (instreth)                              | 67       |
|    |       | 13.2.54 Upper 32 Performance Monitoring Counter (hpmcounter3h hpmcounter31h)          | 67       |
| 14 | Donfo | ormance Counters                                                                      | 69       |
| 14 | 14.1  |                                                                                       | 69       |
|    |       |                                                                                       | 70       |
|    |       | Controlling the counters from software                                                | 70       |
|    |       | Parametrization at synthesis time                                                     | 70       |
|    | 14.4  | Time Registers (time(h))                                                              | 70       |
| 15 | Exce  | ptions and Interrupts                                                                 | 71       |
| 10 |       | Basic Interrupt Architecture                                                          | 71       |
|    | 15.1  | 15.1.1 Interrupt Interface                                                            | 72       |
|    |       | 15.1.2 Interrupts                                                                     | 72       |
|    |       |                                                                                       | 73       |
|    | 15.2  | 15.1.3       Nested Interrupt Handling         CLIC Interrupt Architecture            | 74       |
|    | 13.2  | 15.2.1 Interrupt Interface                                                            | 74       |
|    |       |                                                                                       | 74       |
|    |       | 1                                                                                     | 75       |
|    | 15.3  | 1 0                                                                                   | 75<br>75 |
|    |       | Non Maskable Interrupts                                                               | 75       |
|    | 15.4  | Exceptions                                                                            | 15       |
| 16 | Dehu  | ig & Trigger                                                                          | 77       |
| 10 |       | Interface                                                                             | 78       |
|    | 16.2  | Core Debug Registers                                                                  | 78       |
|    | 16.3  | Debug state                                                                           | 79       |
|    | 16.4  | EBREAK Behavior                                                                       | 79<br>79 |
|    | 10.4  |                                                                                       |          |
|    |       | 16.4.1 Scenario 1 : Enter Exception                                                   | 79       |
|    |       | 16.4.2 Scenario 2 : Enter Debug Mode                                                  | 80       |
|    |       | 16.4.3 Scenario 3 : Exit Program Buffer & Restart Debug Code                          | 80       |
| 17 | RISC  | C-V Formal Interface                                                                  | 81       |
|    |       |                                                                                       |          |

# iii

|     | 17.1 New Additions                                |  | 81 |  |  |  |  |  |  |
|-----|---------------------------------------------------|--|----|--|--|--|--|--|--|
|     | 17.2 Compatibility                                |  |    |  |  |  |  |  |  |
|     | 17.3 Trace output file                            |  |    |  |  |  |  |  |  |
|     | 17.4 Trace output format                          |  | 86 |  |  |  |  |  |  |
| 18  | CORE-V Instruction Set Extensions                 |  | 87 |  |  |  |  |  |  |
| 19  | Core Versions and RTL Freeze Rules                |  | 89 |  |  |  |  |  |  |
|     | 19.1 What happens after RTL Freeze?               |  | 89 |  |  |  |  |  |  |
|     | 19.1.1 A bug is found                             |  |    |  |  |  |  |  |  |
|     | 19.1.2 RTL changes on non-verified yet parameters |  | 89 |  |  |  |  |  |  |
|     | 19.1.3 PPA optimizations and new features         |  |    |  |  |  |  |  |  |
|     | 19.2 Released core versions                       |  | 90 |  |  |  |  |  |  |
| 20  | Glossary                                          |  | 91 |  |  |  |  |  |  |
| Bił | Bibliography                                      |  |    |  |  |  |  |  |  |

Editor: Davide Schiavone davide@openhwgroup.org

### ONE

### CHANGELOG

#### 1.1 0.3.0

Released on 2022-03-29 - GitHub

### 1.2 0.2.0

Released on 2022-03-18 - GitHub

# 1.3 0.1.0

Released on 2022-02-16 - GitHub

TWO

# INTRODUCTION



CV32E40X is a 4-stage in-order 32-bit RISC-V processor core. Figure 2.1 shows a block diagram of the core.

Figure 2.1: Block Diagram of CV32E40X RISC-V Core

# 2.1 License

Copyright 2020 OpenHW Group.

Copyright 2018 ETH Zurich and University of Bologna.

Copyright and related rights are licensed under the Solderpad Hardware License, Version 0.51 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://solderpad. org/licenses/SHL-0.51. Unless required by applicable law or agreed to in writing, software, hardware and materials distributed under this License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

# 2.2 Standards Compliance

CV32E40X is a standards-compliant 32-bit RISC-V processor. It follows these specifications:

Many features in the RISC-V specification are optional, and CV32E40X can be parameterized to enable or disable some of them.

CV32E40X supports one of the following base integer instruction sets from [RISC-V-UNPRIV].

| Base Integer Instruction Set                      | Version                | Configurability                            |  |  |  |  |  |  |
|---------------------------------------------------|------------------------|--------------------------------------------|--|--|--|--|--|--|
| RV32I: RV32I Base Integer Instruction Set         | 2.1                    | optionally enabled with the RV32 parameter |  |  |  |  |  |  |
| <b>RV32E</b> : RV32E Base Integer Instruction Set | 1.9 (not ratified yet) | optionally enabled with the RV32 parameter |  |  |  |  |  |  |

Table 2.1: CV32E40X Base Instruction Set

In addition, the following standard instruction set extensions are available from [RISC-V-UNPRIV], [RISC-V-ZBA\_ZBB\_ZBC\_ZBS], [RISC-V-CRYPTO] and [RISC-V-ZCA\_ZCB\_ZCMB\_ZCMP\_ZCMT].

| Standard Extension                                                    | Version               | Configurability    |
|-----------------------------------------------------------------------|-----------------------|--------------------|
| C: Standard Extension for Compressed Instructions                     | 2.0                   | always enabled     |
| M: Standard Extension for Integer Multiplication and Division         | 2.0                   | optionally enabled |
|                                                                       |                       | with the M_EXT     |
|                                                                       |                       | parameter          |
| Zicntr: Standard Extension for Base Counters and Timers               | 2.0                   | always enabled     |
| Zihpm: Standard Extension for Hardware Performance Counters           | 2.0                   | always enabled     |
| Zicsr: Control and Status Register Instructions                       | 2.0                   | always enabled     |
| Zifencei: Instruction-Fetch Fence                                     | 2.0                   | always enabled     |
| Zca: Subset of the standard Zc Code-Size Reduction extension          | v0.70.1 (not ratified | optionally enabled |
| consisting of a subset of C with the FP load/stores removed.          | yet; version will     | with the ZC_EXT    |
|                                                                       | change)               | parameter          |
| Zcb: Subset of the standard Zc Code-Size Reduction extension          | v0.70.1 (not ratified | optionally enabled |
| consisting of simple operations.                                      | yet; version will     | with the ZC_EXT    |
|                                                                       | change)               | parameter          |
| Zcmb: Subset of the standard Zc Code-Size Reduction extension         | v0.70.1 (not ratified | optionally enabled |
| consisting of load/store byte/half which overlap with c.fld, c.fldsp, | yet; version will     | with the ZC_EXT    |
| c.fsd.                                                                | change)               | parameter          |
| Zcmp: Subset of the standard Zc Code-Size Reduction exten-            | v0.70.1 (not ratified | optionally enabled |
| sion consisting of push/pop and double move which overlap with        | yet; version will     | with the ZC_EXT    |
| c.fsdsp.                                                              | change)               | parameter          |
| Zcmt: Subset of the standard Zc Code-Size Reduction extension         | v0.70.1 (not ratified | optionally enabled |
| consisting of table jump.                                             | yet; version will     | with the ZC_EXT    |
|                                                                       | change)               | parameter          |
| A: Atomic Instructions                                                | 2.1                   | optionally enabled |
|                                                                       |                       | with the A_EXT     |
|                                                                       |                       | parameter          |
| Zba: Bit Manipulation Address calculation instructions                | Version 1.0.0         | optionally enabled |
|                                                                       |                       | with the B_EXT     |
|                                                                       |                       | parameter          |
| <b>Zbb</b> : Bit Manipulation Base instructions                       | Version 1.0.0         | optionally enabled |
|                                                                       |                       | with the B_EXT     |
|                                                                       |                       | parameter          |
| Zbc: Bit Manipulation Carry-Less Multiply instructions                | Version 1.0.0         | optionally enabled |
|                                                                       |                       | with the B_EXT     |
|                                                                       |                       | parameter          |
| Zbs: Bit Manipulation Bit set, Bit clear, etc. instructions           | Version 1.0.0         | optionally enabled |
|                                                                       |                       | with the B_EXT     |
|                                                                       |                       | parameter          |
| Zkt: Data Independent Execution Latency                               | Version 1.0.0         | always enabled     |
| Zbkc: Constant time Carry-Less Multiply                               | Version 1.0.0         | optionally enabled |
|                                                                       |                       | with the B_EXT     |
|                                                                       |                       | parameter          |
| Zmmul: Multiplication subset of the M extension                       | Version 0.1           | optionally enabled |
|                                                                       |                       | with the M_EXT     |
|                                                                       | 1                     | parameter          |

| Table 2.2: 0 | CV32E40X | Standard | Instruction | Set Extensions |
|--------------|----------|----------|-------------|----------------|
|--------------|----------|----------|-------------|----------------|

The following custom instruction set extensions are available.

| Custom Extension      | Version                              | Configurability                           |
|-----------------------|--------------------------------------|-------------------------------------------|
| Xif: eXtension Inter- | 0.1 (not finalized yet; version will | optionally enabled with the X_EXT parame- |
| face                  | change)                              | ter                                       |

| Table 2.3: CV32E40X Custom Instruction Set Extensions |
|-------------------------------------------------------|
|-------------------------------------------------------|

**Note:** CV32E40X does not implement the **F** extension for single-precision floating-point instructions internal to the core. The **F** extension can be supported by interfacing the CV32E40X to an external FPU via the eXtension interface.

Most content of the RISC-V privileged specification is optional. CV32E40X currently supports the following features according to the RISC-V Privileged Specification [RISC-V-PRIV].

- M-Mode
- All CSRs listed in Control and Status Registers
- Base Counters, Timers and Hardware Performance Counters as described in *Performance Counters* controlled by the NUM\_MHPMCOUNTERS parameter
- Trap handling supporting direct mode or vectored mode as described at Exceptions and Interrupts
- Physical Memory Attribution (PMA) as described in *Physical Memory Attribution (PMA)*

#### 2.3 Synthesis guidelines

The CV32E40X core is fully synthesizable. It has been designed mainly for ASIC designs, but FPGA synthesis is supported as well.

All the files in the rtl and rtl/include folders are synthesizable. The top level module is called cv32e40x\_core.

The user must provide a clock-gating module that instantiates the clock-gating cells of the target technology. This file must have the same interface and module name of the one provided for simulation-only purposes at bhv/cv32e40x\_sim\_clock\_gate.sv (see *Clock Gating Cell*).

The constraints/cv32e40x\_core.sdc file provides an example of synthesis constraints. No synthesis scripts are provided.

#### 2.3.1 ASIC Synthesis

ASIC synthesis is supported for CV32E40X. The whole design is completely synchronous and uses positive-edge triggered flip-flops. A technology specific implementation of a clock gating cell as described in *Clock Gating Cell* needs to be provided.

#### 2.3.2 FPGA Synthesis

FPGA synthesis is supported for CV32E40X. The user needs to provide a technology specific implementation of a clock gating cell as described in *Clock Gating Cell*.

### 2.4 Verification

The verification environment (testbenches, testcases, etc.) for the CV32E40X core can be found at core-v-verif. It is recommended that you start by reviewing the CORE-V Verification Strategy.

# 2.5 Contents

- Getting Started with CV32E40X discusses the requirements and initial steps to start using CV32E40X.
- *Core Integration* provides the instantiation template and gives descriptions of the design parameters as well as the input and output ports.
- CV32E40X Pipeline described the overal pipeline structure.
- The instruction and data interfaces of CV32E40X are explained in *Instruction Fetch* and *Load-Store-Unit (LSU)*, respectively.
- Physical Memory Attribution (PMA) describes the Physical Memory Attribution (PMA) unit.
- The register-file is described in *Register File*.
- eXtension Interface describes the custom eXtension interface.
- *Sleep Unit* describes the Sleep unit.
- The control and status registers are explained in Control and Status Registers.
- *Performance Counters* gives an overview of the performance monitors and event counters available in CV32E40X.
- Exceptions and Interrupts deals with the infrastructure for handling exceptions and interrupts.
- Debug & Trigger gives a brief overview on the debug infrastructure.
- RISC-V Formal Interface gives a brief overview of the RVFI module.
- Glossary provides definitions of used terminology.

#### 2.6 History

CV32E40X started its life as a fork of the CV32E40P from the OpenHW Group <a href="https://www.openhwgroup.org">https://www.openhwgroup.org</a>>.

#### 2.7 References

- Gautschi, Michael, et al. "Near-Threshold RISC-V Core With DSP Extensions for Scalable IoT Endpoint Devices." in IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 25, no. 10, pp. 2700-2713, Oct. 2017
- Schiavone, Pasquale Davide, et al. "Slow and steady wins the race? A comparison of ultra-low-power RISC-V cores for Internet-of-Things applications." 27th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS 2017)

### 2.8 Contributors

Andreas Traber (\*atraber@iis.ee.ethz.ch\*)

Michael Gautschi (\*gautschi@iis.ee.ethz.ch\*) Pasquale Davide Schiavone (\*pschiavo@iis.ee.ethz.ch\*) Arjan Bink (\*arjan.bink@silabs.com\*) Paul Zavalney (\*paul.zavalney@silabs.com\*)

Micrel Lab and Multitherman Lab University of Bologna, Italy

Integrated Systems Lab ETH Zürich, Switzerland

#### THREE

### **GETTING STARTED WITH CV32E40X**

This page discusses initial steps and requirements to start using CV32E40X in your design.

# 3.1 Clock Gating Cell

CV32E40X requires clock gating cells. These cells are usually specific to the selected target technology and thus not provided as part of the RTL design. A simulation-only version of the clock gating cell is provided in cv32e40x\_sim\_clock\_gate.sv. This file contains a module called cv32e40x\_clock\_gate that has the following ports:

- clk\_i: Clock Input
- en\_i: Clock Enable Input
- scan\_cg\_en\_i: Scan Clock Gate Enable Input (activates the clock even though en\_i is not set)
- clk\_o: Gated Clock Output

And the following Parameters: \* LIB : Standard cell library (semantics defined by integrator)

Inside CV32E40X, the clock gating cell is used in cv32e40x\_sleep\_unit.sv.

The cv32e40x\_sim\_clock\_gate.sv file is not intended for synthesis. For ASIC synthesis and FPGA synthesis the manifest should be adapted to use a customer specific file that implements the cv32e40x\_clock\_gate module using design primitives that are appropriate for the intended synthesis target technology.

# **CORE INTEGRATION**

The main module is named  $cv32e40x\_core$  and can be found in  $cv32e40x\_core.sv$ . Below, the instantiation template is given and the parameters and interfaces are described.

# 4.1 Instantiation Template

| v32e40x_core #(               |     |           |          | <br> | <br> |          |               |                   |
|-------------------------------|-----|-----------|----------|------|------|----------|---------------|-------------------|
| .LIB                          | (   | ٥         | ),       |      |      |          |               |                   |
| .RV32                         | (   | RV32I     |          |      |      |          |               |                   |
| .A_EXT                        | Ć   |           | ),       |      |      |          |               |                   |
| .B_EXT                        | (   | B_NONE    |          |      |      |          |               |                   |
| .M_EXT                        | (   |           | ),       |      |      |          |               |                   |
| .X_EXT                        | (   |           | ),<br>), |      |      |          |               |                   |
| .X_NUM_RS                     | (   |           |          |      |      |          |               |                   |
|                               | (   |           | ),       |      |      |          |               |                   |
| .X_ID_WIDTH                   | (   |           | ),       |      |      |          |               |                   |
| .X_MEM_WIDTH                  | (   | 32        |          |      |      |          |               |                   |
| .X_RFR_WIDTH                  | (   | 32        |          |      |      |          |               |                   |
| .X_RFW_WIDTH                  | (   | 32        |          |      |      |          |               |                   |
| .X_MISA                       | (   | 32'h0     |          |      |      |          |               |                   |
| .X_ECS_XS                     | (   | 2'b0      |          |      |      |          |               |                   |
| .ZC_EXT                       | (   |           | ),       |      |      |          |               |                   |
| .DBG_NUM_TRIGGERS             | (   |           | ),       |      |      |          |               |                   |
| .NUM_MHPMCOUNTERS             | (   |           | ),       |      |      |          |               |                   |
| .PMA_NUM_REGIONS              | (   |           | ),       |      |      |          |               |                   |
| .PMA_CFG                      | ( ] | PMA_CFG[] |          |      |      |          |               |                   |
| .SMCLIC                       | (   |           | ),       |      |      |          |               |                   |
| .SMCLIC_ID_WIDTH              | (   | 5         | )        |      |      |          |               |                   |
| u_core (                      |     |           |          |      |      |          |               |                   |
| <pre>// Clock and reset</pre> |     |           |          |      |      |          |               |                   |
| .clk_i                        | О,  |           |          |      |      |          |               |                   |
| .rst_ni                       | О,  |           |          |      |      |          |               |                   |
| .scan_cg_en_i                 | О,  |           |          |      |      |          |               |                   |
|                               |     |           |          |      |      |          |               |                   |
| <pre>// Configuration</pre>   |     |           |          |      |      |          |               |                   |
| .boot_addr_i                  | О,  |           |          |      |      |          |               |                   |
| .mtvec_addr_i                 | О,  |           |          |      |      |          |               |                   |
| .dm_halt_addr_i               | О,  |           |          |      |      |          |               |                   |
| .dm_exception_addr_i          | (), |           |          |      |      |          |               |                   |
| .mhartid_i                    | О,  |           |          |      |      |          |               |                   |
|                               |     |           |          |      |      | (continu | (continues on | (continues on nex |

|                                   |            | (continued from previous page |
|-----------------------------------|------------|-------------------------------|
| .mimpid_patch_i                   | (),        |                               |
| // Instruction memory             | nterface   |                               |
| .instr_req_o                      | О,         |                               |
| .instr_gnt_i                      | (),        |                               |
| .instr_addr_o                     | О,         |                               |
| .instr_memtype_o                  | Ο,         |                               |
| .instr_prot_o                     | Ο,         |                               |
| .instr_dbg_o                      | Ο,         |                               |
| .instr_rvalid_i                   | 0,         |                               |
| .instr_rdata_i                    | 0,         |                               |
| .instr_err_i                      | Ο,         |                               |
| // Data memory interfac           | е          |                               |
| .data_req_o                       | (),        |                               |
| .data_gnt_i                       | 0,         |                               |
| .data_addr_o                      | (),        |                               |
| .data_atop_o                      | 0,         |                               |
| .data_be_o                        | (),        |                               |
| .data_memtype_o                   | (),<br>(), |                               |
| .data_prot_o                      | (),<br>(), |                               |
| .data_dbg_o                       | (),<br>(), |                               |
| .data_wdata_o                     | (),<br>(), |                               |
| .data_we_o                        | (),<br>(), |                               |
| .data_we_0<br>.data_rvalid_i      |            |                               |
|                                   | (),        |                               |
| .data_rdata_i                     | (),        |                               |
| .data_err_i                       | (),        |                               |
| .data_exokay_i                    | (),        |                               |
| // Cycle Count                    |            |                               |
| .mcycle_o                         | (),        |                               |
| <pre>// eXtension interface</pre> |            |                               |
| .xif_compressed_if                | (),        |                               |
| .xif_issue_if                     | О,         |                               |
| .xif_commit_if                    | Ο,         |                               |
| .xif_mem_if                       | Ο,         |                               |
| .xif_mem_result_if                | Ο,         |                               |
| .xif_result_if                    | Ο,         |                               |
| <pre>// Interrupt interface</pre> |            |                               |
| .irq_i                            | (),        |                               |
| .clic_irq_i                       | О,         |                               |
| .clic_irq_id_i                    | (),        |                               |
| .clic_irq_level_i                 | О,         |                               |
| .clic_irq_priv_i                  | О,         |                               |
| .clic_irq_shv_i                   | Ο,         |                               |
| // Fencei flush handsha           | ke         |                               |
| .fencei_flush_req_o               | О,         |                               |
| .fencei_flush_ack_i               | О,         |                               |

(continued from previous page)

|    | <pre>// Debug interface .debug_req_i .debug_havereset_o .debug_running_o .debug_halted_o</pre> | (),<br>(),<br>(),<br>(), |
|----|------------------------------------------------------------------------------------------------|--------------------------|
| ); | <pre>// Special control sign .fetch_enable_i .core_sleep_o</pre>                               | nals<br>(),<br>()        |

### 4.2 Parameters

**Note:** All eXtension interface parameters (X\_NUM\_RS, X\_ID\_WIDTH, X\_MEM\_WIDTH, X\_RFR\_WIDTH and X\_RFW\_WIDTH) must be set with values matching the actual if\_xif instance and the coprocessor/interconnect available outside of CV32E40X.

| Name     | Type/Ra    | ın ge- | Description                                                                               |  |
|----------|------------|--------|-------------------------------------------------------------------------------------------|--|
|          |            | fault  |                                                                                           |  |
| LIB      | int        | 0      | Standard cell library (semantics defined by integrator)                                   |  |
| RV32     | rv32_e     | RV32I  | Base Integer Instruction Set. RV32 = RV32I: RV32I Base Integer Instruction Set.           |  |
|          |            |        | RV32 = RV32E: RV32E Base Integer Instruction Set.                                         |  |
| A_EXT    | bit        | 0      | Enable Atomic Instruction (A) support (not implemented yet)                               |  |
| B_EXT    | b_ext_e    | B_NON  | Enable Bit Manipulation support. B_EXT = B_NONE: No Bit Manipulation instru-              |  |
|          |            |        | tions are supported. B_EXT = ZBA_ZBB_ZBS: Zba, Zbb and Zbs are supported.                 |  |
|          |            |        | B_EXT = ZBA_ZBB_ZBC_ZBS: Zba, Zbb, Zbc and Zbs are supported.                             |  |
| M_EXT    | m_ext_e    | М      | Enable Multiply / Divide support. M_EXT = M_NONE: No multiply / divide instruc-           |  |
|          |            |        | tions are supported. M_EXT = ZMMUL: The multiplication subset of the M extension          |  |
|          |            |        | is supported. $M_EXT = M$ : The M extension is supported.                                 |  |
| X_EXT    | bit        | 0      | Enable eXtension Interface (X) support, see <i>eXtension Interface</i>                    |  |
| X_NUM_F  | Sint       | 2      | Number of register file read ports that can be used by the eXtension interface.           |  |
|          | (23)       |        |                                                                                           |  |
| X_ID_WI  |            | 4      | Identification width for the eXtension interface.                                         |  |
|          | (332)      |        |                                                                                           |  |
| X_MEM_W  | VIDTH(32   | 32     | Memory access width for loads/stores via the eXtension interface.                         |  |
|          | 64,        | -      | · · · • • · · · · · · · · · · · · · · ·                                                   |  |
|          | 128,       |        |                                                                                           |  |
|          | 256)       |        |                                                                                           |  |
| X_RFR_W  | VIDTH32,   | 32     | Register file read access width for the eXtension interface.                              |  |
|          | 64)        | -      |                                                                                           |  |
| X_RFW_W  | ,          | 32     | Register file write access width for the eXtension interface.                             |  |
|          | 64)        | _      |                                                                                           |  |
| X_MISA   | logic      | 32'h0  | MISA extensions implemented on the eXtension interface, see Machine ISA (misa).           |  |
|          | [31:0]     |        | X_MISA can only be used to set a subset of the following: {P, V, F, D, Q, X, M}.          |  |
| X_ECS_X  |            | 2'b0   | Default value for mstatus.XS if X_EXT = 1, see <i>Machine Status (mstatus)</i> .          |  |
|          | [1:0]      |        | _ , , , , , , , , , , , , , , , , , , ,                                                   |  |
| ZC_EXT   | bit        | 0      | Enable Zca, Zcb, Zcmb, Zcmp, Zcmt extension support.                                      |  |
| NUM_MHF  | MGOUNTER   | RS1    | Number of MHPMCOUNTER performance counters, see <i>Performance Counters</i>               |  |
|          | (029)      |        | 1 , 5                                                                                     |  |
| DBG_NUM  | 1_iFRIGGER | RSI    | Number of debug triggers, see Debug & Trigger                                             |  |
|          | (04)       |        |                                                                                           |  |
| PMA_NUM  | LREGIONS   | 5 0    | Number of PMA regions                                                                     |  |
|          | (016)      |        | C                                                                                         |  |
| PMA_CFC  |            | PMA R  | _IDENEAU of ma_cfg_t with PMA_NUM_REGIONS entries, see                                    |  |
| _        |            |        | Physical Memory Attribution (PMA)                                                         |  |
| SMCLIC   | int        | 0      | Is Smclic supported?                                                                      |  |
| -        | (01)       |        | 11                                                                                        |  |
| SMCLIC   | IDntWIDTH  | 15     | Width of clic_irq_id_i and clic_irq_id_o. The maximum number of sup-                      |  |
| <b>-</b> | (110       | -      | ported interrupts in CLIC mode is 2^SMCLIC_ID_WIDTH. Trap vector table align-             |  |
|          |            |        | ment is restricted as described in <i>Machine Trap Vector Table Base Address (mtvt)</i> . |  |
|          | /          |        |                                                                                           |  |

# 4.3 Interfaces

| Sig-<br>nal(s)                                                               | Width                                        | Dir            | Description                                                                                                                                                                                                                                                                                                                 |
|------------------------------------------------------------------------------|----------------------------------------------|----------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| clk_i                                                                        | 1                                            | in             | Clock signal                                                                                                                                                                                                                                                                                                                |
| rst_ni                                                                       | 1                                            | in             | Active-low asynchronous reset                                                                                                                                                                                                                                                                                               |
| scan_co                                                                      |                                              | in             | Scan clock gate enable. Design for test (DfT) related signal. Can be used during scan testing operation to force instantiated clock gate(s) to be enabled. This signal should be 0 during normal / functional operation.                                                                                                    |
| boot_ad                                                                      |                                              | in             | Boot address. First program counter after reset = boot_addr_i. Must be word aligned. Do not change after enabling core via fetch_enable_i                                                                                                                                                                                   |
| mtvec_a                                                                      | addr_i                                       | in             | mtvec address. Initial value for the address part of <i>Machine Trap-Vector Base Address</i> $(mtvec) - SMCLIC == 0$ . Must be 128-byte aligned (i.e. mtvec_addr_i[6:0] = 0). Do not change after enabling core via fetch_enable_i                                                                                          |
|                                                                              | _32ldr_i                                     |                | Address to jump to when entering Debug Mode, see <i>Debug &amp; Trigger</i> . Must be word aligned. Do not change after enabling core via fetch_enable_i                                                                                                                                                                    |
|                                                                              | ept2ion_a                                    | d <b>dr_</b> i | Address to jump to when an exception occurs when executing code during Debug Mode, see <i>Debug &amp; Trigger</i> . Must be word aligned. Do not change after enabling core via fetch_enable_i                                                                                                                              |
| mharti                                                                       | _                                            | in             | Hart ID, usually static, can be read from Hardware Thread ID (mhartid) CSR                                                                                                                                                                                                                                                  |
| _                                                                            | _p&tch_i                                     |                | Implementation ID patch. Must be static. Readable as part of <i>Machine Implementation ID (mimpid)</i> CSR.                                                                                                                                                                                                                 |
|                                                                              |                                              |                | nterface, see Instruction Fetch                                                                                                                                                                                                                                                                                             |
|                                                                              |                                              |                | terface, see Load-Store-Unit (LSU)                                                                                                                                                                                                                                                                                          |
| -                                                                            | •                                            | ounter Ou      | 1                                                                                                                                                                                                                                                                                                                           |
| irq_* Interrupt inputs, see Exceptions and Interrupts                        |                                              |                |                                                                                                                                                                                                                                                                                                                             |
| clic_*_iCLIC interface, see Exceptions and Interrupts                        |                                              |                |                                                                                                                                                                                                                                                                                                                             |
| debug_ <sup>3</sup>                                                          | debug_* Debug interface, see Debug & Trigger |                |                                                                                                                                                                                                                                                                                                                             |
| fetch_e                                                                      | enable_i                                     | in             | Enable the instruction fetch of CV32E40X. The first instruction fetch after reset de-<br>assertion will not happen as long as this signal is 0. fetch_enable_i needs to be set<br>to 1 for at least one cycle while not in reset to enable fetching. Once fetching has been<br>enabled the value fetch_enable_i is ignored. |
| core_sl                                                                      | _eep_o                                       | out            | Core is sleeping, see <i>Sleep Unit</i> .                                                                                                                                                                                                                                                                                   |
| xif_com                                                                      | nper%etsesneid                               | unicompre      | essed interface, see Compressed interface                                                                                                                                                                                                                                                                                   |
| xif_issueXtefision issue interface, see Issue interface                      |                                              |                |                                                                                                                                                                                                                                                                                                                             |
| xif_commeXteinstion commit interface, see Commit interface                   |                                              |                |                                                                                                                                                                                                                                                                                                                             |
| xif_mem_dixtension memory interface, see Memory (request/response) interface |                                              |                |                                                                                                                                                                                                                                                                                                                             |
| xif_mem_executioninfiemory result interface, see Memory result interface     |                                              |                |                                                                                                                                                                                                                                                                                                                             |
| xif_resulXtenfion result interface, see Result interface                     |                                              |                |                                                                                                                                                                                                                                                                                                                             |



Figure 4.1: CV32E40X Pipeline

# **PIPELINE DETAILS**

CV32E40X has a 4-stage in-order completion pipeline, the 4 stages are:

- **Instruction Fetch (IF)** Fetches instructions from memory via an aligning prefetch buffer, capable of fetching 1 instruction per cycle if the instruction side memory system allows. The IF stage also pre-decodes RVC instructions into RV32I base instructions. See *Instruction Fetch* for details.
- **Instruction Decode (ID)** Decodes fetched instruction and performs required register file reads. Jumps are taken from the ID stage.
- **Execute (EX)** Executes the instructions. The EX stage contains the ALU, Multiplier and Divider. Branches (with their condition met) are taken from the EX stage. Multi-cycle instructions will stall this stage until they are complete. The address generation part of the load-store-unit (LSU) is contained in EX as well.
- Writeback (WB) Writes the result of ALU, Multiplier, Divider, or Load instructions instructions back to the register file.

# 5.1 Multi- and Single-Cycle Instructions

Table 5.1 shows the cycle count per instruction type. Some instructions have a variable time, this is indicated as a range e.g. 1..32 means that the instruction takes a minimum of 1 cycle and a maximum of 32 cycles. The cycle counts assume zero stall on the instruction-side interface and zero stall on the data-side memory interface.

|        |                                                  | counts per instruction type                                                                                          |
|--------|--------------------------------------------------|----------------------------------------------------------------------------------------------------------------------|
| In-    | Cycles                                           | Description                                                                                                          |
| struc- |                                                  |                                                                                                                      |
| tion   |                                                  |                                                                                                                      |
| Туре   |                                                  |                                                                                                                      |
| Inte-  | 1                                                | Integer Computational Instructions are defined in the                                                                |
| ger    |                                                  | RISCV-V RV32I Base Integer Instruction Set.                                                                          |
| Com-   |                                                  | C                                                                                                                    |
| pu-    |                                                  |                                                                                                                      |
| ta-    |                                                  |                                                                                                                      |
| tional |                                                  |                                                                                                                      |
| CSR    | 4 (mstatus, mepc, mtvec, mcause, mcycle,         | CSR Access Instruction are defined in 'Zicsr' of the RISC-V                                                          |
| Ac-    | minstret, mhpmcounter*, mcycleh, min-            | specification.                                                                                                       |
| cess   | streth, mhpmcounter*h, mcountinhibit, mh-        | -F                                                                                                                   |
| ••••   | pmevent*, dscr, dpc, dscratch0, dscratch1,       |                                                                                                                      |
|        | privlv)                                          |                                                                                                                      |
|        | 1 (all the other CSRs)                           |                                                                                                                      |
| Load/S |                                                  | Load/Store is handled in 1 bus transaction using both EX and                                                         |
| Loudin | 2 (non-word aligned word transfer)               | WB stages for 1 cycle each. For misaligned word transfers                                                            |
|        | 2 (halfword transfer crossing word bound-        | and for halfword transfers that cross a word boundary 2 bus                                                          |
|        | ary)                                             | transactions are performed using EX and WB stages for 2                                                              |
|        | ary)                                             | cycles each.                                                                                                         |
| Mul-   | 1 (mul)                                          | CV32E40X uses a single-cycle 32-bit x 32-bit multiplier                                                              |
| tipli- | 4 (mulh, mulhsu, mulhu)                          | with a 32-bit result. The multiplications with upper-word                                                            |
| ca-    | 4 (muni, munisu, muniu)                          | result take 4 cycles to compute.                                                                                     |
| tion   |                                                  | result take 4 cycles to compute.                                                                                     |
| Di-    | 3 - 35                                           | The number of cycles depends on the divider operand value                                                            |
| vi-    | 3 - 35                                           | (operand b), i.e. in the number of leading bits at 0. The                                                            |
| sion   | 5 - 55                                           | minimum number of cycles is 3 when the divider has zero                                                              |
| Re-    |                                                  | leading bits at 0 (e.g., 0x8000000). The maximum number                                                              |
| main-  |                                                  | of cycles is 35 when the divider is 0                                                                                |
| der    |                                                  | of cycles is 55 when the divider is 0                                                                                |
|        | 2                                                | Jumps are performed in the ID stage. Upon a jump the IF                                                              |
| Jump   | 3 (target is a non-word-aligned non-RVC in-      | stage (including prefetch buffer) is flushed. The new PC re-                                                         |
|        |                                                  | quest will appear on the instruction-side memory interface                                                           |
|        | struction)                                       | the same cycle the jump instruction is in the ID stage.                                                              |
| mrat   | 2                                                |                                                                                                                      |
| mret   |                                                  | Mret is performed in the ID stage. Upon an mret the IF stage (including prefetch buffer) is flushed. The new PC re-  |
|        | 3 (target is a non-word-aligned non-RVC in-      |                                                                                                                      |
|        | struction)                                       | quest will appear on the instruction-side memory interface                                                           |
| Dana   | 1                                                | the same cycle the mret instruction is in the ID stage.                                                              |
| Branch | 1 1                                              | Any branch where the condition is not met will not stall.                                                            |
| (Not-  |                                                  |                                                                                                                      |
| Taken) | 2                                                |                                                                                                                      |
| Branch |                                                  | The EX stage is used to compute the branch decision. Any                                                             |
| (Taken | ) 4 (target is a non-word-aligned non-RVC in-    | branch where the condition is met will be taken from the EX                                                          |
|        | struction)                                       | stage and will cause a flush of the IF stage (including prefetch                                                     |
|        | 54 404011                                        | 1 (Ferry) and 1 (D) of a second                                                                                      |
|        |                                                  | buffer) and ID stage.                                                                                                |
| In-    | 5                                                | The FENCE.I instruction as defined in 'Zifencei' of the                                                              |
| struc- | 5<br>6 (target is a non-word-aligned non-RVC in- | The FENCE.I instruction as defined in 'Zifencei' of the RISC-V specification. Internally it is implemented as a jump |
|        | 5                                                | The FENCE.I instruction as defined in 'Zifencei' of the                                                              |

| Table 5.1: Cycle counts per instru | ction type |
|------------------------------------|------------|
|------------------------------------|------------|

#### 5.2 Hazards

The CV32E40X experiences a 1 cycle penalty on the following hazards.

- Load data hazard (in case the instruction immediately following a load uses the result of that load)
- Jump register (jalr) data hazard (in case that a jalr depends on the result of an immediately preceding non-load instruction)

The CV32E40X experiences a 2 cycle penalty on the following hazards.

• Jump register (jalr) data hazard (in case that a jalr depends on the result of an immediately preceding load instruction)

### **INSTRUCTION FETCH**

The Instruction Fetch (IF) stage of the CV32E40X is able to supply one instruction to the Instruction Decode (ID) stage per cycle if the external bus interface is able to serve one instruction per cycle. In case of executing compressed instructions, on average less than one 32-bit instruction fetch will we needed per instruction in the ID stage.

For optimal performance and timing closure reasons, a prefetcher is used which fetches instructions via the external bus interface from for example an externally connected instruction memory or instruction cache.

The prefetch unit performs word-aligned 32-bit prefetches and stores the fetched words in an alignment buffer with three entries. As a result of this (speculative) prefetch, CV32E40X can fetch up to three words outside of the code region and care should therefore be taken that no unwanted read side effects occur for such prefetches outside of the actual code region.

Table 6.1 describes the signals that are used to fetch instructions. This interface is a simplified version of the interface that is used by the LSU, which is described in *Load-Store-Unit (LSU)*. The difference is that no writes are possible and thus it needs fewer signals.

| Signal                          | Di-            | Description                                                               |
|---------------------------------|----------------|---------------------------------------------------------------------------|
|                                 | rec-           |                                                                           |
|                                 | tion           |                                                                           |
| instr_req_o                     | out-           | Request valid, will stay high until instr_gnt_i is high for one cycle     |
|                                 | put            |                                                                           |
| instr_gnt_i                     | input          | The other side accepted the request. instr_addr_o, instr_memtype_o and    |
|                                 |                | <pre>instr_prot_o may change in the next cycle.</pre>                     |
| instr_addr_o[3                  | 10 <b>0</b> t] | Address, word aligned                                                     |
|                                 | put            |                                                                           |
| instr_memtype_o@1t:0]           |                | Memory Type attributes (cacheable, bufferable)                            |
|                                 | put            |                                                                           |
| instr_prot_o[2                  | :0jt-          | Protection attributes                                                     |
|                                 | put            |                                                                           |
| instr_dbg_o                     | out-           | Debug mode access                                                         |
|                                 | put            |                                                                           |
| <pre>instr_rvalid_i input</pre> |                | instr_rdata_i and instr_err_i are valid when instr_rvalid_i is high. This |
|                                 |                | signal will be high for exactly one cycle per request.                    |
| instr_rdata_i[                  | 3ihp01]        | Data read from memory                                                     |
| instr_err_i                     | input          | An instruction interface error occurred                                   |

| Table 6.1: | Instruction | Fetch | interface | signals |
|------------|-------------|-------|-----------|---------|
|            |             |       |           |         |

# 6.1 Misaligned Accesses

Externally, the IF interface performs word-aligned instruction fetches only. Misaligned instruction fetches are handled by performing two separate word-aligned instruction fetches. Internally, the core can deal with both word- and half-word-aligned instruction addresses to support compressed instructions. The LSB of the instruction address is ignored internally.

# 6.2 Protocol

The instruction bus interface is compliant to the OBI protocol (see [OPENHW-OBI] for detailed signal and protocol descriptions). The CV32E40X instruction fetch interface does not implement the following optional OBI signals: we, be, wdata, auser, wuser, aid, rready, ruser, rid. These signals can be thought of as being tied off as specified in the OBI specification. The CV32E40X instruction fetch interface can cause up to two outstanding transactions.

Figure 6.1 and Figure 6.3 show example timing diagrams of the protocol.

Figure 6.1: Back-to-back Memory Transactions

Figure 6.2: Back-to-back Memory Transactions with bus errors on A2/RD2 and A4/RD4

Figure 6.3: Multiple Outstanding Memory Transactions

Figure 6.4: Multiple Outstanding Memory Transactions with bus error on A1/RD1

#### SEVEN

# LOAD-STORE-UNIT (LSU)

The Load-Store Unit (LSU) of the core takes care of accessing the data memory. Load and stores on words (32 bit), half words (16 bit) and bytes (8 bit) are supported.

Table 7.1 describes the signals that are used by the LSU.

| Signal     | Di-               | Description                                                                                 |
|------------|-------------------|---------------------------------------------------------------------------------------------|
|            | rec-              |                                                                                             |
|            | tion              |                                                                                             |
| data_req_o | o out-            | Request valid, will stay high until data_gnt_i is high for one cycle                        |
|            | put               |                                                                                             |
| data_gnt_: | i in-             | The other side accepted the request. data_addr_o, data_atop_o, data_be_o,                   |
|            | put               | data_memtype_o[2:0], data_prot_o, data_wdata_o, data_we_o may change in the next            |
|            |                   | cycle.                                                                                      |
| data_addr  |                   | OAddress, sent together with data_req_o.                                                    |
|            | put               |                                                                                             |
| data_atop  | _∞[i∄÷0           | ] Atomic attributes, sent together with data_req_o.                                         |
|            | put               |                                                                                             |
| data_be_o  | [ 30a 00 ]        | Byte Enable. Is set for the bytes to write/read, sent together with data_req_o.             |
|            | put               |                                                                                             |
| data_memty | vpæ <u>n</u> te [ | 1 Mogmory Type attributes (cacheable, bufferable), sent together with data_req_o.           |
|            | put               |                                                                                             |
| data_prot_ | _co[i£i÷0         | ] Protection attributes, sent together with data_req_o.                                     |
|            | put               |                                                                                             |
| data_dbg_o | o out-            | Debug mode access, sent together with data_req_o.                                           |
|            | put               |                                                                                             |
| data_wdata | a_cou[t31         | : Data to be written to memory, sent together with data_req_o.                              |
|            | put               |                                                                                             |
| data_we_o  | out-              | Write Enable, high for writes, low for reads. Sent together with data_req_o.                |
|            | put               |                                                                                             |
| data_rval: | idni              | data_rvalid_i will be high for exactly one cycle to signal the end of the response phase of |
|            | put               | for both read and write transactions. For a read transaction data_rdata_i holds valid data  |
|            |                   | when data_rvalid_i is high.                                                                 |
| data_rdata | a_im_{31          | : Data read from memory. Only valid when data_rvalid_i is high.                             |
|            | put               |                                                                                             |
| data_err_: | i in-             | A data interface error occurred. Only valid when data_rvalid_i is high.                     |
|            | put               |                                                                                             |
| data_exoka | ayini             | Exclusive transaction status. Only valid when data_rvalid_i is high.                        |
|            | put               |                                                                                             |

### 7.1 Misaligned Accesses

Misaligned transaction are supported in hardware for Main memory regions, see *Physical Memory Attribution (PMA)*. For loads and stores in Main memory where the effective address is not naturally aligned to the referenced datatype (i.e., on a four-byte boundary for word accesses, and a two-byte boundary for halfword accesses) the load/store is performed as two bus transactions in case that the data item crosses a word boundary. A single load/store instruction is therefore performed as two bus transactions for the following scenarios:

- · Load/store of a word for a non-word-aligned address
- Load/store of a halfword crossing a word address boundary

In both cases the transfer corresponding to the lowest address is performed first. All other scenarios can be handled with a single bus transaction.

Misaligned transactions are not supported in I/O regions and will result in an exception trap when attempted, see *Exceptions and Interrupts*.

### 7.2 Protocol

The data bus interface is compliant to the OBI protocol (see [OPENHW-OBI] for detailed signal and protocol descriptions). The CV32E40X data interface does not implement the following optional OBI signals: auser, wuser, aid, rready, ruser, rid. These signals can be thought of as being tied off as specified in the OBI specification. The CV32E40X data interface can cause up to two outstanding transactions.

The OBI protocol that is used by the LSU to communicate with a memory works as follows.

The LSU provides a valid address on data\_addr\_o, control information on data\_we\_o, data\_be\_o (as well as write data on data\_wdata\_o in case of a store) and sets data\_req\_o high. The memory sets data\_gnt\_i high as soon as it is ready to serve the request. This may happen at any time, even before the request was sent. After a request has been granted the address phase signals (data\_addr\_o, data\_we\_o, data\_be\_o and data\_wdata\_o) may be changed in the next cycle by the LSU as the memory is assumed to already have processed and stored that information. After granting a request, the memory answers with a data\_rvalid\_i set high if data\_rdata\_i is valid. This may happen one or more cycles after the request has been granted. Note that data\_rvalid\_i must also be set high to signal the end of the response phase for a write transaction (although the data\_rdata\_i has no meaning in that case). When multiple granted requests are outstanding, it is assumed that the memory requests will be kept in-order and one data\_rvalid\_i will be signalled for each of them, in the order they were issued.

Figure 7.1, Figure 7.2, Figure 7.3 and Figure 7.4 show example timing diagrams of the protocol.

Figure 7.1: Basic Memory Transaction

Figure 7.2: Back-to-back Memory Transactions

Figure 7.3: Slow Response Memory Transaction

Figure 7.4: Multiple Outstanding Memory Transactions

# 7.3 Write buffer

CV32E40X contains a single entry write buffer that is used for bufferable transfers. A bufferable transfer is a write transfer originating from a store instruction, where the write address is inside a bufferable region defined by the PMA (*Physical Memory Attribution (PMA)*). Note that Store Conditional (SC) and Atomic Memory Operation (AMO) instructions will not utilize the write buffer.

The write buffer (when not full) allows CV32E40X to proceed executing instructions without having to wait for  $data_gnt_i = 1$  and  $data_rvalid_i = 1$  for these bufferable transers.

**Note:** On the OBI interface data\_gnt\_i = 1 and data\_rvalid\_i = 1 still need to be signaled for every transfer (as specified in [OPENHW-OBI]), also for bufferable transfers.

Bus transfers will occur in program order, no matter if transfers are bufferable and non-bufferable. Transactions in the write buffer must be completed before the CV32E40X is able to:

- Retire a fence instruction
- Enter SLEEP mode

CHAPTER

# PHYSICAL MEMORY ATTRIBUTION (PMA)

The CV32E40X includes a Physical Memory Attribution (PMA) unit that allows compile time attribution of the physical memory map. The PMA is configured through the top level parameters PMA\_NUM\_REGIONS and PMA\_CFG[]. The number of PMA regions is configured through the PMA\_NUM\_REGIONS parameter. Valid values are 0-16. The configuration array, PMA\_CFG[], must consist of PMA\_NUM\_REGIONS entries of the type pma\_cfg\_t, defined in cv32e40x\_pkg.sv:

```
typedef struct packed {
   logic [31:0] word_addr_low;
   logic [31:0] word_addr_high;
   logic main;
   logic bufferable;
   logic cacheable;
   logic atomic;
   } pma_cfg_t;
```

In case of address overlap between PMA regions, the region with the lowest index in PMA\_CFG[] will have priority. The PMA can be deconfigured by setting PMA\_NUM\_REGIONS=0. When doing this, PMA\_CFG[] should be left unconnected.

# 8.1 Address range

The address boundaries of a PMA region are set in word\_addr\_low/word\_addr\_high. These contain bits 33:2 of 34bit, word aligned addresses. To get an address match, the transfer address addr must be in the range {word\_addr\_low, 2'b00} <= addr[33:0] < {word\_addr\_high, 2'b00}. Note that addr[33:32] = 2'b00 as the CV32E40X does not support Sv32.

# 8.2 Main memory vs I/O

Memory ranges can be defined as either main (main=1) or I/O (main=0). Code execution is allowed from main memory and main memory is considered to be idempotent. Non-aligned transactions are supported in main memory. Code execution is not allowed from I/O regions and an instruction access fault (exception code 1) is raised when attempting to execute from such regions. I/O regions are considered to be non-idempotent and therefore the PMA will prevent speculative accesses to such regions. Non-aligned transactions are not supported in I/O regions. An attempt to perform a non-naturally aligned load access to an I/O region causes a precise load access fault (exception code 5). An attempt to perform a non-naturally aligned store access to an I/O region causes a precise store access fault (exception code 7).

# 8.3 Bufferable and Cacheable

Accesses to regions marked as bufferable (bufferable=1) will result in the OBI mem\_type[0] bit being set, except if the access was an instruction fetch, a load, or part of an atomic memory operation. Bufferable stores will utilize the write buffer, see *Write buffer*.

Accesses to regions marked as cacheable (cacheable=1) will result in the OBI mem\_type[1] bit being set.

**Note:** The PMA must be configured such that accesses to the external debug module are non-cacheable, to enable its program buffer to function correctly.

# 8.4 Atomic operations

Regions supporting atomic operations can be defined by setting atomic=1. An attempt to perform a Load-Reserved to a region in which Atomic operations are not allowed will cause a precise load access fault (exception code 5). An attempt to perform a Store-Conditional or Atomic Memory Operation (AMO) to a region in which Atomic operations are not allowed will cause a precise store/AMO access fault (exception code 7). Note that the atomic attribute is only used when the RV32A extension is included.

# 8.5 Default attribution

If the PMA is deconfigured (PMA\_NUM\_REGIONS=0), the entire memory range will be treated as main memory (main=1), non-bufferable (bufferable=0), non-cacheable (cacheable=0) and atomics will be supported (atomic=1).

If the PMA is configured (PMA\_NUM\_REGIONS >  $\emptyset$ ), memory regions not covered by any PMA regions are treated as I/O memory (main= $\emptyset$ ), non-bufferable (bufferable= $\emptyset$ ), non-cacheable (cacheable= $\emptyset$ ) and atomics will not be supported (atomic= $\emptyset$ ).

Every instruction fetch, load and store will be subject to PMA checks and failed checks will result in an exception. PMA checks cannot be disabled. See *Exceptions and Interrupts* for details.

#### CHAPTER

## NINE

# **REGISTER FILE**

Source file: rtl/cv32e40x\_register\_file.sv

CV32E40X has 31 32-bit wide registers which form registers x1 to x31. Register x0 is statically bound to 0 and can only be read, it does not contain any sequential logic.

The number of read ports and the number of write ports of the register file depends on the parameter settings of CV32E40X. The register file has two read ports and one write port for the default parameter settings. If  $X_EXT = 1$ , then depending on the other eXtension interface parameters up to three read ports and two write ports can be instantiated. Register file reads are performed in the ID stage. Register file writes are performed in the WB stage.

# 9.1 General Purpose Register File

The general purpose register file is flip-flop-based. It uses regular, positive-edge-triggered flip-flops to implement the registers.

#### CHAPTER

# **EXTENSION INTERFACE**

The eXtension interface, also called CORE-V-XIF, enables extending CV32E40X with (custom or standardized) instructions without the need to change the RTL of CV32E40X itself. Extensions can be provided in separate modules external to CV32E40X and are integrated at system level by connecting them to the eXtension interface.

The eXtension interface provides low latency (tightly integrated) read and write access to the CV32E40X register file. All opcodes which are not used (i.e. considered to be invalid) by CV32E40X can be used for extensions. It is recommended however that custom instructions do not use opcodes that are reserved/used by RISC-V International.

The eXtension interface enables extension of CV32E40X with:

- Custom ALU type instructions.
- Custom load/store type instructions.
- Custom CSRs and related instructions.

Control-Tranfer type instructions (e.g. branches and jumps) are not supported via the eXtension interface.

Note: CV32E40X does for example not implement the  $\mathbf{F}$  (single-precision floating-point),  $\mathbf{P}$  (Packed SIMD) or  $\mathbf{V}$  (Vector) extensions internal to the core. Such extensions are considered good candidates to be implemented as external coprocessor functionality connected via the eXtension interface.

# 10.1 CORE-V-XIF

The eXtension interface of complies to the [OPENHW-XIF] specification. The reader is deferred to [OPENHW-XIF] for explanation of the interface protocol and semantics. Here we only list the top level interface pins to clarify the mapping of CV32E40X's SystemVerilog interfaces to CV32E40X signals.

#### 10.1.1 Compressed interface

Table 10.1 describes the compressed interface signals.

| Signal                     | Туре            | Di-                      | Description                                                           |
|----------------------------|-----------------|--------------------------|-----------------------------------------------------------------------|
|                            |                 | rec-                     |                                                                       |
|                            |                 | tion                     |                                                                       |
| <pre>xif_compressed_</pre> | i <b>f</b> ogic | out-                     | Compressed request valid. Request to uncompress a compressed instruc- |
| compressed_vali            | d               | put                      | tion.                                                                 |
| <pre>xif_compressed_</pre> | i <b>f</b> ogic | in-                      | Compressed request ready. The transactions signaled via               |
| compressed_ready           |                 | put                      | compressed_req and compressed_resp are accepted when                  |
|                            |                 |                          | compressed_valid and compressed_ready are both 1.                     |
| <pre>xif_compressed_</pre> | i fxcompres     | ss <b>od</b> <u>t</u> re | q_Compressed request packet.                                          |
| compressed_req             |                 | put                      |                                                                       |
| <pre>xif_compressed_</pre> | i fxcompres     | sienal-re                | spCompressed response packet.                                         |
| compressed_resp            |                 | put                      |                                                                       |

Table 10.1: Compressed interface signals

# 10.1.2 Issue interface

Table 10.2 describes the issue interface signals.

| Signal                   | Туре      | Di-               | Description                                                               |
|--------------------------|-----------|-------------------|---------------------------------------------------------------------------|
|                          |           | rec-              |                                                                           |
|                          |           | tion              |                                                                           |
| <pre>xif_issue_if.</pre> | logic     | out-              | Issue request valid. Indicates that CV32E40X wants to offload an instruc- |
| issue_valid              |           | put               | tion.                                                                     |
| <pre>xif_issue_if.</pre> | logic     | input             | Issue request ready. The transaction signaled via issue_req and           |
| issue_ready              |           |                   | issue_resp is accepted when issue_valid and issue_ready are both          |
|                          |           |                   | 1.                                                                        |
| <pre>xif_issue_if.</pre> | x_issue_i | eq <u>u</u> t-    | Issue request packet.                                                     |
| issue_req                |           | put               |                                                                           |
| <pre>xif_issue_if.</pre> | x_issue_1 | esip <u>p</u> tit | Issue response packet.                                                    |
| issue_resp               |           |                   |                                                                           |

Table 10.2: Issue interface signals

# **10.1.3 Commit interface**

Table 10.3 describes the commit interface signals.

Table 10.3: Commit interface signals

| Signal   | Туре     | Di-              | Description                                                                               |
|----------|----------|------------------|-------------------------------------------------------------------------------------------|
|          |          | rec-             |                                                                                           |
|          |          | tion             |                                                                                           |
| xif_comm | i togif. | out-             | Commit request valid. Indicates that CV32E40X has valid commit or kill informa-           |
| commit_v | alid     | put              | tion for an offloaded instruction. There is no corresponding ready signal (it is implicit |
|          |          |                  | and assumed 1). The coprocessor shall be ready to observe the commit_valid and            |
|          |          |                  | commit_kill signals at any time coincident or after an issue transaction initiation.      |
| xif_comm | it_ċ61   | n <b>ouit_</b> t | Commit packet.                                                                            |
| commit   |          | put              |                                                                                           |

# 10.1.4 Memory (request/response) interface

Table 10.4 describes the memory (request/response) interface signals.

| Signal                 | Туре   | Di-              | Description                                                                   |  |  |  |
|------------------------|--------|------------------|-------------------------------------------------------------------------------|--|--|--|
|                        |        | rec-             |                                                                               |  |  |  |
|                        |        | tion             |                                                                               |  |  |  |
| <pre>xif_mem_if.</pre> | logic  | in-              | Memory (request/response) valid. Indicates that the coprocessor wants to per- |  |  |  |
| <pre>mem_valid</pre>   |        | put              | form a memory transaction for an offloaded instruction.                       |  |  |  |
| <pre>xif_mem_if.</pre> | logic  | out-             | Memory (request/response) ready. The memory (request/response) signaled via   |  |  |  |
| mem_ready              |        | put              | mem_req is accepted by CV32E40X when mem_valid and mem_ready are both         |  |  |  |
|                        |        |                  | 1.                                                                            |  |  |  |
| <pre>xif_mem_if.</pre> | x_mem_ | reiqn <u>-</u> t | Memory request packet.                                                        |  |  |  |
| mem_req                |        | put              |                                                                               |  |  |  |
| <pre>xif_mem_if.</pre> | x_mem_ | resp <u>t-</u> t | Memory response packet. Response to memory request (e.g. PMA check re-        |  |  |  |
| mem_resp               |        | put              | sponse). Note that this is not the memory result.                             |  |  |  |

| Table 10.4: Memory (request/response) interface signal | Table 10.4: | Memory | (request/resp | onse) interface | signals |
|--------------------------------------------------------|-------------|--------|---------------|-----------------|---------|
|--------------------------------------------------------|-------------|--------|---------------|-----------------|---------|

#### 10.1.5 Memory result interface

Table 10.5 describes the memory result interface signals.

| Table 10.5: | Memory | result interface | signals |
|-------------|--------|------------------|---------|
|-------------|--------|------------------|---------|

| Signal                | Туре      | Di-       | Description                                                                     |
|-----------------------|-----------|-----------|---------------------------------------------------------------------------------|
|                       |           | rec-      |                                                                                 |
|                       |           | tion      |                                                                                 |
| xif_mem_resu          | allogicf. | out-      | Memory result valid. Indicates that CV32E40X has a valid memory result for the  |
| mem_result_v          | valid     | put       | corresponding memory request. There is no corresponding ready signal (it is im- |
|                       |           |           | plicit and assumed 1). The coprocessor must be ready to accept mem_result when- |
|                       |           |           | ever mem_result_valid is 1.                                                     |
| xif_mem_resu          | ılxt_mi€n | n_onetsul | t <u>Memory</u> result packet.                                                  |
| <pre>mem_result</pre> |           | put       |                                                                                 |

# 10.1.6 Result interface

Table 10.6 describes the result interface signals.

Table 10.6: Result interface signals

| Signal                    | Туре   | Di-             | Description                                                                    |  |  |
|---------------------------|--------|-----------------|--------------------------------------------------------------------------------|--|--|
|                           |        | rec-            |                                                                                |  |  |
|                           |        | tion            |                                                                                |  |  |
| <pre>xif_result_if.</pre> | logic  | input           | Result request valid. Indicates that the coprocessor has a valid result (write |  |  |
| result_valid              |        |                 | data or exception) for an offloaded instruction.                               |  |  |
| <pre>xif_result_if.</pre> | logic  | out-            | Result request ready. The result signaled via result is accepted by the        |  |  |
| result_ready              |        | put             | core when result_valid and result_ready are both 1.                            |  |  |
| <pre>xif_result_if.</pre> | x_resu | t <u>i</u> nput | Result packet.                                                                 |  |  |
| result                    |        |                 |                                                                                |  |  |

# **10.2 Integration**

When integrating the eXtension interface, all parameters used by both CV32E40X, the SystemVerilog interface and the coprocessor/interconnect must match. Parameters or localparams should be used at the hierarchy level above CV32E40X as shown in Figure 10.1.



Figure 10.1: eXtenstion interface integration

# 10.3 Timing

For optimal system level performance CV32E40X, the coprocessor(s) and the optional interconnect are advised to adhere to the timing budgets shown in Figure 10.2.

All eXtension interface signals not explicitly covered in Figure 10.2 should follow the generic timing budget that is outlined - 20% for the processor, 20% for the interconnect and 60% for the coprocessor.

The CV32E40X github repository contains a constraints file as seen from the processor: cv32e40x\_core.sdc



Figure 10.2: eXtenstion interface timing budgets

CHAPTER

# **ELEVEN**

# FENCE.I EXTERNAL HANDSHAKE

CV32E40X includes an external handshake that will be exercised upon execution of the fence.i instruction. The handshake is composed of the signals fencei\_flush\_req\_o and fencei\_flush\_ack\_i and can for example be used to flush an externally connected cache.

The fencei\_flush\_req\_o signal will go high upon executing a fence.i instruction once possible earlier store instructions have fully completed (including emptying of the the write buffer). The fencei\_flush\_req\_o signal will go low again the cycle after sampling both fencei\_flush\_req\_o and fencei\_flush\_ack\_i high. Once fencei\_flush\_req\_o has gone low again a branch will be taken to the instruction after the fence.i thereby flushing possibly prefetched instructions.

Fence instructions are not impacted by the distinction between main and I/O regions (defined in *Physical Memory Attribution (PMA)*) and execute as a conservative fence on all operations, ignoring the predecessor and successor fields.

**Note:** If the fence.i external handshake is not used by the environment of CV32E40X, then it is recommended to tie the fencei\_flush\_ack\_i to 1 in order to avoid stalling fence.i instructions indefinitely.

#### CHAPTER

# TWELVE

# **SLEEP UNIT**

Source File: rtl/cv32e40x\_sleep\_unit.sv

The Sleep Unit contains and controls the instantiated clock gate, see *Clock Gating Cell*, that gates clk\_i and produces a gated clock for use by the other modules inside CV32E40X. The Sleep Unit is the only place in which clk\_i itself is used; all other modules use the gated version of clk\_i.

The clock gating in the Sleep Unit is impacted by the following:

- rst\_ni
- fetch\_enable\_i
- wfi instruction

Table 12.1 describes the Sleep Unit interface.

| Table 12.1: | Sleep | Unit | interface | signals |
|-------------|-------|------|-----------|---------|
|-------------|-------|------|-----------|---------|

| Signal   | Di-    | Description                                                                                                      |
|----------|--------|------------------------------------------------------------------------------------------------------------------|
|          | rec-   |                                                                                                                  |
|          | tion   |                                                                                                                  |
| core_sle | epputo | Core is sleeping because of a <b>wfi</b> instruction. If <b>core_sleep_o =</b> 1, then <b>clk_i</b> is gated off |
|          | put    | internally and it is allowed to gate off clk_i externally as well. See WFI for details.                          |

# 12.1 Startup behavior

clk\_i is internally gated off (while signaling core\_sleep\_o = 0) during CV32E40X startup:

- clk\_i is internally gated off during rst\_ni assertion
- clk\_i is internally gated off from rst\_ni deassertion until fetch\_enable\_i = 1

After initial assertion of fetch\_enable\_i, the fetch\_enable\_i signal is ignored until after a next reset assertion.

# 12.2 WFI

The **wfi** instruction can under certain conditions be used to enter sleep mode awaiting a locally enabled interrupt to become pending. The operation of **wfi** is unaffected by the global interrupt bits in **mstatus**.

A wfi will not enter sleep mode, but will be executed as a regular nop, if any of the following conditions apply:

- debug\_req\_i = 1 or a debug request is pending
- The core is in debug mode
- The core is performing single stepping (debug)
- The core has a trigger match (debug)

If a **wfi** causes sleep mode entry, then **core\_sleep\_o** is set to 1 and **clk\_i** is gated off internally. **clk\_i** is allowed to be gated off externally as well in this scenario. A wake-up can be triggered by any of the following:

- A locally enabled interrupt is pending
- A debug request is pending
- Core is in debug mode

Upon wake-up core\_sleep\_o is set to 0, clk\_i will no longer be gated internally, must not be gated off externally, and instruction execution resumes.

If one of the above wake-up conditions coincides with the **wfi** instruction, then sleep mode is not entered and core\_sleep\_o will not become 1.

Figure 12.1 shows an example waveform for sleep mode entry because of a wfi instruction.

Figure 12.1: wfi example

CHAPTER

# THIRTEEN

# **CONTROL AND STATUS REGISTERS**

# 13.1 CSR Map

Table 13.1 lists all implemented CSRs. To columns in Table 13.1 may require additional explanation:

The **Parameter** column identifies those CSRs that are dependent on the value of specific compile/synthesis parameters. If these parameters are not set as indicated in Table 13.1 then the associated CSR is not implemented. If the parameter column is empty then the associated CSR is always implemented.

The **Privilege** column indicates the access mode of a CSR. The first letter indicates the lowest privilege level required to access the CSR. Attempts to access a CSR with a higher privilege level than the core is currently running in will throw an illegal instruction exception. This is largely a moot point for the CV32E40X as it only supports machine and debug modes. The remaining letters indicate the read and/or write behavior of the CSR when accessed by the indicated or higher privilege level:

- **RW**: CSR is **read-write**. That is, CSR instructions (e.g. csrrw) may write any value and that value will be returned on a subsequent read (unless a side-effect causes the core to change the CSR value).
- RO: CSR is read-only. Writes by CSR instructions raise an illegal instruction exception.

Writes of a non-supported value to **WLRL** bitfields of a **RW** CSR do not result in an illegal instruction exception. The exact bitfield access types, e.g. **WLRL** or **WARL**, can be found in the RISC-V privileged specification.

Reads or writes to a CSR that is not implemented will result in an illegal instruction exception.

| CSR Address  | Name          | Privilege | Parameter  | Description                       |  |  |  |
|--------------|---------------|-----------|------------|-----------------------------------|--|--|--|
| Machine CSRs |               |           |            |                                   |  |  |  |
| 0x300        | mstatus       | MRW       |            | Machine Status (lower 32 bits).   |  |  |  |
| 0x301        | misa          | MRW       |            | Machine ISA                       |  |  |  |
| 0x304        | mie           | MRW       |            | Machine Interrupt Enable Register |  |  |  |
| 0x305        | mtvec         | MRW       |            | Machine Trap-Handler Base Addre   |  |  |  |
| 0x307        | mtvt          | MRW       | SMCLIC = 1 | Machine Trap-Handler Vector Table |  |  |  |
| 0x310        | mstatush      | MRW       |            | Machine Status (upper 32 bits).   |  |  |  |
| 0x320        | mcountinhibit | MRW       |            | (HPM) Machine Counter-Inhibit Re  |  |  |  |
| 0x323        | mhpmevent3    | MRW       |            | (HPM) Machine Performance-Mon     |  |  |  |
|              |               |           | · · ·      |                                   |  |  |  |
| 0x33F        | mhpmevent31   | MRW       |            | (HPM) Machine Performance-Mon     |  |  |  |
| 0x340        | mscratch      | MRW       |            | Machine Scratch                   |  |  |  |
| 0x341        | mepc          | MRW       |            | Machine Exception Program Count   |  |  |  |
| 0x342        | mcause        | MRW       |            | Machine Trap Cause                |  |  |  |
| 0x343        | mtval         | MRW       |            | Machine Trap Value                |  |  |  |

Table 13.1: Control and Status Register Map

|             | Nomo                                  |           | Deremeter            |                                     |
|-------------|---------------------------------------|-----------|----------------------|-------------------------------------|
| CSR Address | Name                                  | Privilege | Parameter            | Description                         |
| 0x344       | mip                                   | MRW       |                      | Machine Interrupt Pending Register  |
| 0x345       | mnxti                                 | MRW       | SMCLIC = 1           | Interrupt handler address and enabl |
| 0x346       | mintstatus                            | MRW       | SMCLIC = 1           | Current interrupt levels            |
| 0x347       | mintthresh                            | MRW       | SMCLIC = 1           | Interrupt-level threshold           |
| 0x348       | mscratchcsw                           | MRW       | SMCLIC = 1           | Conditional scratch swap on priv m  |
| 0x349       | mscratchcswl                          | MRW       | SMCLIC = 1           | Conditional scratch swap on level c |
| 0x34A       | mclicbase                             | MRW       | SMCLIC = 1           | CLIC Base Register                  |
| 0x7A0       | tselect                               | MRW       | DBG_NUM_TRIGGERS > 0 | Trigger Select Register             |
| 0x7A1       | tdata1                                | MRW       | DBG_NUM_TRIGGERS > 0 | Trigger Data Register 1             |
| 0x7A2       | tdata2                                | MRW       | DBG_NUM_TRIGGERS > 0 | Trigger Data Register 2             |
| 0x7A3       | tdata3                                | MRW       | DBG_NUM_TRIGGERS > 0 | Trigger Data Register 3             |
| 0x7A4       | tinfo                                 | MRW       | DBG_NUM_TRIGGERS > 0 | Trigger Info                        |
| 0x7A5       | tcontrol                              | MRW       | DBG_NUM_TRIGGERS > 0 | Trigger Control                     |
| 0x7A8       | mcontext                              | MRW       | DBG_NUM_TRIGGERS > 0 | Machine Context Register            |
| 0x7AA       | mscontext                             | MRW       | DBG_NUM_TRIGGERS > 0 | Machine Context Register            |
| 0x7B0       | dcsr                                  | DRW       |                      | Debug Control and Status            |
| 0x7B1       | dpc                                   | DRW       |                      | Debug PC                            |
| 0x7B2       | dscratch0                             | DRW       |                      | Debug Scratch Register 0            |
| 0x7B3       | dscratch1                             | DRW       |                      | Debug Scratch Register 1            |
| 0xB00       | mcycle                                | MRW       |                      | (HPM) Machine Cycle Counter         |
| 0xB02       | minstret                              | MRW       |                      | (HPM) Machine Instructions-Retire   |
| 0xB03       | mhpmcounter3                          | MRW       |                      | (HPM) Machine Performance-Mon       |
|             | · · · · · · · · · · · · · · · · · · · |           |                      |                                     |
| 0xB1F       | mhpmcounter31                         | MRW       |                      | (HPM) Machine Performance-Mon       |
| 0xB80       | mcycleh                               | MRW       |                      | (HPM) Upper 32 Machine Cycle Co     |
| 0xB82       | minstreth                             | MRW       |                      | (HPM) Upper 32 Machine Instructi    |
| 0xB83       | mhpmcounterh3                         | MRW       |                      | (HPM) Upper 32 Machine Performa     |
|             | · · · ·                               |           |                      |                                     |
| 0xB9F       | mhpmcounterh31                        | MRW       |                      | (HPM) Upper 32 Machine Performa     |
| 0xF11       | mvendorid                             | MRO       |                      | Machine Vendor ID                   |
| 0xF12       | marchid                               | MRO       |                      | Machine Architecture ID             |
| 0xF13       | mimpid                                | MRO       |                      | Machine Implementation ID           |
| 0xF14       | mhartid                               | MRO       |                      | Hardware Thread ID                  |
| 0xF15       | mconfigptr                            | MRO       |                      | Machine Configuration Pointer       |
| -           | JF                                    |           | <u> </u>             |                                     |

Table 13.2: Control and Status Register Map (Unprivileged and User-Level CSRs)

| CSR Address                                                          | Name | Privilege | Parameter | Description |  |
|----------------------------------------------------------------------|------|-----------|-----------|-------------|--|
| Unprivileged and User-Level CSRs                                     |      |           |           |             |  |
| 0x017 jvt URW ZC_EXT = 1 Table jump base vector and control register |      |           |           |             |  |

| CSR Address | Name     | Privilege | Parameter | Description                           |
|-------------|----------|-----------|-----------|---------------------------------------|
| User CSRs   |          |           |           |                                       |
| 0xC00       | cycle    | URO       |           | Cycle Counter                         |
| 0xC02       | instret  | URO       |           | Instructions-Retired Counter          |
| 0xC80       | cycleh   | URO       |           | Upper 32 Cycle Counter                |
| 0xC82       | instreth | URO       |           | Upper 32 Instructions-Retired Counter |

|         |     |               |        | e       | •                                             |
|---------|-----|---------------|--------|---------|-----------------------------------------------|
| CSR     | Ad- | Name          | Privi- | Parame- | Description                                   |
| dress   |     |               | lege   | ter     |                                               |
| User CS | Rs  |               |        |         | ·                                             |
| 0xC03   |     | hpmcounter3   | URO    |         | (HPM) Performance-Monitoring Counter 3        |
|         |     |               |        |         | ·                                             |
| 0xC1F   |     | hpmcounter31  | URO    |         | (HPM) Performance-Monitoring Counter 31       |
| 0xC83   |     | hpmcounterh3  | URO    |         | (HPM) Upper 32 Performance-Monitoring Counter |
|         |     |               |        |         | 3                                             |
|         |     |               |        |         |                                               |
| 0xC9F   |     | hpmcounterh31 | URO    |         | (HPM) Upper 32 Performance-Monitoring Counter |
|         |     |               |        |         | 31                                            |

Table 13.4: Control and Status Register Map (additional CSRs for Zihpm)

# **13.2 CSR Descriptions**

What follows is a detailed definition of each of the CSRs listed above. The **R/W** column defines the access mode behavior of each bit field when accessed by the privilege level specified in Table 13.1 (or a higher privilege level):

- **R**: **read** fields are not affected by CSR write instructions. Such fields either return a fixed value, or a value determined by the operation of the core.
- **RW**: **read/write** fields store the value written by CSR writes. Subsequent reads return either the previously written value or a value determined by the operation of the core.
- WARL: write-any-read-legal fields store only legal values written by CSR writes. For example, a WARL (0x0) field supports only the value 0x0. Any value may be written, but all reads would return 0x0 regardless of the value being written to it. A WARL field may support more than one value. If an unsupported value is (attempted to be) written to a WARL field, the original (legal) value of the bitfield is preserved.
- WPRI: Software should ignore values read from these fields, and preserve the values when writing.

Note: The R/W information does not impact whether CSR accesses result in illegal instruction exceptions or not.

#### 13.2.1 Jump Vector Table (jvt)

CSR Address: 0x017

Reset Value: 0x0000\_0000

Include Condition: ZC\_EXT = 1

Detailed:

| Bit # | R/W        | Description                                  |
|-------|------------|----------------------------------------------|
| 31: 6 | RW         | <b>BASE</b> : Base Address, 64 byte aligned. |
| 5:0   | WARL (0x0) | MODE: Jump table mode                        |

Table jump base vector and control register

# 13.2.2 Machine Status (mstatus)

CSR Address: 0x300

Reset Value: defined (based on *X\_EXT*`, **X\_ECS\_XS**)

| Bit<br># | R/W                           | Description                                                                                                       |
|----------|-------------------------------|-------------------------------------------------------------------------------------------------------------------|
| #<br>31  | R                             | <b>SD</b> : State Dirty. $SD = ((FS == 0x3) OR (XS == 0x3) OR (VS == 0x3)).$                                      |
| -        | WPRI                          | SD. State Dirty. $SD = ((PS = 0.35) \text{ OK} (AS = 0.35) \text{ OK} (VS = 0.35)).$<br>Reserved. Hardwired to 0. |
| 50.25    | (0x0)                         | Reserved. Hardwired to 0.                                                                                         |
| 22       | WARL                          | <b>TSR</b> . Hardwired to 0.                                                                                      |
| 22       | (0x0)                         | ISK. Mardwilled to 0.                                                                                             |
| 21       | WARL                          | <b>TW</b> . Hardwired to 0.                                                                                       |
| 21       | (0x0)                         |                                                                                                                   |
| 20       | WARL                          | <b>TVM</b> . Hardwired to 0.                                                                                      |
| 20       | (0x0)                         |                                                                                                                   |
| 19       | $\frac{(0x0)}{R(0x0)}$        | MXR. Hardwired to 0.                                                                                              |
| 19       | $\frac{R(0x0)}{R(0x0)}$       | SUM. Hardwired to 0.                                                                                              |
| 17       | R(0x0)                        | MPRV. Hardwired to 0.                                                                                             |
|          | $\overline{\mathbf{R} (0x0)}$ | <b>XS</b> : Other Extension Context Status. R with reset value defined by X_ECS_XS if X_EXT == 1,                 |
| 10.15    | K / K (0x0)                   | hardwired to 0 otherwise.                                                                                         |
| 14:13    | RW /                          | <b>FS</b> : FPU Extension Context Status. RW if <b>X_EXT</b> == 1, hardwired to 0 otherwise.                      |
| 1        | WARL                          | 19.11 C Exclusion context status. Rev in x_Ext == 1; hardwired to o otherwise.                                    |
|          | (0x0)                         |                                                                                                                   |
| 12.11    | WARL                          | MPP: Machine Previous Priviledge mode. Hardwired to 0x3.                                                          |
| 12.11    | (0x3)                         |                                                                                                                   |
| 10:9     | RW/WPRI                       | <b>VS</b> : Vector Extension Context Status. RW if <b>X_EXT</b> == 1, hardwired to 0 otherwise.                   |
|          | (0x0)                         | ,                                                                                                                 |
| 8        | WARL                          | SPP. Hardwired to 0.                                                                                              |
|          | (0x0)                         |                                                                                                                   |
| 7        | RW                            | MPIE: When an exception is encountered, MPIE will be set to MIE. When the mret instruction                        |
|          |                               | is executed, the value of MPIE will be stored to MIE.                                                             |
| 6        | WARL                          | UBE. Hardwired to 0.                                                                                              |
|          | (0x0)                         |                                                                                                                   |
| 5        | R (0x0)                       | SPIE. Hardwired to 0.                                                                                             |
| 4        | WPRI                          | Reserved. Hardwired to 0.                                                                                         |
|          | (0x0)                         |                                                                                                                   |
| 3        | RW                            | MIE: If you want to enable interrupt handling in your exception handler, set the Interrupt                        |
|          |                               | Enable MIE to 1 inside your handler code.                                                                         |
| 2        | WPRI                          | Reserved. Hardwired to 0.                                                                                         |
|          | (0x0)                         |                                                                                                                   |
| 1        | R (0x0)                       | SIE. Hardwired to 0.                                                                                              |
| 0        | WPRI                          | Reserved. Hardwired to 0                                                                                          |
|          | (0x0)                         |                                                                                                                   |

# 13.2.3 Machine ISA (misa)

CSR Address: 0x301

Reset Value: defined (based on RV32, A\_EXT, M\_EXT, X\_MISA)

Detailed:

| Bit # | R/W        | Description                                                              |
|-------|------------|--------------------------------------------------------------------------|
| 31:30 | WARL (0x1) | MXL (Machine XLEN).                                                      |
| 29:26 | WARL (0x0) | (Reserved).                                                              |
| 25    | WARL (0x0) | Z (Reserved).                                                            |
| 24    | WARL (0x0) | Y (Reserved).                                                            |
| 23    | WARL       | X (Non-standard extensions present).                                     |
| 22    | WARL (0x0) | W (Reserved).                                                            |
| 21    | WARL       | V (Tentatively reserved for Vector extension).                           |
| 20    | WARL (0x0) | U (User mode implemented).                                               |
| 19    | WARL (0x0) | T (Tentatively reserved for Transactional Memory extension).             |
| 18    | WARL (0x0) | S (Supervisor mode implemented).                                         |
| 17    | WARL (0x0) | <b>R</b> (Reserved).                                                     |
| 16    | WARL       | <b>Q</b> (Quad-precision floating-point extension).                      |
| 15    | WARL       | <b>P</b> (Packed-SIMD extension).                                        |
| 14    | WARL (0x0) | O (Reserved).                                                            |
| 13    | WARL (0x0) | N                                                                        |
| 12    | WARL       | M (Integer Multiply/Divide extension).                                   |
| 11    | WARL (0x0) | L (Tentatively reserved for Decimal Floating-Point extension).           |
| 10    | WARL (0x0) | K (Reserved).                                                            |
| 9     | WARL (0x0) | J (Tentatively reserved for Dynamically Translated Languages extension). |
| 8     | WARL       | I (RV32I/64I/128I base ISA).                                             |
| 7     | WARL (0x0) | H (Hypervisor extension).                                                |
| 6     | WARL (0x0) | G (Additional standard extensions present).                              |
| 5     | WARL       | <b>F</b> (Single-precision floating-point extension).                    |
| 4     | WARL       | E (RV32E base ISA).                                                      |
| 3     | WARL       | <b>D</b> (Double-precision floating-point extension).                    |
| 2     | WARL (0x1) | C (Compressed extension).                                                |
| 1     | WARL (0x0) | <b>B</b> Reserved.                                                       |
| 0     | WARL       | A (Atomic extension).                                                    |

All bitfields in the misa CSR read as 0 except for the following:

- A = 1 if A\_EXT == 1
- **C** = 1
- **I** = 1 if RV32 == RV32I
- **E** = 1 if RV32 == RV32E
- $\mathbf{M} = 1$  if  $\mathbb{M}_{EXT} == \mathbf{M}$
- MXL = 1 (i.e. XLEN = 32)
- If X\_EXT == 1, then the value of X\_MISA is ORed into the misa CSR.

Note: The WARL `` in above table is depending on `X\_EXT. If X\_EXT == 1, then some of the misa bits can read values depending on the value of X\_MISA.

# 13.2.4 Machine Interrupt Enable Register (mie) - SMCLIC == 0

CSR Address: 0x304

Reset Value: 0x0000\_0000

Detailed:

| Bit # | R/W        | Description                                                                    |
|-------|------------|--------------------------------------------------------------------------------|
| 31:16 | RW         | Machine Fast Interrupt Enables: Set bit x to enable interrupt irq_i[x].        |
| 15:12 | WARL (0x0) | Reserved. Hardwired to 0.                                                      |
| 11    | RW         | <b>MEIE</b> : Machine External Interrupt Enable, if set, irq_i[11] is enabled. |
| 10    | WARL (0x0) | Reserved. Hardwired to 0.                                                      |
| 9     | WARL (0x0) | SEIE. Hardwired to 0                                                           |
| 8     | WARL (0x0) | Reserved. Hardwired to 0.                                                      |
| 7     | RW         | <b>MTIE</b> : Machine Timer Interrupt Enable, if set, irq_i[7] is enabled.     |
| 6     | WARL (0x0) | Reserved. Hardwired to 0.                                                      |
| 5     | WARL (0x0) | STIE. Hardwired to 0.                                                          |
| 4     | WARL (0x0) | Reserved. Hardwired to 0.                                                      |
| 3     | RW         | <b>MSIE</b> : Machine Software Interrupt Enable, if set, irq_i[3] is enabled.  |
| 2     | WARL (0x0) | Reserved. Hardwired to 0.                                                      |
| 1     | WARL (0x0) | SSIE. Hardwired to 0.                                                          |
| 0     | WARL (0x0) | Reserved. Hardwired to 0.                                                      |

# 13.2.5 Machine Interrupt Enable Register (mie) - SMCLIC == 1

CSR Address: 0x304 Reset Value: 0x0000\_0000 Detailed:

| Bit # | R/W        | Description               |
|-------|------------|---------------------------|
| 31:0  | WARL (0x0) | Reserved. Hardwired to 0. |

Note: In CLIC mode the mie CSR is replaced by separate memory-mapped interrupt enables (clicintie).

# 13.2.6 Machine Trap-Vector Base Address (mtvec) - SMCLIC == 0

CSR Address: 0x305

Reset Value: Defined

Detailed:

| Bit  | R/W        | Description                                                                               |
|------|------------|-------------------------------------------------------------------------------------------|
| #    |            |                                                                                           |
| 31:7 | RW         | BASE[31:7]: Trap-handler base address, always aligned to 128 bytes.                       |
| 6:2  | WARL (0x0) | BASE[6:2]: Trap-handler base address, always aligned to 128 bytes. mtvec[6:2] is          |
|      |            | hardwired to 0x0.                                                                         |
| 1:0  | WARL (0x0, | <b>MODE</b> [0]: Interrupt handling mode. $0x0 = non-vectored basic mode, 0x1 = vectored$ |
|      | 0x1)       | basic mode.                                                                               |

The initial value of mtvec is equal to {mtvec\_addr\_i[31:7], 5'b0, 2'b01}.

When an exception or an interrupt is encountered, the core jumps to the corresponding handler using the content of the mtvec[31:7] as base address. Both direct mode and vectored mode are supported.

The NMI vector location is at index 15 of the machine trap vector table for both direct mode and vectored mode (i.e. at {mtvec[31:7], 5'hF, 2'b00}).

# 13.2.7 Machine Trap-Vector Base Address (mtvec) - SMCLIC == 1

CSR Address: 0x305

Reset Value: Defined

Detailed:

| Bit  | R/W   | Description                                                                            |
|------|-------|----------------------------------------------------------------------------------------|
| #    |       |                                                                                        |
| 31:7 | RW    | <b>BASE[31:7]</b> : Trap-handler base address, always aligned to 128 bytes.            |
| 6:2  | WARL  | BASE[6:2]: Trap-handler base address, always aligned to 128 bytes. mtvec[6:2] is hard- |
|      | (0x0) | wired to 0x0.                                                                          |
| 1:0  | WARL  | MODE: Interrupt handling mode. Always CLIC mode.                                       |
|      | (0x3) |                                                                                        |

The initial value of mtvec is equal to {mtvec\_addr\_i[31:7], 5'b0, 2'b11}.

#### 13.2.8 Machine Trap Vector Table Base Address (mtvt)

CSR Address: 0x307 Reset Value: 0x0000\_0000 Include Condition: SMCLIC = 1 Detailed:

| Bit  | R/W   | Description                                                                                                |
|------|-------|------------------------------------------------------------------------------------------------------------|
| #    |       |                                                                                                            |
| 31:N | RW    | <b>BASE[31:N]</b> : Trap-handler vector table base address. N = maximum(6, 2+SMCLIC_ID_WIDTH).             |
|      |       | See note below for alignment restrictions.                                                                 |
| N-   | WARL  | <b>BASE</b> [N-1:6]: Trap-handler vector table base address. This field is only defined if $N > 6$ . $N =$ |
| 1:6  | (0x0) | maximum(6, 2+SMCLIC_ID_WIDTH). mtvt[N-1:6] is hardwired to 0x0. See note below for                         |
|      |       | alignment restrictions.                                                                                    |
| 5:0  | R     | Reserved. Hardwired to 0.                                                                                  |
|      | (0x0) |                                                                                                            |

**Note:** The mtvt CSR holds the base address of the trap vector table, which has its alignment restricted to both at least 64-bytes and to  $2^{(2+SMCLIC_ID_WIDTH)}$  bytes or greater power-of-two boundary. For example if SMCLIC\_ID\_WIDTH = 8, then 256 CLIC interrupts are supported and the trap vector table is aligned to 1024 bytes, and therefore **BASE[9:6]** will be WARL (0x0).

#### 13.2.9 Machine Status (mstatush)

CSR Address: 0x310 Reset Value: 0x0000\_0000 Detailed:

| Bit# | R/W        | Definition                   |
|------|------------|------------------------------|
| 31:6 | WPRI (0x0) | Reserved. Hardwired to 0.    |
| 5    | WARL (0x0) | <b>MBE</b> . Hardwired to 0. |
| 4    | WARL (0x0) | <b>SBE</b> . Hardwired to 0. |
| 3:0  | WPRI (0x0) | Reserved. Hardwired to 0.    |

# 13.2.10 Machine Counter-Inhibit Register (mcountinhibit)

CSR Address: 0x320

Reset Value: Defined

The performance counter inhibit control register. The default value is to inihibit all implemented counters out of reset. The bit returns a read value of 0 for non implemented counters.

Detailed:

| Bit# | R/W   | Description                                                                           |  |
|------|-------|---------------------------------------------------------------------------------------|--|
| 31:3 | WARL  | mhpmcounter3 - mhpmcounter31 inhibits. Depends on NUM_MHPMCOUNTERS (i.e. bits related |  |
|      |       | to non-implemented counters always read as 0).                                        |  |
| 2    | WARL  | IR: minstret inhibit                                                                  |  |
| 1    | WARL  | Hardwired to 0.                                                                       |  |
|      | (0x0) |                                                                                       |  |
| 0    | WARL  | CY: mcycle inhibit                                                                    |  |

#### 13.2.11 Machine Performance Monitoring Event Selector (mhpmevent3 ... mhpmevent31)

CSR Address: 0x323 - 0x33F

Reset Value: 0x0000\_0000

Detailed:

| Bit#  | R/W        | Definition                                                     |
|-------|------------|----------------------------------------------------------------|
| 31:16 | WARL (0x0) | Hardwired to 0.                                                |
| 15:0  | WARL       | <b>SELECTORS:</b> Each bit represents a unique event to count. |

The event selector fields are further described in Performance Counters section. Non implemented counters always return a read value of 0.

# 13.2.12 Machine Scratch (mscratch)

CSR Address: 0x340 Reset Value: 0x0000\_0000 Detailed:

| Bit # | R/W | Description   |
|-------|-----|---------------|
| 31:0  | RW  | Scratch value |

# 13.2.13 Machine Exception PC (mepc)

CSR Address: 0x341

Reset Value: 0x0000\_0000

| Bit # | R/W        | Description                            |
|-------|------------|----------------------------------------|
| 31:1  | WARL       | Machine Expection Program Counter 31:1 |
| 0     | WARL (0x0) | Hardwired to 0.                        |

When an exception is encountered, the current program counter is saved in MEPC, and the core jumps to the exception address. When a mret instruction is executed, the value from MEPC replaces the current program counter.

#### 13.2.14 Machine Cause (mcause) - SMCLIC == 0

CSR Address: 0x342

Reset Value: 0x0000\_0000

| Bit # | R/W        | Description                                                                         |
|-------|------------|-------------------------------------------------------------------------------------|
| 31    | RW         | <b>INTERRUPT:</b> This bit is set when the exception was triggered by an interrupt. |
| 30:11 | WLRL (0x0) | EXCCODE[30:11]. Hardwired to 0.                                                     |
| 10:0  | WLRL       | EXCCODE[10:0]. See note below.                                                      |

**Note:** Software accesses to *mcause[10:0]* must be sensitive to the WLRL field specification of this CSR. For example, when *mcause[31]* is set, writing 0x1 to *mcause[1]* (Supervisor software interrupt) will result in UNDEFINED behavior.

#### 13.2.15 Machine Cause (mcause) - SMCLIC == 1

CSR Address: 0x342

Reset Value: 0x0000\_0000

| Bit   | R/W   | Description                                                                               |
|-------|-------|-------------------------------------------------------------------------------------------|
| #     |       |                                                                                           |
| 31    | RW    | <b>INTERRUPT:</b> This bit is set when the exception was triggered by an interrupt.       |
| 30    | R     | MINHV. Set by hardware at start of hardware vectoring, cleared by hardware at end of suc- |
|       |       | cessful hardware vectoring.                                                               |
| 29:28 | WARL  | MPP: Previous privilege mode. Same as mstatus.MPP                                         |
|       | (0x3) |                                                                                           |
| 27    | RW    | MPIE: Previous interrupt enable. Same as mstatus.MPIE                                     |
| 26:24 | RW    | Reserved. Hardwired to 0.                                                                 |
| 23:16 | RW    | MPIL: Previous interrupt level.                                                           |
| 15:12 | WARL  | Reserved. Hardwired to 0.                                                                 |
|       | (0x0) |                                                                                           |
| 11    | WLRL  | EXCCODE[11]                                                                               |
|       | (0x0) |                                                                                           |
| 10:0  | WLRL  | EXCCODE[10:0]                                                                             |

**Note:** mcause.MPP and mstatus.MPP mirror each other. mcause.MPIE and mstatus.MPIE mirror each other. Reading or writing the fields MPP/MPIE in mcause is equivalent to reading or writing the homonymous field in mstatus.

# 13.2.16 Machine Trap Value (mtval)

CSR Address: 0x343 Reset Value: 0x0000\_0000

Detailed:

| Bit # | R/W        | Description     |
|-------|------------|-----------------|
| 31:0  | WARL (0x0) | Hardwired to 0. |

#### 13.2.17 Machine Interrupt Pending Register (mip) - SMCLIC == 0

CSR Address: 0x344 Reset Value: 0x0000\_0000 Detailed:

| Bit # | R/W        | Description                                                                    |
|-------|------------|--------------------------------------------------------------------------------|
| 31:16 | R          | Machine Fast Interrupt Enables: Interrupt irq_i[x] is pending.                 |
| 15:12 | WARL (0x0) | Reserved. Hardwired to 0.                                                      |
| 11    | R          | <b>MEIP</b> : Machine External Interrupt Enable, if set, irq_i[11] is pending. |
| 10    | WARL (0x0) | Reserved. Hardwired to 0.                                                      |
| 9     | WARL (0x0) | SEIP. Hardwired to 0                                                           |
| 8     | WARL (0x0) | Reserved. Hardwired to 0.                                                      |
| 7     | R          | <b>MTIP</b> : Machine Timer Interrupt Enable, if set, irq_i[7] is pending.     |
| 6     | WARL (0x0) | Reserved. Hardwired to 0.                                                      |
| 5     | WARL (0x0) | STIP. Hardwired to 0.                                                          |
| 4     | WARL (0x0) | Reserved. Hardwired to 0.                                                      |
| 3     | R          | <b>MSIP</b> : Machine Software Interrupt Enable, if set, irq_i[3] is pending.  |
| 2     | WARL (0x0) | Reserved. Hardwired to 0.                                                      |
| 1     | WARL (0x0) | <b>SSIP</b> . Hardwired to 0.                                                  |
| 0     | WARL (0x0) | Reserved. Hardwired to 0.                                                      |

#### 13.2.18 Machine Interrupt Pending Register (mip) - SMCLIC == 1

CSR Address: 0x344

Reset Value: 0x0000\_0000

Detailed:

| Bit # | R/W        | Description               |
|-------|------------|---------------------------|
| 31:0  | WARL (0x0) | Reserved. Hardwired to 0. |

Note: In CLIC mode the mip CSR is replaced by separate memory-mapped interrupt enables (clicintip).

# 13.2.19 Machine Next Interrupt Handler Address and Interrupt Enable (mnxti)

CSR Address: 0x345

Reset Value: 0x0000\_0000

Include Condition: SMCLIC = 1

Detailed:

| Bit # | R/W | Description                                                         |
|-------|-----|---------------------------------------------------------------------|
| 31:0  | RW  | MNXTI: Machine Next Interrupt Handler Address and Interrupt Enable. |

This register can be used by the software to service the next interrupt when it is in the same privilege mode, without incurring the full cost of an interrupt pipeline flush and context save/restore.

#### 13.2.20 Machine Interrupt Status (mintstatus)

CSR Address: 0x346 Reset Value: 0x0000\_0000 Include Condition: SMCLIC = 1 Detailed:

| Bit # | R/W     | Description                                              |  |
|-------|---------|----------------------------------------------------------|--|
| 31:24 | R       | MIL: Machine Interrupt Level                             |  |
| 23:16 | R (0x0) | Reserved. Hardwired to 0.                                |  |
| 15: 8 | R (0x0) | <b>SIL</b> : Supervisor Interrupt Level, hardwired to 0. |  |
| 7:0   | R (0x0) | <b>UIL</b> : User Interrupt Level, hardwired to 0.       |  |

This register holds the active interrupt level for each privilege mode. Only Machine Interrupt Level is supported.

# 13.2.21 Machine Interrupt-Level Threshold (mintthresh)

CSR Address: 0x347 Reset Value: 0x0000\_0000

Include Condition: SMCLIC = 1

Detailed:

| Bit # | R/W     | Description               |
|-------|---------|---------------------------|
| 31: 8 | R (0x0) | Reserved. Hardwired to 0. |
| 7:0   | RW      | TH: Threshold             |

This register holds the machine mode interrupt level threshold.

# 13.2.22 Machine Scratch Swap for Priv Mode Change (mscratchcsw)

CSR Address: 0x348 Reset Value: 0x0000\_0000 Include Condition: SMCLIC = 1 Detailed:

| Bit # | R/W | Description                                                 |
|-------|-----|-------------------------------------------------------------|
| 31:0  | RW  | MSCRATCHCSW: Machine scratch swap for privilege mode change |

Scratch swap register for multiple privilege modes.

# 13.2.23 Machine Scratch Swap for Interrupt-Level Change (mscratchcswl)

CSR Address: 0x349 Reset Value: 0x0000\_0000 Include Condition: SMCLIC = 1 Detailed:

 Bit #
 R/W
 Description

 31:0
 RW
 MSCRATCHCSWL: Machine Scratch Swap for Interrupt-Level Change

Scratch swap register for multiple interrupt levels.

#### 13.2.24 CLIC Base (mclicbase)

CSR Address: 0x34A

Reset Value: 0x0000\_0000

Include Condition: SMCLIC = 1

Detailed:

| [ | Bit # | R/W     | Description               |
|---|-------|---------|---------------------------|
|   | 31:12 | RW      | MCLICBASE: CLIC Base      |
|   | 11: 0 | R (0x0) | Reserved. Hardwired to 0. |

CLIC base register.

# 13.2.25 Trigger Select Register (tselect)

CSR Address: 0x7A0

Reset Value: 0x0000\_0000

| Bit # | R/W                                  | Description                                                                                                                      |
|-------|--------------------------------------|----------------------------------------------------------------------------------------------------------------------------------|
| 31:0  | WARL<br>(0x0 - (DBG_NUM_TRIGGERS-1)) | CV32E40X implements 0 to<br>DBG_NUM_TRIGGERS triggers.<br>Selects<br>which trigger CSRs are accessed<br>through the tdata* CSRs. |

# 13.2.26 Trigger Data 1 (tdata1)

CSR Address: 0x7A1

Reset Value: 0x6800\_1044

Accessible in Debug Mode or M-Mode, depending on **TDATA1.dmode**. The contents of the **data** field depends on the current value of the **type** field. See [RISC-V-DEBUG] for details regarding all trigger related CSRs.

| Bit#  | R/W                | Description                                                              |
|-------|--------------------|--------------------------------------------------------------------------|
| 31:28 | WARL<br>(0x5, 0x6) | <b>type:</b> 6 = Address match trigger<br>type.<br>5 = Exception trigger |
| 27    | WARL (0x1)         | <b>dmode:</b> Only debug mode can write tdata registers                  |
| 26:0  | WARL               | <b>data:</b> Trigger data depending on type                              |

# 13.2.27 Match Control Type 6 (mcontrol6)

CSR Address: 0x7A1

Reset Value: 0x6800\_1044

Accessible in Debug Mode or M-Mode, depending on TDATA1.DMODE.

| Bit#  | R/W        | Description                              |
|-------|------------|------------------------------------------|
| 31:28 | WARL (0x6) | <b>TYPE:</b> 6 = Address match trigger.  |
| 27    | WARL (0x1) | DMODE: Only debug mode can               |
|       |            | write tdata registers                    |
| 26:25 | WARL (0x0) | Hardwired to 0.                          |
| 24    | WARL (0x0) | VS:. Hardwired to 0.                     |
| 23    | WARL (0x0) | VU:. Hardwired to 0.                     |
| 22    | WARL (0x0) | <b>HIT:</b> Hardwired to 0.              |
| 21    | WARL (0x0) | SELECT: Only address matching is         |
|       |            | supported.                               |
|       |            |                                          |
| 20    | WARL (0x0) | <b>TIMING:</b> Break before the          |
| 20    | WARL (0x0) | instruction at the specified             |
|       |            | address.                                 |
|       |            | address.                                 |
| 19:16 | WARL (0x0) | <b>SIZE:</b> Match accesses of any size. |
| 15:12 | WARL (0x1) | ACTION: Enter debug mode on              |
|       |            | match.                                   |
| 11    | WARL (0x0) | CHAIN:. Hardwired to 0                   |
|       |            |                                          |
| 10:7  | WARL       | MATCH: 0: Address matches                |
| 1017  | (0x0, 0x2, | tdata2.                                  |
|       |            | 2: Address is greater than or            |
|       | 0x3)       | equal to <i>tdata2</i>                   |
|       |            | 3: Address is less than <i>tdata2</i>    |
|       |            | 5. Address is less than iduluz           |
| 6     | WARL (0x1) | M: Match in M-Mode.                      |
| 5     | WARL (0x0) | Hardwired to 0.                          |
| 4     | WARL (0x0) | S:. Hardwired to 0.                      |
| 3     | WARL (0x0) | U:. Hardwired to 0.                      |
| 2     | WARL       | <b>EXECUTE:</b> Enable matching on       |
|       |            | instruction address.                     |
| 1     | WARL       | <b>STORE:</b> Enable matching on store   |
|       |            | address.                                 |
| 0     | WARL       | LOAD: Enable matching on load            |
|       |            | address.                                 |

# 13.2.28 Exception Trigger (etrigger)

CSR Address: 0x7A1

Reset Value: 0x5800\_0201

Accessible in Debug Mode or M-Mode, depending on TDATA1.DMODE.

| Bit#  | R/W        | Description                                             |
|-------|------------|---------------------------------------------------------|
| 31:28 | WARL (0x5) | <b>TYPE:</b> 5 = Exception trigger.                     |
| 27    | WARL (0x1) | <b>DMODE:</b> Only debug mode can write tdata registers |
| 26    | WARL (0x0) | <b>HIT:</b> . Hardwired to 0.                           |
| 25:13 | WARL (0x0) | Hardwired to 0.                                         |
| 12    | WARL (0x0) | VS:. Hardwired to 0.                                    |
| 11    | WARL (0x0) | VU:. Hardwired to 0.                                    |
| 10    | WARL       | <b>NMI:</b> Set to enable trigger on NMI.               |
| 9     | WARL (0x1) | M: Match in M-Mode.                                     |
| 8     | WARL (0x0) | Hardwired to 0.                                         |
| 7     | WARL (0x0) | S:. Hardwired to 0.                                     |
| 6     | WARL (0x0) | U:. Hardwired to 0.                                     |
| 5:0   | WARL (0x1) | ACTION: Enter debug mode on match.                      |

# 13.2.29 Trigger Data Register 2 (tdata2)

CSR Address: 0x7A2

Reset Value: 0x0000\_0000

Detailed:

| Bit# | R/W | Description |
|------|-----|-------------|
| 31:0 | RW  | DATA        |

Accessible in Debug Mode or M-Mode, depending on **TDATA1.DMODE**. This register stores the instruction address to match against for a breakpoint trigger or the currently selected exception codes for an exception trigger.

# 13.2.30 Trigger Data Register 3 (tdata3)

CSR Address: 0x7A3

Reset Value: 0x0000\_0000

Detailed:

| Bit# | R/W        | Description     |
|------|------------|-----------------|
| 31:0 | WARL (0x0) | Hardwired to 0. |

Accessible in Debug Mode or M-Mode. CV32E40X does not support the features requiring this register. CSR is hardwired to 0.

# 13.2.31 Trigger Info (tinfo)

CSR Address: 0x7A4

Reset Value: 0x0000\_0060

Detailed:

| Bit#  | R/W                            | Description                              |
|-------|--------------------------------|------------------------------------------|
| 31:16 | WARL (0x0)                     | Hardwired to 0.                          |
| 15:0  | <b>R</b> ( <b>0x20</b> , 0x40) | <b>INFO</b> . Type 5 and 6 is supported. |

The **info** field contains one bit for each possible *type* enumerated in *tdata1*. Bit N corresponds to type N. If the bit is set, then that type is supported by the currently selected trigger. If the currently selected trigger does not exist, this field contains 1.

Accessible in Debug Mode or M-Mode.

# 13.2.32 Trigger Control (tcontrol)

CSR Address: 0x7A5

Reset Value: 0x0000\_0000

Detailed:

| Bit# | R/W        | Description           |
|------|------------|-----------------------|
| 31:8 | WARL (0x0) | Hardwired to 0.       |
| 7    | WARL (0x0) | MPTE. Hardwired to 0. |
| 6:4  | WARL (0x0) | Hardwired to 0.       |
| 3    | WARL (0x0) | MTE. Hardwired to 0.  |
| 2:0  | WARL (0x0) | Hardwired to 0.       |

CV32E40X does not support the features requiring this register. CSR is hardwired to 0.

# 13.2.33 Machine Context Register (mcontext)

CSR Address: 0x7A8

Reset Value: 0x0000\_0000

Detailed:

| Bit# | R/W        | Description     |  |
|------|------------|-----------------|--|
| 31:0 | WARL (0x0) | Hardwired to 0. |  |

Accessible in Debug Mode or M-Mode. CV32E40X does not support the features requiring this register. CSR is hardwired to 0.

# 13.2.34 Machine Supervisor Context Register (mscontext)

CSR Address: 0x7AA Reset Value: 0x0000\_0000 Detailed:

| Bit# | R/W        | Description     |
|------|------------|-----------------|
| 31:0 | WARL (0x0) | Hardwired to 0. |

Accessible in Debug Mode or M-Mode. CV32E40X does not support the features requiring this register. CSR is hardwired to 0.

# 13.2.35 Debug Control and Status (dcsr)

CSR Address: 0x7B0

Reset Value: 0x4000\_0003

Detailed:

| Bit # | R/W     | Description                                                                |
|-------|---------|----------------------------------------------------------------------------|
| 31:28 | R (0x4) | XDEBUGVER: returns 4 - External debug support exists as it is described in |
|       |         | [RISC-V-DEBUG].                                                            |
| 27:18 | WARL    | Reserved                                                                   |
|       | (0x0)   |                                                                            |
| 17    | WARL    | EBREAKVS. Hardwired to 0                                                   |
|       | (0x0)   |                                                                            |
| 16    | WARL    | EBREAKVU. Hardwired to 0.                                                  |
|       | (0x0)   |                                                                            |
| 15    | RW      | EBREAKM: Set to enter debug mode on ebreak.                                |
| 14    | WARL    | Hardwired to 0.                                                            |
|       | (0x0)   |                                                                            |
| 13    | WARL    | EBREAKS. Hardwired to 0.                                                   |
|       | (0x0)   |                                                                            |
| 12    | WARL    | EBREAKU. Hardwired to 0.                                                   |
|       | (0x0)   |                                                                            |
| 11    | WARL    | STEPIE: Set to enable interrupts during single stepping.                   |
| 10    | WARL    | STOPCOUNT. Hardwired to 0.                                                 |
|       | (0x0)   |                                                                            |
| 9     | WARL    | STOPTIME. Hardwired to 0.                                                  |
|       | (0x0)   |                                                                            |
| 8:6   | R       | CAUSE: Return the cause of debug entry.                                    |
| 5     | WARL    | V. Hardwired to 0.                                                         |
|       | (0x0)   |                                                                            |
| 4     | WARL    | MPRVEN. Hardwired to 0.                                                    |
|       | (0x0)   |                                                                            |
| 3     | R       | <b>NMIP</b> . If set, an NMI is pending                                    |
| 2     | RW      | STEP: Set to enable single stepping.                                       |
| 1:0   | WARL    | <b>PRV:</b> Returns the priviledge mode before debug entry.                |
|       | (0x3)   |                                                                            |

# 13.2.36 Debug PC (dpc)

CSR Address: 0x7B1 Reset Value: 0x0000\_0000 Detailed:

| Bit # | R/W | Description           |
|-------|-----|-----------------------|
| 31:0  | RW  | <b>DPC</b> . Debug PC |

When the core enters in Debug Mode, DPC contains the virtual address of the next instruction to be executed.

#### 13.2.37 Debug Scratch Register 0/1 (dscratch0/1)

CSR Address: 0x7B2/0x7B3 Reset Value: 0x0000\_0000 Detailed:

| Bit # | R/W | Description |
|-------|-----|-------------|
| 31:0  | RW  | DSCRATCH0/1 |

#### 13.2.38 Machine Cycle Counter (mcycle)

CSR Address: 0xB00 Reset Value: 0x0000\_0000 Detailed:

| Bit# | R/W | Description                                                 |
|------|-----|-------------------------------------------------------------|
| 31:0 | RW  | The lower 32 bits of the 64 bit machine mode cycle counter. |

# 13.2.39 Machine Instructions-Retired Counter (minstret)

CSR Address: 0xB02

Reset Value: 0x0000\_0000

Detailed:

| Bit# | R/W | Description                                                               |  |
|------|-----|---------------------------------------------------------------------------|--|
| 31:0 | RW  | The lower 32 bits of the 64 bit machine mode instruction retired counter. |  |

#### 13.2.40 Machine Performance Monitoring Counter (mhpmcounter3.. mhpmcounter31)

CSR Address: 0xB03 - 0xB1F

Reset Value: 0x0000\_0000

Detailed:

| Bit# | R/W | Description                            | ] |
|------|-----|----------------------------------------|---|
| 31:0 | RW  | Machine performance-monitoring counter | ] |

The lower 32 bits of the 64 bit machine performance-monitoring counter(s). The number of machine performancemonitoring counters is determined by the parameter NUM\_MHPMCOUNTERS with a range from 0 to 29 (default value of 1). Non implemented counters always return a read value of 0.

# 13.2.41 Upper 32 Machine Cycle Counter (mcycleh)

CSR Address: 0xB80 Reset Value: 0x0000\_0000

Detailed:

| Bit# | R/W | Description                                                 | ] |
|------|-----|-------------------------------------------------------------|---|
| 31:0 | RW  | The upper 32 bits of the 64 bit machine mode cycle counter. | ] |

#### 13.2.42 Upper 32 Machine Instructions-Retired Counter (minstreth)

CSR Address: 0xB82 Reset Value: 0x0000\_0000 Detailed:

Bit#R/WDescription31:0RWThe upper 32 bits of the 64 bit machine mode instruction retired counter.

# 13.2.43 Upper 32 Machine Performance Monitoring Counter (mhpmcounter3h ... mhpmcounter31h)

CSR Address: 0xB83 - 0xB9F

Reset Value: 0x0000 0000

Detailed:

| Bit# | R/W | Description                            |
|------|-----|----------------------------------------|
| 31:0 | RW  | Machine performance-monitoring counter |

The upper 32 bits of the 64 bit machine performance-monitoring counter(s). The number of machine performancemonitoring counters is determined by the parameter NUM\_MHPMCOUNTERS with a range from 0 to 29 (default value of 1). Non implemented counters always return a read value of 0.

## 13.2.44 Machine Vendor ID (mvendorid)

CSR Address: 0xF11

Reset Value: 0x0000\_0602

Detailed:

| Bit # | R/W     | Description                                                     |
|-------|---------|-----------------------------------------------------------------|
| 31:7  | R (0xC) | Number of continuation codes in JEDEC manufacturer ID.          |
| 6:0   | R (0x2) | Final byte of JEDEC manufacturer ID, discarding the parity bit. |

The mvendorid encodes the OpenHW JEDEC Manufacturer ID, which is 2 decimal (bank 13).

# 13.2.45 Machine Architecture ID (marchid)

CSR Address: 0xF12

Reset Value: 0x0000\_0014

Detailed:

| Bit # | R/W      | Description                                              |
|-------|----------|----------------------------------------------------------|
| 31:0  | R (0x14) | Machine Architecture ID of CV32E40X is 0x14 (decimal 20) |

#### 13.2.46 Machine Implementation ID (mimpid)

CSR Address: 0xF13 Reset Value: Defined Detailed:

| ſ | Bit # | R/W     | Description                                 |
|---|-------|---------|---------------------------------------------|
| ſ | 31:20 | R (0x0) | Hardwired to 0.                             |
| ſ | 19:16 | R (0x0) | MAJOR.                                      |
|   | 15:12 | R (0x0) | Hardwired to 0.                             |
| ſ | 11:8  | R (0x0) | MINOR.                                      |
| ſ | 7:4   | R (0x0) | Hardwired to 0.                             |
| ľ | 3:0   | R       | PATCH. mimpid_patch_i, see Core Integration |

The Machine Implementation ID uses a Major, Minor, Patch versioning scheme. The **PATCH** bitfield is defined and set by the integrator and shall be set to 0 when no patches are applied. It is made available as **mimpid\_patch\_i** on the boundary of CV32E40X such that it can easily be changed by a metal layer only change.

#### 13.2.47 Hardware Thread ID (mhartid)

CSR Address: 0xF14

Reset Value: Defined

| Bit # | R/W | Description                                                |  |
|-------|-----|------------------------------------------------------------|--|
| 31:0  | R   | Machine Hardware Thread ID mhartid_i, see Core Integration |  |

#### 13.2.48 Machine Configuration Pointer (mconfigptr)

CSR Address: 0xF15 Reset Value: 0x0000\_0000 Detailed:

| Bit# | R/W     | Definition |
|------|---------|------------|
| 31:0 | R (0x0) | Reserved   |

#### 13.2.49 Cycle Counter (cycle)

CSR Address: 0xC00

| Reset | Value: | 0x0000   | 0000  |
|-------|--------|----------|-------|
| neset | varue. | 0.00000_ | _0000 |

Detailed:

| Bit# | R/W | Description |
|------|-----|-------------|
| 31:0 | R   |             |

Read-only unprivileged shadow of the lower 32 bits of the 64 bit machine mode cycle counter.

# 13.2.50 Instructions-Retired Counter (instret)

CSR Address: 0xC02

Reset Value: 0x0000\_0000

Detailed:

| Bit# | R/W | Description |
|------|-----|-------------|
| 31:0 | R   |             |

Read-only unprivileged shadow of the lower 32 bits of the 64 bit machine mode instruction retired counter.

#### 13.2.51 Performance Monitoring Counter (hpmcounter3 .. hpmcounter31)

CSR Address: 0xC03 - 0xC1F Reset Value: 0x0000\_0000 Detailed:

| Bit# | R/W | Description |
|------|-----|-------------|
| 31:0 | R   |             |

Read-only unprivileged shadow of the lower 32 bits of the 64 bit machine mode performance counter. Non implemented counters always return a read value of 0.

### 13.2.52 Upper 32 Cycle Counter (cycleh)

CSR Address: 0xC80 Reset Value: 0x0000\_0000 Detailed:

| Bit# | R/W | Description |
|------|-----|-------------|
| 31:0 | R   |             |

Read-only unprivileged shadow of the upper 32 bits of the 64 bit machine mode cycle counter.

### 13.2.53 Upper 32 Instructions-Retired Counter (instreth)

CSR Address: 0xC82 Reset Value: 0x0000\_0000 Detailed:

| Bit# | R/W | Description |
|------|-----|-------------|
| 31:0 | R   |             |

Read-only unprivileged shadow of the upper 32 bits of the 64 bit machine mode instruction retired counter.

# 13.2.54 Upper 32 Performance Monitoring Counter (hpmcounter3h . . hpmcounter31h)

CSR Address: 0xC83 - 0xC9F

Reset Value: 0x0000\_0000

Detailed:

| Bit# | R/W | Description |
|------|-----|-------------|
| 31:0 | R   |             |

Read-only unprivileged shadow of the upper 32 bits of the 64 bit machine mode performance counter. Non implemented counters always return a read value of 0.

### FOURTEEN

# **PERFORMANCE COUNTERS**

CV32E40X implements performance counters according to [RISC-V-PRIV]. The performance counters are placed inside the Control and Status Registers (CSRs) and can be accessed with the CSRRW(I) and CSRRS/C(I) instructions.

CV32E40X implements the clock cycle counter mcycle(h), the retired instruction counter minstret(h), as well as the parameterizable number of event counters mhpmcounter3(h) - mhpmcounter31(h) and the corresponding event selector CSRs mhpmevent3 - mhpmevent31, and the mcountinhibit CSR to individually enable/disable the counters. mcycle(h) and minstret(h) are always available.

All counters are 64 bit wide.

The number of event counters is determined by the parameter NUM\_MHPMCOUNTERS with a range from 0 to 29 (default value of 1).

Unimplemented counters always read 0.

**Note:** All performance counters are using the gated version of clk\_i. The **wfi** instruction impact the gating of clk\_i as explained in *Sleep Unit* and can therefore affect the counters.

# 14.1 Event Selector

The following events can be monitored using the performance counters of CV32E40X.

| Bit # | Event Name    |                                                                                     |
|-------|---------------|-------------------------------------------------------------------------------------|
| 0     | CYCLES        | Number of cycles                                                                    |
| 1     | INSTR         | Number of instructions retired                                                      |
| 2     | COMP_INSTR    | Number of compressed instructions retired                                           |
| 3     | JUMP          | Number of jumps (unconditional)                                                     |
| 4     | BRANCH        | Number of branches (conditional)                                                    |
| 5     | BRANCH_TAKEN  | Number of branches taken (conditional)                                              |
| 6     | INTR_TAKEN    | Number of taken interrupts (excluding NMI)                                          |
| 7     | DATA_READ     | Number of read transactions on the OBI data interface.                              |
| 8     | DATA_WRITE    | Number of write transactions on the OBI data interface.                             |
| 9     | IF_INVALID    | Number of cycles that the IF stage causes ID stage underutilization                 |
| 10    | ID_INVALID    | Number of cycles that the ID stage causes EX stage underutilization                 |
| 11    | EX_INVALID    | Number of cycles that the EX stage causes WB stage underutilization                 |
| 12    | WB_INVALID    | Number of cycles that the WB stage causes register file write port underutilization |
| 13    | LD_STALL      | Number of stall cycles caused by load use hazards                                   |
| 14    | JMP_STALL     | Number of stall cycles caused by jump register hazards                              |
| 15    | WB_DATA_STALL | Number of stall cycles caused in the WB stage by loads/stores.                      |

The event selector CSRs mhpmevent3 - mhpmevent31 define which of these events are counted by the event counters mhpmcounter3(h) - mhpmcounter31(h). If a specific bit in an event selector CSR is set to 1, this means that events with this ID are being counted by the counter associated with that selector CSR. If an event selector CSR is 0, this means that the corresponding counter is not counting any event.

**Note:** At most 1 bit should be set in an event selector. If multiple bits are set in an event selector, then the operation of the associated counter is undefined.

### 14.2 Controlling the counters from software

By default, all available counters are disabled after reset in order to provide the lowest power consumption.

They can be individually enabled/disabled by overwriting the corresponding bit in the mcountinhibit CSR at address 0x320 as described in [RISC-V-PRIV]. In particular, to enable/disable mcycle(h), bit 0 must be written. For minstret(h), it is bit 2. For event counter mhpmcounterX(h), it is bit X.

The lower 32 bits of all counters can be accessed through the base register, whereas the upper 32 bits are accessed through the h-register. Reads of all these registers are non-destructive.

# 14.3 Parametrization at synthesis time

The mcycle(h) and minstret(h) counters are always available and 64 bit wide.

The number of available event counters mhpmcounterX(h) can be controlled via the NUM\_MHPMCOUNTERS parameter. By default NUM\_MHPCOUNTERS set to 1.

An increment of 1 to the NUM\_MHPCOUNTERS results in the addition of the following:

- 64 flops for mhpmcounterX
- 15 flops for *mhpmeventX*
- 1 flop for *mcountinhibit*[X]
- · Adder and event enablement logic

# 14.4 Time Registers (time(h))

The user mode time(h) registers are not implemented. Any access to these registers will cause an illegal instruction trap. It is recommended that a software trap handler is implemented to detect access of these CSRs and convert that into access of the platform-defined mtime register (if implemented in the platform).

#### FIFTEEN

# **EXCEPTIONS AND INTERRUPTS**

CV32E40X supports one of two interrupt architectures. If the SMCLIC parameter is set to 0, then the basic interrupt architecture is supported (see *Basic Interrupt Architecture*). If the SMCLIC parameter is set to 1, then the CLIC interrupt architecture is supported (see *CLIC Interrupt Architecture*).

# **15.1 Basic Interrupt Architecture**

If SMCLIC == 0, then CV32E40X supports the basic interrupt architecture as defined in [RISC-V-PRIV]. In this configuration only the basic interrupt handling modes (non-vectored basic mode and vectored basic mode) can be used. The irq\_i[31:16] interrupts are a custom extension that can be used with the basic interrupt architecture.

When entering an interrupt/exception handler, the core sets the mepc CSR to the current program counter and saves mstatus.MIE to mstatus.MPIE. All exceptions cause the core to jump to the base address of the vector table in the mtvec CSR. Interrupts are handled in either non-vectored basic mode or vectored basic mode depending on the value of mtvec.MODE. In non-vectored basic mode the core jumps to the base address of the vector table in the mtvec CSR. In vectored basic mode the core jumps to the base address plus four times the interrupt ID. Upon executing an MRET instruction, the core jumps to the program counter previously saved in the mepc CSR and restores mstatus.MPIE to mstatus.MIE.

The base address of the vector table must be aligned to 128 bytes and can be programmed by writing to the mtvec CSR (see *Machine Trap-Vector Base Address (mtvec)* - SMCLIC == 0).

### 15.1.1 Interrupt Interface

Table 15.1 describes the interrupt interface used for the basic interrupt architecture.

| Signal    | Di-      | Description                                                                                     |
|-----------|----------|-------------------------------------------------------------------------------------------------|
|           | rec-     |                                                                                                 |
|           | tion     |                                                                                                 |
| irq_i[31: | 1i6n]put | Active high, level sensistive interrupt inputs. Custom extension.                               |
| irq_i[15: | 1i2n]put | Reserved. Tie to 0.                                                                             |
| irq_i[11] | input    | Active high, level sensistive interrupt input. Referred to as Machine External Interrupt (MEI), |
|           |          | but integrator can assign a different purpose if desired.                                       |
| irq_i[10: | 8i]nput  | Reserved. Tie to 0.                                                                             |
| irq_i[7]  | input    | Active high, level sensistive interrupt input. Referred to as Machine Timer Interrupt (MTI),    |
|           |          | but integrator can assign a different purpose if desired.                                       |
| irq_i[6:4 | ]input   | Reserved. Tie to 0.                                                                             |
| irq_i[3]  | input    | Active high, level sensistive interrupt input. Referred to as Machine Software Interrupt (MSI), |
|           |          | but integrator can assign a different purpose if desired.                                       |
| irq_i[2:0 | ]input   | Reserved. Tie to 0.                                                                             |

| Table 15.1: Basic interrupt | architecture interface signals |
|-----------------------------|--------------------------------|
|-----------------------------|--------------------------------|

**Note:** The clic\_\*\_i pins are ignored in basic mode and should be tied to 0.

#### 15.1.2 Interrupts

The irq\_i[31:0] interrupts are controlled via the mstatus, mie and mip CSRs. CV32E40X uses the upper 16 bits of mie and mip for custom interrupts (irq\_i[31:16]), which reflects an intended custom extension in the RISC-V basic (a.k.a. CLINT) interrupt architecture. After reset, all interrupts, except for NMIs, are disabled. To enable any of the irq\_i[31:0] interrupts, both the global interrupt enable (MIE) bit in the mstatus CSR and the corresponding individual interrupt enable bit in the mie CSR need to be set. For more information, see the *Control and Status Registers* documentation.

If multiple interrupts are pending, they are handled in the fixed priority order defined by [RISC-V-PRIV]. The highest priority is given to the interrupt with the highest ID, except for the Machine Timer Interrupt, which has the lowest priority. So from high to low priority the interrupts are ordered as follows:

- store bus fault NMI (1025)
- load bus fault NMI (1024)
- irq\_i[31]
- irq\_i[30]
- ...
- irq\_i[16]
- irq\_i[11]
- irq\_i[3]
- irq\_i[7]

The irq\_i[31:0] interrupt lines are level-sensitive. The NMIs are triggered by load/store bus fault events. To clear the irq\_i[31:0] interrupts at the external source, CV32E40X relies on a software-based mechanism in which the

interrupt handler signals completion of the handling routine to the interrupt source, e.g., through a memory-mapped register, which then deasserts the corresponding interrupt line.

In Debug Mode, all interrupts are ignored independent of mstatus.MIE and the content of the mie CSR.

CV32E40X can trigger the following interrupts as reported in mcause:

| Inter- | Exception | Description                 | Scenario(s)                                     |
|--------|-----------|-----------------------------|-------------------------------------------------|
| rupt   | Code      |                             |                                                 |
| 1      | 3         | Machine Software Interrupt  | irq_i[3]                                        |
|        |           | (MSI)                       |                                                 |
| 1      | 7         | Machine Timer Interrupt     | irq_i[7]                                        |
|        |           | (MTI)                       |                                                 |
| 1      | 11        | Machine External Interrupt  | irq_i[11]                                       |
|        |           | (MEI)                       |                                                 |
| 1      | 31-16     | Machine Fast Interrupts     | irq_i[31]-irq_i[16]                             |
| 1      | 1024      | Load bus fault NMI (impre-  | <pre>data_err_i = 1 and data_rvalid_i = 1</pre> |
|        |           | cise)                       | for load                                        |
| 1      | 1025      | Store bus fault NMI (impre- | <pre>data_err_i = 1 and data_rvalid_i = 1</pre> |
|        |           | cise)                       | for store                                       |

**Note:** Load bus fault and store bus fault are handled as imprecise non-maskable interrupts (as opposed to precise exceptions).

**Note:** The NMI vector location is at index 15 of the machine trap vector table for both non-vectored basic mode and vectored basic mode (i.e. at {**mtvec[31:7]**, 5'hF, 2'b00}). The NMI vector location therefore does **not** match its exception code.

### 15.1.3 Nested Interrupt Handling

Within the basic interrupt architecture there is no hardware support for nested interrupt handling. Nested interrupt handling can however still be supported via software.

The hardware automatically disables interrupts upon entering an interrupt/exception handler. Otherwise, interrupts during the critical part of the handler, i.e. before software has saved the mepc and mstatus CSRs, would cause those CSRs to be overwritten. If desired, software can explicitly enable interrupts by setting mstatus.MIE to 1 from within the handler. However, software should only do this after saving mepc and mstatus. There is no limit on the maximum number of nested interrupts. Note that, after enabling interrupts by setting mstatus.MIE to 1, the current handler will be interrupted also by lower priority interrupts. To allow higher priority interrupts only, the handler must configure mie accordingly.

# **15.2 CLIC Interrupt Architecture**

If SMCLIC == 1, then CV32E40X supports the Core-Local Interrupt Controller (CLIC) Privileged Architecture Extension defined in [RISC-V-SMCLIC]. In this configuration only the CLIC interrupt handling mode can be used (i.e. mtvec[1:0] = 0x3).

The CLIC implementation is split into a part internal to the core (containing CSRs and related logic) and a part external to the core (containing memory mapped registers and arbitration logic). CV32E40X only provides the core internal part of CLIC. The external part can be added on the interface described in *Interrupt Interface*. CLIC provides low-latency, vectored, pre-emptive interrupts.

### 15.2.1 Interrupt Interface

Table 15.2 describes the interrupt interface used for the CLIC interrupt architecture.

| Signal Direc-                    |           | Description                                                       |
|----------------------------------|-----------|-------------------------------------------------------------------|
| _                                | tion      |                                                                   |
| clic_irq_i                       | input     | Is there any pending-and-enabled interrupt?                       |
| clic_irq_id_i[SMCLIC_ID_WID      | THrp1ut0] | Index of the most urgent pending-and-enabled interrupt.           |
| <pre>clic_irq_level_i[7:0]</pre> | input     | Interrupt level of the most urgent pending-and-enabled interrupt. |
| clic_irq_priv_i[1:0] input       |           | Associated privilege mode of the most urgent pending-and-enabled  |
|                                  |           | interrupt.                                                        |
| clic_irq_shv_i                   | input     | Selective hardware vectoring enabled for the most urgent pending- |
|                                  |           | and-enabled interrupt?                                            |

Table 15.2: CLIC interrupt architecture interface signals

The term *pending-and-enabled* interrupt in above table refers to *pending-and-locally-enabled*, i.e. based on the CLICINTIP and CLICINTIE memory mapped registers from [RISC-V-SMCLIC].

Note: Edge triggered interrupts are not supported.

**Note:** The irq\_i[31:0] pins are ignored in CLIC mode and should be tied to 0.

#### 15.2.2 Interrupts

Although the [RISC-V-SMCLIC] specification supports up to 4096 interrupts, CV32E40X itself supports at most 1024 interrupts. The maximum number of supported CLIC interrupts is equal to 2^SMCLIC\_ID\_WIDTH, which can range from 2 to 1024. The SMCLIC\_ID\_WIDTH parameter also impacts the alignment requirement for the trap vector table, see *Machine Trap Vector Table Base Address (mtvt)*.

### 15.2.3 Nested Interrupt Handling

CV32E40X offers hardware support for nested interrupt handling when SMCLIC == 1.

CLIC extends interrupt preemption to support up to 256 interrupt levels for each privilege mode, where highernumbered interrupt levels can preempt lower-numbered interrupt levels. See [RISC-V-SMCLIC] for details.

# 15.3 Non Maskable Interrupts

Non Maskable Interrupts (NMIs) update mepc, mcause and mstatus similar to regular interrupts. However, as the faults that result in NMIs are imprecise, the contents of mepc is not guaranteed to point to the instruction after the faulted load or store.

**Note:** Specifically mstatus.mie will get cleared to 0 when an (unrecoverable) NMI is taken. [RISC-V-PRIV] does not specify the behavior of mstatus in response to NMIs, see https://github.com/riscv/riscv-isa-manual/issues/756. If this behavior is specified at a future date, then we will reconsider our implementation.

The NMI vector location is at index 15 of the machine trap vector table for non-vectored basic mode, vectored basic mode and CLIC mode (i.e. {**mtvec[31:7]**, 5'hF, 2'b00}).

An NMI will occur when a load or store instruction experiences a bus fault. The fault resulting in an NMI is handled in an imprecise manner, meaning that the instruction that causes the fault is allowed to retire and the associated NMI is taken afterwards. NMIs are never masked by the MIE bit. NMIs are masked however while in debug mode or while single stepping with STEPIE = 0 in the dcsr CSR. This means that many instructions may retire before the NMI is visible to the core if debugging is taking place. Once the NMI is visible to the core, at most two instructions will retire before the NMI is taken.

If an NMI becomes pending while in debug mode as described above, the NMI will be taken immediately after debug mode has been exited.

In case of bufferable stores, the NMI is allowed to become visible an arbitrary time after the instruction retirement. As for the case with debugging, this can cause several instructions to retire before the NMI becomes visible to the core.

When a data bus fault occurs, the first detected fault will be latched and used for mcause when the NMI is taken. Any new data bus faults occuring while an NMI is pending will be discarded. When the NMI handler is entered, new data bus faults may be latched.

While an NMI is pending, DCSR.nmip will be 1. Note that this CSR is only accessible from debug mode, and is thus not visible for machine mode code.

# **15.4 Exceptions**

CV32E40X can trigger the following exceptions as reported in mcause:

| In-  | Ex-  | Description       | Scenario(s)                                                             |
|------|------|-------------------|-------------------------------------------------------------------------|
| ter- | cep- |                   |                                                                         |
| rupt | tion |                   |                                                                         |
|      | Code |                   |                                                                         |
| 0    | 1    | Instruction       | Execution attempt from I/O region.                                      |
|      |      | access fault      |                                                                         |
| 0    | 2    | Illegal instruc-  |                                                                         |
|      |      | tion              |                                                                         |
| 0    | 3    | Breakpoint        | Environment break.                                                      |
| 0    | 5    | Load access fault | Non-naturally aligned load access attempt to an I/O region. Load-       |
|      |      |                   | Reserved attempt to region without atomic support.                      |
| 0    | 7    | Store/AMO         | Non-naturally aligned store access attempt to an I/O region. Store-     |
|      |      | access fault      | Conditional or Atomic Memory Operation (AMO) attempt to re-             |
|      |      |                   | gion without atomic support.                                            |
| 0    | 11   | Environment call  |                                                                         |
|      |      | from M-Mode       |                                                                         |
|      |      | (ECALL)           |                                                                         |
| 0    | 48   | Instruction bus   | <pre>instr_err_i = 1 and instr_rvalid_i = 1 for instruction fetch</pre> |
|      |      | fault             |                                                                         |

If an instruction raises multiple exceptions, the priority, from high to low, is as follows:

- instruction access fault (1)
- instruction bus fault (48)
- illegal instruction (2)
- environment call from M-Mode (11)
- environment break (3)
- store/AMO access fault (7)
- load access fault (5)

Exceptions in general cannot be disabled and are always active. All exceptions are precise. Whether the PMA will actually cause exceptions depends on its configuration. CV32E40X raises an illegal instruction exception for any instruction in the RISC-V privileged and unprivileged specifications that is explicitly defined as being illegal according to the ISA implemented by the core, as well as for any instruction that is left undefined in these specifications unless the instruction encoding is configured as a custom CV32E40X instruction for specific parameter settings as defined in (see *CORE-V Instruction Set Extensions*). An instruction bus error leads to a precise instruction interface bus fault if an attempt is made to execute the instruction that has an associated bus error. Similarly an instruction fetch with a failing PMA check only leads to an instruction access exception if an actual execution attempt is made for it.

### SIXTEEN

# **DEBUG & TRIGGER**

CV32E40X offers support for execution-based debug according to [RISC-V-DEBUG]. The main requirements for the core are described in Chapter 4: RISC-V Debug, Chapter 5: Trigger Module, and Appendix A.2: Execution Based.

The following list shows the simplified overview of events that occur in the core when debug is requested:

- 1. Enters Debug Mode
- 2. Saves the PC to DPC
- 3. Updates the cause in the DCSR
- 4. Points the PC to the location determined by the input port dm\_haltaddr\_i
- 5. Begins executing debug control code.

Debug Mode can be entered by one of the following conditions:

- External debug event using the debug\_req\_i signal
- Trigger Module match event with TDATA1.action set to 1
- ebreak instruction when not in Debug Mode and when DCSR.EBREAKM == 1 (see *EBREAK Behavior* below)

A user wishing to perform an abstract access, whereby the user can observe or control a core's GPR or CSR register from the hart, is done by invoking debug control code to move values to and from internal registers to an externally addressable Debug Module (DM). Using this execution-based debug allows for the reduction of the overall number of debug interface signals.

**Note:** Debug support in CV32E40X is only one of the components needed to build a System on Chip design with run-control debug support (think "the ability to attach GDB to a core over JTAG"). Additionally, a Debug Module and a Debug Transport Module, compliant with the RISC-V Debug Specification, are needed.

A supported open source implementation of these building blocks can be found in the RISC-V Debug Support for PULP Cores IP block.

The CV32E40X also supports a Trigger Module to enable entry into Debug Mode on a trigger event with the following features:

- Number of trigger register(s) : Parametrizable 0-4 triggers using parameter DBG\_NUM\_TRIGGERS.
- Supported trigger types: instruction address match (Match Control) and exception trigger.

A trigger match will cause debug entry if TDATA1.action is 1.

The CV32E40X will not support the optional debug features 10, 11, & 12 listed in Section 4.1 of [RISC-V-DEBUG]. Specifically, a control transfer instruction's destination location being in or out of the Program Buffer and instructions depending on PC value shall **not** cause an illegal instruction.

# 16.1 Interface

| Signal                    | Direction | Description                          |
|---------------------------|-----------|--------------------------------------|
| debug_req_i               | input     | Request to enter Debug Mode          |
| debug_havereset_o         | output    | Debug status: Core has been reset    |
| debug_running_o           | output    | Debug status: Core is running        |
| debug_halted_o            | output    | Debug status: Core is halted         |
| dm_halt_addr_i[31:0]      | input     | Address for debugger entry           |
| dm_exception_addr_i[31:0] | input     | Address for debugger exception entry |

debug\_req\_i is the "debug interrupt", issued by the debug module when the core should enter Debug Mode. The debug\_req\_i is synchronous to clk\_i and requires a minimum assertion of one clock period to enter Debug Mode. The instruction being decoded during the same cycle that debug\_req\_i is first asserted shall not be executed before entering Debug Mode.

debug\_havereset\_o, debug\_running\_o, and debug\_mode\_o signals provide the operational status of the core to the debug module. The assertion of these signals is mutually exclusive.

debug\_havereset\_o is used to signal that the CV32E40X has been reset. debug\_havereset\_o is set high during the assertion of rst\_ni. It will be cleared low a few (unspecified) cycles after rst\_ni has been deasserted and fetch\_enable\_i has been sampled high.

debug\_running\_o is used to signal that the CV32E40X is running normally.

debug\_halted\_o is used to signal that the CV32E40X is in debug mode.

dm\_halt\_addr\_i is the address where the PC jumps to for a debug entry event. When in Debug Mode, an ebreak instruction will also cause the PC to jump back to this address without affecting status registers. (see *EBREAK Behavior* below)

dm\_exception\_addr\_i is the address where the PC jumps to when an exception occurs during Debug Mode. When in Debug Mode, the mret instruction will also cause the PC to jump back to this address without affecting status registers.

Both dm\_halt\_addr\_i and dm\_exception\_addr\_i must be word aligned.

# 16.2 Core Debug Registers

CV32E40X implements four core debug registers, namely *Debug Control and Status (dcsr)*, *Debug PC (dpc)*, and two debug scratch registers. Access to these registers in non Debug Mode results in an illegal instruction.

Several trigger registers are included if DBG\_NUM\_TRIGGERS is set to a value greater than 0. The following are the most relevant: *Trigger Select Register (tselect)*, *Trigger Data 1 (tdata1)*, *Trigger Data Register 2 (tdata2)* and *Trigger Info (tinfo)* If DBG\_NUM\_TRIGGERS is zero, access to the trigger registers will result in an illegal instruction exception.

The TDATA1.DMODE controls write access permission to the currently selected triggers tdata registers. In CV32E40X this bit is tied to 1, and thus only debug mode is able to write to the trigger registers.

# 16.3 Debug state

As specified in RISC-V Debug Specification ([RISC-V-DEBUG]) every hart that can be selected by the Debug Module is in exactly one of four states: nonexistent, unavailable, running or halted.

The remainder of this section assumes that the CV32E40X will not be classified as nonexistent by the integrator.

The CV32E40X signals to the Debug Module whether it is running or halted via its debug\_running\_o and debug\_halted\_o pins respectively. Therefore, assuming that this core will not be integrated as a nonexistent core, the CV32E40X is classified as unavailable when neither debug\_running\_o or debug\_halted\_o is asserted. Upon rst\_ni assertion the debug state will be unavailable until some cycle(s) after rst\_ni has been deasserted and fetch\_enable\_i has been sampled high. After this point (until a next reset assertion) the core will transition between having its debug\_halted\_o or debug\_running\_o pin asserted depending whether the core is in debug mode or not. Exactly one of the debug\_havereset\_o, debug\_running\_o, debug\_halted\_o is asserted at all times.

Figure 16.1 and show Figure 16.2 show typical examples of transitioning into the running and halted states.

Figure 16.1: Transition into debug running state

Figure 16.2: Transition into debug halted state

The key properties of the debug states are:

- The CV32E40X can remain in its unavailable state for an arbitrarily long time (depending on rst\_ni and fetch\_enable\_i).
- If debug\_req\_i is asserted after rst\_ni deassertion and before or coincident with the assertion of fetch\_enable\_i, then the CV32E40X is guaranteed to transition straight from its unavailable state into its halted state. If debug\_req\_i is asserted at a later point in time, then the CV32E40X might transition through the running state on its ways to the halted state.
- If debug\_req\_i is asserted during the running state, the core will eventually transition into the halted state (typically after a couple of cycles).

# 16.4 EBREAK Behavior

The EBREAK instruction description is distributed across several RISC-V specifications: [RISC-V-DEBUG], [RISC-V-PRIV], [RISC-V-UNPRIV]. The following is a summary of the behavior for three common scenarios.

### 16.4.1 Scenario 1 : Enter Exception

Executing the EBREAK instruction when the core is **not** in Debug Mode and the DCSR.EBREAKM == 0 shall result in the following actions:

- The core enters the exception handler routine located at MTVEC (Debug Mode is not entered)
- MEPC & MCAUSE are updated

To properly return from the exception, the ebreak handler will need to increment the MEPC to the next instruction. This requires querying the size of the ebreak instruction that was used to enter the exception (16 bit c.ebreak or 32 bit ebreak).

Note: The CV32E40X does not support MTVAL CSR register which would have saved the value of the instruction for exceptions. This may be supported on a future core.

### 16.4.2 Scenario 2 : Enter Debug Mode

Executing the EBREAK instruction when the core is **not** in Debug Mode and the DCSR.EBREAKM == 1 shall result in the following actions:

- The core enters Debug Mode and starts executing debug code located at dm\_halt\_addr\_i (exception routine not called)
- DPC & DCSR are updated

Similar to the exception scenario above, the debugger will need to increment the DPC to the next instruction before returning from Debug Mode.

Note: The default value of DCSR.EBREAKM is 0 and the DCSR is only accessible in Debug Mode. To enter Debug Mode from EBREAK, the user will first need to enter Debug Mode through some other means, such as from the external ``debug\_req\_i``, and set DCSR.EBREAKM.

### 16.4.3 Scenario 3 : Exit Program Buffer & Restart Debug Code

Execuitng the EBREAK instruction when the core is in Debug Mode shall result in the following actions:

- The core remains in Debug Mode and execution jumps back to the beginning of the debug code located at dm\_halt\_addr\_i
- none of the CSRs are modified

#### SEVENTEEN

## **RISC-V FORMAL INTERFACE**

**Note:** A bindable RISC-V Formal Interface (RVFI) interface will be provided for CV32E40X. See [SYMBIOTIC-RVFI] for details on RVFI.

The module  $cv32e40x_rvfi$  can be used to create a log of the executed instructions. It is a behavioral, non-synthesizable, module that can be bound to the  $cv32e40x_core$ .

RVFI serves the following purposes:

- It can be used for formal verification.
- It can be used to produce an instruction trace during simulation.
- It can be used as a monitor to ease interfacing with an external scoreboard that itself can be interfaced to an Instruction Set Simulator (ISS) for verification reasons.

### **17.1 New Additions**

#### **Debug Signals**

| output [NRET * | 3 - 1 : 0] rvfi_dbg               |
|----------------|-----------------------------------|
| output [NRET   | <pre>- 1 : 0] rvfi_dbg_mode</pre> |

Debug entry is seen by RVFI as happening between instructions. This means that neither the last instruction before debug entry nor the first instruction of the debug handler will signal any direct side-effects. The first instruction of the handler will however show the resulting state caused by these side-effects (e.g. the CSR rmask/rdata signals will show the updated values, pc\_rdata will be at the debug handler address, etc.).

For the first instruction after entering debug, the rvfi\_dbg signal contains the debug cause (see table below). The signal is otherwise 0. The rvfi\_dbg\_mode signal is high if the instruction was executed in debug mode and low otherwise.

| Ľ                |       |
|------------------|-------|
| Cause            | Value |
| None             | 0x0   |
| Ebreak           | 0x1   |
| Trigger Match    | 0x2   |
| External Request | 0x3   |
| Single Step      | 0x4   |

| Table | 17.1: | Debug | Causes |
|-------|-------|-------|--------|
| ruore | 1/.1. | Debug | Cuubeb |

#### **NMI** signals

```
output [1:0] rvfi_nmip
```

Whenever CV32E40X has a pending NMI, the rvfi\_nmip will signal this. rvfi\_nmip[0] will be 1 whenever an NMI is pending, while rvfi\_nmip[1] will be 0 for loads and 1 for stores.

#### **Sleep Signals**

These signals report core sleep and wakeup information.

output rvfi\_wu\_t [NRET - 1 : 0] rvfi\_wu
output logic [NRET - 1 : 0] rvfi\_sleep

Where the rvfi\_wu\_t struct contains following fields:

| Field     | Туре         | Bits   |
|-----------|--------------|--------|
| wu        | logic        | [0]    |
| interrupt | logic        | [1]    |
| debug     | logic        | [2]    |
| cause     | logic [10:0] | [13:3] |

Table 17.2: RVFI wu type

rvfi\_sleep is set on the last instruction before the core enters sleep mode. rvfi\_wu.wu is set for the first instruction
executed after waking up. rvfi\_wu.interrupt is set if the wakeup was caused by an interrupt, and rvfi\_wu.debug
is set if the wakeup was caused by a debug request. rvfi\_wu.cause signals the wakeup cause exception code.

# **17.2 Compatibility**

This chapter specifies interpretations and compatibilities to the [SYMBIOTIC-RVFI].

#### **Interface Qualification**

All RVFI output signals are qualified with the rvfi\_valid signal. Any RVFI operation (retired or trapped instruction) will set rvfi\_valid high and increment the rvfi\_order field. When rvfi\_valid is low, all other RVFI outputs can be driven to arbitrary values.

#### **Trap Signal**

The trap signal indicates that a synchronous trap has ocurred and side-effects can be expected.

```
output rvfi_trap_t[NRET - 1 : 0] rvfi_trap
```

Where the rvfi\_trap\_t struct contains the following fields:

|                 | 1.0         | •       |
|-----------------|-------------|---------|
| Field           | Туре        | Bits    |
| trap            | logic       | [0]     |
| exception       | logic       | [1]     |
| debug           | logic       | [2]     |
| exception_cause | logic [5:0] | [8:3]   |
| debug_cause     | logic [2:0] | [11:9]  |
| cause_type      | logic [1:0] | [13:12] |

rvfi\_trap consists of 14 bits. rvfi\_trap.trap is asserted if an instruction causes an exception or debug entry. rvfi\_trap.exception is set for synchronous traps that do not cause debug entry. rvfi\_trap.debug is set

for synchronous traps that do cause debug mode entry. rvfi\_trap.exception\_cause provide information about non-debug traps, while rvfi\_trap.debug\_cause provide information about traps causing entry to debug mode. rvfi\_trap.cause\_type differentiates between fault causes that map to the same exception code in rvfi\_trap. exception\_cause and rvfi\_trap.debug\_cause. When an exception is caused by a single stepped instruction, both rvfi\_trap.exception and rvfi\_trap.debug will be set. When rvfi\_trap signals a trap, CSR side effects and a jump to a trap/debug handler in the next cycle can be expected. The different trap scenarios, their expected side-effects and trap signalling are listed in the table below:

| Scenario                       | Trap                |                               |                     |     |                    |                  |     |                            | Description                                                           |
|--------------------------------|---------------------|-------------------------------|---------------------|-----|--------------------|------------------|-----|----------------------------|-----------------------------------------------------------------------|
|                                | Туре                | trap ex- de- excep- de- cause |                     |     |                    | da               | up- |                            |                                                                       |
|                                |                     | trap                          | ex-<br>cep-<br>tion | bug | excep-<br>tion_cau | ue-<br>isebug_ca | -   | _tølpated                  |                                                                       |
| Instruction<br>Access<br>Fault | Ex-<br>cep-<br>tion | 1                             | 1                   | Х   | 0x01               | X                | 0x0 | mcause,<br>mepc            | PMA detects instruction ex-<br>ecution from non-executable<br>memory. |
| Illegal In-<br>struction       | Ex-<br>cep-<br>tion | 1                             | 1                   | X   | 0x02               | X                | 0x0 | mcause,<br>mepc            | Illegal instruction decode.                                           |
| Break-<br>point                | Ex-<br>cep-<br>tion | 1                             | 1                   | Х   | 0x03               | X                | 0x0 | mcause,<br>mepc            | EBREAK executed with dcsr.ebreakm = 0.                                |
| Load<br>Access                 | Ex-<br>cep-         | 1                             | 1                   | Х   | 0x05               | X                | 0x0 | mcause,<br>mepc            | Non-naturally aligned load access attempt to an I/O region.           |
| Fault                          | tion                |                               |                     |     |                    |                  | 0x1 | mcause,<br>mepc            | Load-Reserved attempt to re-<br>gion without atomic support.          |
| Store/AMO<br>Access            | Ex-<br>cep-         | 1                             | 1                   | Х   | 0x07               | X                | 0x0 | mcause,<br>mepc            | Non-naturally aligned store access attempt to an I/O region.          |
| Fault                          | tion                |                               |                     |     |                    |                  | 0x1 | mcause,<br>mepc            | SC or AMO attempt to region without atomic support.                   |
| Envi-<br>ronment<br>Call       | Ex-<br>cep-<br>tion | 1                             | 1                   | Х   | 0x0B               | X                | 0x0 | mcause,<br>mepc            | ECALL executed from Ma-<br>chine mode.                                |
| Instruction<br>Bus Fault       | Ex-<br>cep-<br>tion | 1                             | 1                   | X   | 0x30               | X                | 0x0 | mcause,<br>mepc            | OBI bus error on instruction fetch.                                   |
| Break-<br>point to<br>debug    | De-<br>bug          | 1                             | 0                   | 1   | Х                  | 0x1              | 0x0 | dpc,<br>dcsr               | EBREAK from non-debug<br>mode executed with dcsr.<br>ebreakm == 1.    |
| Break-<br>point in<br>debug    | De-<br>bug          | 1                             | 0                   | 1   | X                  | 0x1              | 0x0 | No<br>CSRs<br>up-<br>dated | EBREAK in debug mode jumps to debug handler.                          |
| Debug<br>Trigger<br>Match      | De-<br>bug          | 1                             | 0                   | 1   | X                  | 0x2              | 0x0 | dpc,<br>dcsr               | Debug trigger address match with mcontrol.timing = 0.                 |
| Single step                    | De-<br>bug          | 1                             | Х                   | 1   | Х                  | 0x4              | Х   | dpc,<br>dcsr               | Single step.                                                          |

| Table 17.4: Table of synchronous trap type | Table 17.4: | Table | of sy | nchronous | trap | types |
|--------------------------------------------|-------------|-------|-------|-----------|------|-------|
|--------------------------------------------|-------------|-------|-------|-----------|------|-------|

#### Interrupts

Interrupts are seen by RVFI as happening between instructions. This means that neither the last instruction before the

interrupt nor the first instruction of the interrupt handler will signal any direct side-effects. The first instruction of the handler will however show the resulting state caused by these side-effects (e.g. the CSR rmask/rdata signals will show the updated values, pc\_rdata will be at the interrupt handler address etc.).

output rvfi\_intr\_t[NRET - 1 : 0] rvfi\_intr

Where the rvfi\_intr\_t struct contains the following fields:

|           |              | <b>7</b> 1 |
|-----------|--------------|------------|
| Field     | Туре         | Bits       |
| intr      | logic        | [0]        |
| exception | logic        | [1]        |
| interrupt | logic        | [2]        |
| cause     | logic [10:0] | [13:3]     |
|           |              |            |

rvfi\_intr consists of 14 bits. rvfi\_intr.intr is set for the first instruction of the trap handler when encountering an exception or interrupt. rvfi\_intr.exception indicates it was caused by synchronous trap and rvfi\_intr. interrupt indicates it was caused by an interrupt. rvfi\_intr.cause signals the cause for entering the trap handler.

rvfi\_intr is not set for debug traps unless a debug entry happens in the first instruction of an interrupt handler (see rvfi\_intr == X in the table below). In this case CSR side-effects (to mepc) can be expected.

| Scenario                          | rvfi_ | rvfi_intr |        |           |        | rvfi_dbg[2:0) cause[31] csr[8:6] |            |  |
|-----------------------------------|-------|-----------|--------|-----------|--------|----------------------------------|------------|--|
|                                   | intr  | ex-       | inter- | cause     |        |                                  | (cause)    |  |
|                                   |       | cep-      | rupt   |           |        |                                  |            |  |
|                                   |       | tion      |        |           |        |                                  |            |  |
| Synchronous trap                  | 0     | 1         | 1      | Sync trap | 0x0    | 0                                | Х          |  |
|                                   |       |           |        | cause     |        |                                  |            |  |
| Interrupt (includes NMIs from bus | 1     | 0         | 1      | Interrupt | 0x0    | 1                                | X          |  |
| errors)                           |       |           |        | cause     |        |                                  |            |  |
| Debug entry due to EBREAK (from   | 0     | 0         | 0      | 0x0       | 0x1    | X                                | 0x1        |  |
| non-debug mode)                   |       |           |        |           |        |                                  |            |  |
| Debug entry due to EBREAK (from   | 0     | 0         | 0      | 0x0       | 0x1    | Х                                | Х          |  |
| debug mode)                       |       |           |        |           |        |                                  |            |  |
| Debug entry due to trigger match  | 0     | 0         | 0      | 0x0       | 0x2    | Х                                | 0x2        |  |
| Debug entry due to external debug | Х     | Х         | X      | X         | 0x3 or | Х                                | 0x3 or 0x5 |  |
| request                           |       |           |        |           | 0x5    |                                  |            |  |
| Debug handler entry due to single | Х     | Х         | X      | X         | 0x4    | Х                                | 0x4        |  |
| step                              |       |           |        |           |        |                                  |            |  |

Table 17.6: Table of scenarios for 1st instruction of exception/interrupt/debug handler

#### **Program Counter**

The pc\_wdata signal shows the predicted next program counter. This prediction ignores asynchronous traps (asynchronous debug requests and interrupts) and single step debug requests that may have happened at the same time as the instruction.

#### **Memory Access**

For cores as CV32E40X that support misaligned access rvfi\_mem\_addr will not always be 4 byte aligned. For misaligned accesses the start address of the transfer is reported (i.e. the start address of the first sub-transfer).

#### **CSR Signals**

To reduce the number of signals in the RVFI interface, a vectorized CSR interface has been introduced for register ranges.

```
output [<NUM_CSRNAME>-1:0] [NRET * XLEN - 1 : 0] rvfi_csr_<csrname>_rmask
output [<NUM_CSRNAME>-1:0] [NRET * XLEN - 1 : 0] rvfi_csr_<csrname>_wmask
output [<NUM_CSRNAME>-1:0] [NRET * XLEN - 1 : 0] rvfi_csr_<csrname>_rdata
output [<NUM_CSRNAME>-1:0] [NRET * XLEN - 1 : 0] rvfi_csr_<csrname>_wdata
```

Example:

```
output [31:0] [31:0] rvfi_csr_name_rmask
output [31:0] [31:0] rvfi_csr_name_wmask
output [31:0] [31:0] rvfi_csr_name_rdata
output [31:0] [31:0] rvfi_csr_name_wdata
```

Instead of:

```
output [31:0] rvfi_csr_name0_rmask
output [31:0] rvfi_csr_name0_rmask
output [31:0] rvfi_csr_name0_rdata
output [31:0] rvfi_csr_name0_wdata
...
output [31:0] rvfi_csr_name31_rmask
output [31:0] rvfi_csr_name31_rmask
output [31:0] rvfi_csr_name31_rdata
output [31:0] rvfi_csr_name31_wdata
```

#### **Machine Counter/Timers**

In contrast to [SYMBIOTIC-RVFI], the **mcycle[h]** and **minstret[h]** registers are not modelled as happening "between instructions" but rather as a side-effect of the instruction. This means that an instruction that causes an increment (or decrement) of these counters will set the rvfi\_csr\_mcycle\_wmask, and that rvfi\_csr\_mcycle\_rdata is not necessarily equal to rvfi\_csr\_mcycle\_wdata.

#### Halt Signal

The rvfi\_halt signal is meant for liveness properties of cores that can halt execution. It is only needed for cores that can lock up. Tied to 0 for RISC-V compliant cores.

#### **Mode Signal**

The rvfi\_mode signal shows the *current* privilege mode as opposed to the *effective* privilege mode of the instruction. I.e. for load and store instructions the reported privilege level will therefore not depend on mstatus.mpp and mstatus. mprv.

### 17.3 Trace output file

Tracing can be enabled during simulation by defining CV32E40X\_TRACE\_EXECUTION. All traced instructions are written to a log file. The log file is named trace\_rvfi.log.

# **17.4 Trace output format**

The trace output is in tab-separated columns.

- 1. **PC**: The program counter
- 2. **Instr**: The executed instruction (base 16). 32 bit wide instructions (8 hex digits) are uncompressed instructions, 16 bit wide instructions (4 hex digits) are compressed instructions.
- 3. rs1\_addr Register read port 1 source address, 0x0 if not used by instruction
- 4. rs1\_data Register read port 1 read data, 0x0 if not used by instruction
- 5. rs2\_addr Register read port 2 source address, 0x0 if not used by instruction
- 6. rs2\_data Register read port 2 read data, 0x0 if not used by instruction
- 7. rd\_addr Register write port 1 destination address, 0x0 if not used by instruction
- 8. rd\_data Register write port 1 write data, 0x0 if not used by instruction
- 9. mem\_addr Memory address for instructions accessing memory
- 10. rvfi\_mem\_rmask Bitmask specifying which bytes in rvfi\_mem\_rdata contain valid read data
- 11. rvfi\_mem\_wmask Bitmask specifying which bytes in rvfi\_mem\_wdata contain valid write data
- 12. rvfi\_mem\_rdata The data read from memory address specified in mem\_addr
- 13. rvfi\_mem\_wdata The data written to memory address specified in mem\_addr

| PC             | Instr      | rs1_addr   | 1   | rs1_rdata   | rs2_addr | rs2_rdata | rd_addr | rd_wdata | mem_     |
|----------------|------------|------------|-----|-------------|----------|-----------|---------|----------|----------|
| ⊶addr n        | em_rmask m | em_wmask m | em_ | _rdata mem_ | _wdata   |           |         |          |          |
| 00001f9c       | 14c70793   | 0e         |     | 000096c8    | 0c       | 00000000  | 0f      | 00009814 | <b>.</b> |
| <b>⇔000098</b> | 14         | 0          | 0   | 00000000    | 00000000 |           |         |          |          |
| 00001fa0       | 14f72423   | 0e         |     | 000096c8    | 0f       | 00009814  | 00      | 00000000 |          |
| <b>⇔000098</b> | 10         | 0          | f   | 00000000    | 00009814 |           |         |          |          |
| 00001fa4       | 0000bf6d   | 1f         |     | 00000000    | 1b       | 00000000  | 00      | 00000000 |          |
| ⊶00001f        | a6         | 0          | 0   | 00000000    | 00000000 |           |         |          |          |
| 00001f5e       | 000043d8   | 0f         |     | 00009814    | 04       | 00000000  | 0e      | 00000000 |          |
| <b>⇔000098</b> | 18         | f          | 0   | 00000000    | 00000000 |           |         |          |          |
| 00001f60       | 0000487d   | 00         |     | 00000000    | 1f       | 00000000  | 10      | 0000001f |          |
| <b>⇔000000</b> | 1f         | 0          | 0   | 00000000    | 00000000 |           |         |          |          |

# EIGHTEEN

# **CORE-V INSTRUCTION SET EXTENSIONS**

CV32E40X does not support any custom ISA Extensions internal to the core. Custom instructions can be added external to the core via the eXtension interface described in *eXtension Interface*.

NINETEEN

# **CORE VERSIONS AND RTL FREEZE RULES**

The CV32E40X is defined by the marchid and mimpid tuple. The tuple identify which sets of parameters have been verified by OpenHW Group, and once RTL Freeze is achieved, no further non-logically equivalent changes are allowed on that set of parameters.

The RTL Freeze version of the core is indentified by a GitHub tag with the format  $cv32e40x_vMAJOR.MINOR.PATCH$  (e.g.  $cv32e40x_v1.0.0$ ). In addition, the release date is reported in the documentation.

# 19.1 What happens after RTL Freeze?

#### 19.1.1 A bug is found

If a bug is found that affect the already frozen parameter set, the RTL changes required to fix such bug are non-logically equivalent by definition. Therefore, the RTL changes are applied only on a different mimpid value and the bug and the fix must be documented. These changes are visible by software as the mimpid has a different value. Every bug or set of bugs found must be followed by another RTL Freeze release and a new GitHub tag.

#### 19.1.2 RTL changes on non-verified yet parameters

If changes affecting the core on a non-frozen parameter set are required, then such changes must remain logically equivalent for the already frozen set of parameters (except for the required mimpid update), and they must be applied on a different mimpid value. They can be non-logically equivalent to a non-frozen set of parameters. These changes are visible by software as the mimpid has a different value. Once the new set of parameters is verified and achieved the sign-off for RTL freeze, a new GitHub tag and version of the core is released.

#### 19.1.3 PPA optimizations and new features

Non-logically equivalent PPA optimizations and new features are not allowed on a given set of RTL frozen parameters (e.g., a faster divider). If PPA optimizations are logically-equivalent instead, they can be applied without changing the mimpid value (as such changes are not visible in software). However, a new GitHub tag should be released and changes documented.

# **19.2 Released core versions**

The verified parameter sets of the core, their implementation version, GitHub tags, and dates are reported here.

### TWENTY

# GLOSSARY

- ALU: Arithmetic/Logic Unit
- ASIC: Application-Specific Integrated Circuit
- Byte: 8-bit data item
- CPU: Central Processing Unit, processor
- CSR: Control and Status Register
- **Custom extension**: Non-Standard extension to the RISC-V base instruction set (RISC-V Instruction Set Manual, Volume I: User-Level ISA)
- **EXE**: Instruction Execute
- FPGA: Field Programmable Gate Array
- FPU: Floating Point Unit
- Halfword: 16-bit data item
- Halfword aligned address: An address is halfword aligned if it is divisible by 2
- **ID**: Instruction Decode
- IF: Instruction Fetch (Instruction Fetch)
- ISA: Instruction Set Architecture
- KGE: kilo gate equivalents (NAND2)
- LSU: Load Store Unit (Load-Store-Unit (LSU))
- M-Mode: Machine Mode (RISC-V Instruction Set Manual, Volume II: Privileged Architecture)
- NMI: Non-Maskable Interrupt
- **OBI**: Open Bus Interface
- PC: Program Counter
- PMA: Physical Memory Attribution
- RV32C: RISC-V Compressed (C extension)
- **RV32F**: RISC-V Floating Point (F extension)
- SIMD: Single Instruction/Multiple Data
- **Standard extension**: Standard extension to the RISC-V base instruction set (RISC-V Instruction Set Manual, Volume I: User-Level ISA)
- WARL: Write Any Values, Reads Legal Values

- WB: Write Back of instruction results
- WLRL: Write/Read Only Legal Values
- Word: 32-bit data item
- Word aligned address: An address is word aligned if it is divisible by 4
- WPRI: Reserved Writes Preserve Values, Reads Ignore Values

### BIBLIOGRAPHY

- [RISC-V-UNPRIV] RISC-V Instruction Set Manual, Volume I: User-Level ISA, Document Version 20191213 (December 13, 2019), https://github.com/riscv/riscv-isa-manual/releases/download/Ratified-IMAFDQC/ riscv-spec-20191213.pdf
- [RISC-V-PRIV] RISC-V Instruction Set Manual, Volume II: Privileged Architecture, Document Version 20211105-signoff (November 5, 2021), https://github.com/riscv/riscv-isa-manual/releases/download/ draft-20211105-c30284b/riscv-privileged.pdf
- [RISC-V-DEBUG] RISC-V Debug Support, version 1.0.0-STABLE, fe3d1e65efed4b56574c50867830c3c499f9b18c, https://github.com/riscv/riscv-debug-spec/blob/b659d7dc7f578e1a2a76f9e62a5eec0f2d80045c/ riscv-debug-stable.pdf
- [RISC-V-SMCLIC] "Smclic" Core-Local Interrupt Controller (CLIC) RISC-V Privileged Architecture Extension, version 0.9-draft, 3/15/2022, https://github.com/riscv/riscv-fast-interrupt/blob/ af247be97888f759c61a00800f86171f28151820/clic.pdf
- [RISC-V-ZBA\_ZBB\_ZBC\_ZBS] RISC-V Bit Manipulation ISA-extensions, Version 1.0.0-38-g865e7a7, 2021-06-28, https://github.com/riscv/riscv-bitmanip/releases/download/1.0.0/bitmanip-1.0.0-38-g865e7a7.pdf
- [RISC-V-ZCA\_ZCB\_ZCMB\_ZCMP\_ZCMT] RISC-V Standard Extension for the Zca, Zcb, Zcmb, Zcmp, Zcmt subsets of Zc, v0.70.1, 29f0511 (not ratified yet), https://github.com/riscv/riscv-code-size-reduction/ releases/download/V0.70.1-TOOLCHAIN-DEV/Zc\_0\_70\_1.pdf
- [RISC-V-CRYPTO] RISC-V Cryptography Extensions Volume I, Scalar & Entropy Source Instructions, Version v1.0.0, 2'nd December, 2021: Ratified, https://github.com/riscv/riscv-crypto/releases/download/v1.0. 0-scalar/riscv-crypto-spec-scalar-v1.0.0.pdf
- [OPENHW-OBI] OpenHW Open Bus Interface (OBI) protocol, version 1.4, https://github.com/openhwgroup/ core-v-docs/blob/master/cores/obi/OBI-v1.4.pdf
- [OPENHW-XIF] OpenHW eXtension Interface, revision 458c8a73, https://docs.openhwgroup.org/projects/ openhw-group-core-v-xif/
- [SYMBIOTIC-RVFI] Symbiotic EDA RISC-V Formal Interface https://github.com/SymbioticEDA/riscv-formal/ blob/master/docs/rvfi.md