Blame - docs/design_docs/software/code_sharing.rst - TF-M/trusted-firmware-m

blob: 7d06eacf23730b9b147d1c7e579bf9a998adce71 [file] [log] [blame]

Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	1	######################################################
				2	Code sharing between independently linked XIP binaries
				3	######################################################
				4
Anton Komlev	19b9071	2021-10-19 16:28:02 +0100	[diff] [blame]	5	:Author: Tamas Ban
Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	6	:Organization: Arm Limited
				7	:Contact: tamas.ban@arm.com
Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	8
				9	**********
				10	Motivation
				11	**********
				12	Cortex-M devices are usually constrained in terms of flash and RAM. Therefore,
				13	it is often challenging to fit bigger projects in the available memory. The PSA
				14	specifications require a device to both have a secure boot process in place at
				15	device boot-up time, and to have a partition in the SPE which provides
				16	cryptographic services at runtime. These two entities have some overlapping
				17	functionality. Some cryptographic primitives (e.g. hash calculation and digital
				18	signature verification) are required both in the bootloader and the runtime
				19	environment. In the current TF-M code base, both firmware components use the
				20	mbed-crypto library to implement these requirements. During the build process,
				21	the mbed-crpyto library is built twice, with different configurations (the
				22	bootloader requires less functionality) and then linked to the corresponding
				23	firmware component. As a result of this workflow, the same code is placed in the
				24	flash twice. For example, the code for the SHA-256 algorithm is included in
				25	MCUboot, but the exact same code is duplicated in the SPE cryptography
				26	partition. In most cases, there is no memory isolation between the bootloader
				27	and the SPE, because both are part of the PRoT code and run in the secure
				28	domain. So, in theory, the code of the common cryptographic algorithms could be
				29	reused among these firmware components. This could result in a big reduction in
				30	code footprint, because the cryptographic algorithms are usually flash hungry.
				31	Code size reduction can be a good opportunity for very constrained devices,
				32	which might need to use TF-M Profile Small anyway.
				33
				34	*******************
				35	Technical challenge
				36	*******************
				37	Code sharing in a regular OS environment is easily achievable with dynamically
				38	linked libraries. However, this is not the case in Cortex-M systems where
				39	applications might run bare-metal, or on top of an RTOS, which usually lacks
				40	dynamic loading functionality. One major challenge to be solved in the Cortex-M
				41	space is how to share code between independently linked XIP applications that
				42	are tied to a certain memory address range to be executable and have absolute
				43	function and global data memory addresses. In this case, the code is not
				44	relocatable, and in most cases, there is no loader functionality in the system
				45	that can perform code relocation. Also, the lack of an MMU makes the address
				46	space flat, constant and not reconfigurable at runtime by privileged code.
				47
				48	One other difficulty is that the bootloader and the runtime use the same RAM
				49	area during execution. The runtime firmware is executed strictly after the
				50	bootloader, so normally, it can reuse the whole secure RAM area, as it would be
				51	the exclusive user. No attention needs to be paid as to where global data is
				52	placed by the linker. The bootloader does not need to retain its state. The low
				53	level startup of the runtime firmware can freely overwrite the RAM with its data
				54	without corrupting bootloader functionality. However, with code sharing between
				55	bootloader and runtime firmware, these statements are no longer true. Global
				56	variables used by the shared code must either retain their value or must be
				57	reinitialised during low level startup of the runtime firmware. The startup code
				58	is not allowed to overwrite the shared global variables with arbitrary data. The
				59	following design proposal provides a solution to these challenges.
				60
				61	**************
				62	Design concept
				63	**************
				64	The bootloader is sometimes implemented as ROM code (BL1) or stored in a region
				65	of the flash which is lockable, to prevent tampering. In a secure system, the
				66	bootloader is immutable code and thus implements a part of the Root of Trust
				67	anchor in the device, which is trusted implicitly. The shared code is primarily
				68	part of the bootloader, and is reused by the runtime SPE firmware at a later
				69	stage. Not all of the bootloader code is reused by the runtime SPE, only some
				70	cryptographic functions.
				71
				72	Simplified steps of building with code sharing enabled:
				73
				74	- Complete the bootloader build process to have a final image that contains
				75	the absolute addresses of the shared functions, and the global variables
				76	used by these functions.
				77	- Extract the addresses of the functions and related global variables that are
				78	intended to be shared from the bootloader executable.
				79	- When building runtime firmware, provide the absolute addresses of the shared
				80	symbols to the linker, so that it can pick them up, instead of instantiating
				81	them again.
				82
				83	The execution flow looks like this:
				84
				85	.. code-block:: bash
				86
				87	SPE MCUboot func1() MCUboot func2() MCUboot func3()
				88	\|
				89	\| Hash()
				90	\|------------->\|
				91	\|----------------->\|
				92	\|
				93	Return \|
				94	Return \|<-----------------\|
				95	\|<-------------\|
				96	\|
				97	\|
				98	\|----------------------------------------------------->\|
				99	\|
				100	Function pointer in shared global data() \|
				101	\|<-----------------------------------------------------\|
				102	\|
				103	\| Return
				104	\|----------------------------------------------------->\|
				105	\|
				106	Return \|
				107	\|<-----------------------------------------------------\|
				108	\|
				109	\|
				110
				111	The execution flow usually returns from a shared function back to the SPE with
				112	an ordinary function return. So usually, once a shared function is called in the
				113	call path, all further functions in the call chain will be shared as well.
				114	However, this is not always the case, as it is possible for a shared function to
				115	call a non-shared function in SPE code through a global function pointer.
				116
				117	For shared global variables, a dedicated data section must be allocated in the
				118	linker configuration file. This area must have the same memory address in both
				119	MCUboot's and the SPE's linker files, to ensure the integrity of the variables.
				120	For simplicity's sake, this section is placed at the very beginning of the RAM
				121	area. Also, the RAM wiping functionality at the end of the secure boot flow
				122	(that is intended to remove any possible secrets from the RAM) must not clear
				123	this area. Furthermore, it must be ensured that the linker places shared globals
				124	into this data section. There are two way to achieve this:
				125
				126	- Put a filter pattern in the section body that matches the shared global
				127	variables.
				128	- Mark the global variables in the source code with special attribute
				129	`__attribute__((section(<NAME_OF_SHARED_SYMBOL_SECTION>)))`
				130
				131	RAM memory layout in MCUboot with code sharing enabled:
				132
				133	.. code-block:: bash
				134
				135	+------------------+
				136	\| Shared symbols \|
				137	+------------------+
				138	\| Shared boot data \|
				139	+------------------+
				140	\| Data \|
				141	+------------------+
				142	\| Stack (MSP) \|
				143	+------------------+
				144	\| Heap \|
				145	+------------------+
				146
				147	RAM memory layout in SPE with code sharing enabled:
				148
				149	.. code-block:: bash
				150
				151	+-------------------+
				152	\| Shared symbols \|
				153	+-------------------+
				154	\| Shared boot data \|
				155	+-------------------+
				156	\| Stack (MSP) \|
				157	+-------------------+
				158	\| Stack (PSP) \|
				159	+-------------------+
				160	\| Partition X Data \|
				161	+-------------------+
				162	\| Partition X Stack \|
				163	+-------------------+
				164	.
				165	.
				166	.
				167	+-------------------+
				168	\| Partition Z Data \|
				169	+-------------------+
				170	\| Partition Z Stack \|
				171	+-------------------+
				172	\| PRoT Data \|
				173	+-------------------+
				174	\| Heap \|
				175	+-------------------+
				176
Nicola Mazzucato	afd24bb	2024-02-14 17:27:27 +0000	[diff] [blame]	177	Patching Mbed TLS
				178	=================
Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	179	In order to share some global function pointers from mbed-crypto that are
				180	related to dynamic memory allocation, their scope must be extended from private
				181	to global. This is needed because some compiler toolchain only extract the
				182	addresses of public functions and global variables, and extraction of addresses
				183	is a requirement to share them among binaries. Therefore, a short patch was
				184	created for the mbed-crypto library, which "globalises" these function pointers:
				185
Anton Komlev	fb83540	2022-08-09 13:04:04 +0100	[diff] [blame]	186	`lib/ext/mbedcrypto/0002-Enable-crypto-code-sharing-between-independent-binar.patch`
Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	187
Nicola Mazzucato	afd24bb	2024-02-14 17:27:27 +0000	[diff] [blame]	188	The patch needs to be manually applied in the Mbed TLS repo, if code sharing is
Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	189	enabled. The patch has no effect on the functional behaviour of the
				190	cryptographic library, it only extends the scope of some variables.
				191
				192	*************
				193	Tools support
				194	*************
				195	All the currently supported compilers provide a way to achieve the above
				196	objectives. However, there is no standard way, which means that the code sharing
				197	functionality must be implemented on a per compiler basis. The following steps
				198	are needed:
				199
				200	- Extraction of the addresses of all global symbols.
				201	- The filtering out of the addresses of symbols that aren't shared. The goal is
				202	to not need to list all the shared symbols by name. Only a simple pattern
				203	has to be provided, which matches the beginning of the symbol's name.
				204	Matching symbols will be shared. Examples are in :
Anton Komlev	fb83540	2022-08-09 13:04:04 +0100	[diff] [blame]	205	`bl2/shared_symbol_template.txt`
Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	206	- Provision of the addresses of shared symbols to the linker during the SPE
				207	build process.
				208	- The resolution of symbol collisions during SPE linking. Because mbed-crypto
				209	is linked to both firmware components as a static library, the external
				210	shared symbols will conflict with the same symbols found within it. In order
				211	to prioritize the external symbol, the symbol with the same name in
				212	mbed-crypto must be marked as weak in the symbol table.
				213
				214	The above functionalities are implemented in the toolchain specific CMake files:
				215
				216	- `toolchain_ARMCLANG.cmake`
				217	- `toolchain_GNUARM.cmake`
				218
				219	By the following two functions:
				220
Anton Komlev	fb83540	2022-08-09 13:04:04 +0100	[diff] [blame]	221	- `target_share_symbols()`: Extract and filter shared symbol addresses
Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	222	from MCUboot.
Anton Komlev	fb83540	2022-08-09 13:04:04 +0100	[diff] [blame]	223	- `target_link_shared_code()`: Link shared code to the SPE and resolve symbol
Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	224	conflict issues.
				225
				226	ARMCLANG
				227	========
				228	The toolchain specific steps are:
				229
				230	- Extract all symbols from MCUboot: add `-symdefs` to the compiler command line
				231	- Filter shared symbols: call CMake script `FilterSharedSymbols.cmake`
				232	- Weaken duplicated (shared) symbols in the mbed-crypto static library that are
				233	linked to the SPE: `arm-none-eabi-objcopy`
				234	- Link shared code to SPE: Add the filtered output of `-symdefs` to the SPE
				235	source file list.
				236
				237	GNUARM
				238	======
				239	The toolchain specific steps are:
				240
				241	- Extract all symbols from MCUboot: `arm-none-eabi-nm`
				242	- Filter shared symbols: call CMake script: `FilterSharedSymbols.cmake`
				243	- Strip unshared code from MCUboot: `arm-none-eabi-strip`
				244	- Weaken duplicated (shared) symbols in the mbed-crypto static library that are
				245	linked to the SPE: `arm-none-eabi-objcopy`
				246	- Link shared code to SPE: Add `-Wl -R <SHARED_STRIPPED_CODE.axf>` to the
				247	compiler command line
				248
				249	IAR
				250	===
				251	Functionality currently not implemented, but the toolchain supports doing it.
				252
				253	**************************
				254	Memory footprint reduction
				255	**************************
				256	Build type: MinSizeRel
				257	Platform: mps2/an521
				258	Version: TF-Mv1.2.0 + code sharing patches
				259	MCUboot image encryption support is disabled.
				260
				261	+------------------+-------------------+-------------------+-------------------+
				262	\| \| ConfigDefault \| ConfigProfile-M \| ConfigProfile-S \|
				263	+------------------+----------+--------+----------+--------+----------+--------+
				264	\| \| ARMCLANG \| GNUARM \| ARMCLANG \| GNUARM \| ARMCLANG \| GNUARM \|
				265	+------------------+----------+--------+----------+--------+----------+--------+
Anton Komlev	fb83540	2022-08-09 13:04:04 +0100	[diff] [blame]	266	\| CODE_SHARING=OFF \| 122268 \| 124572 \| 75936 \| 75996 \| 50336 \| 50224 \|
Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	267	+------------------+----------+--------+----------+--------+----------+--------+
Anton Komlev	fb83540	2022-08-09 13:04:04 +0100	[diff] [blame]	268	\| CODE_SHARING=ON \| 113264 \| 115500 \| 70400 \| 70336 \| 48840 \| 48988 \|
Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	269	+------------------+----------+--------+----------+--------+----------+--------+
Anton Komlev	fb83540	2022-08-09 13:04:04 +0100	[diff] [blame]	270	\| Difference \| 9004 \| 9072 \| 5536 \| 5660 \| 1496 \| 1236 \|
Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	271	+------------------+----------+--------+----------+--------+----------+--------+
				272
				273	If MCUboot image encryption support is enabled then saving could be up to
				274	~13-15KB.
				275
				276	.. Note::
				277
				278	Code sharing on Musca-B1 was tested only with SW only crypto, so crypto
				279	hardware acceleration must be turned off: -DCRYPTO_HW_ACCELERATOR=OFF
				280
				281
				282	*************************
Antonio de Angelis	9d496a5	2025-01-07 21:18:00 +0000	[diff] [blame]	283	Usability considerations
Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	284	*************************
				285	Functions that only use local variables can be shared easily. However, functions
				286	that rely on global variables are a bit tricky. They can still be shared, but
				287	all global variables must be placed in the shared symbol section, to prevent
				288	overwriting and to enable the retention of their values.
				289
				290	Some global variables might need to be reinitialised to their original values by
				291	runtime firmware, if they have been used by the bootloader, but need to have
				292	their original value when runtime firmware starts to use them. If so, the
				293	reinitialising functionality must be implemented explicitly, because the low
				294	level startup code in the SPE does not initialise the shared variables, which
				295	means they retain their value after MCUboot stops running.
				296
				297	If a bug is discovered in the shared code, it cannot be fixed with a firmware
				298	upgrade, if the bootloader code is immutable. If this is the case, disabling
				299	code sharing might be a solution, as the new runtime firmware could contain the
				300	fixed code instead of relying on the unfixed shared code. However, this would
				301	increase code footprint.
				302
				303	API backward compatibility also can be an issue. If the API has changed in newer
				304	version of the shared code. Then new code cannot rely on the shared version.
				305	The changed code and all the other shared code where it is referenced from must
				306	be ignored and the updated version of the functions must be compiled in the
Nicola Mazzucato	afd24bb	2024-02-14 17:27:27 +0000	[diff] [blame]	307	SPE binary. The Mbed TLS library is API compatible with its current version
Tamas Ban	8a7a551	2020-05-29 16:25:07 +0100	[diff] [blame]	308	(``v2.24.0``) since the ``mbedtls-2.7.0 release`` (2018-02-03).
				309
				310	To minimise the risk of incompatibility, use the same compiler flags to build
				311	both firmware components.
				312
				313	The artifacts of the shared code extraction steps must be preserved so as to
				314	remain available if new SPE firmware (that relies on shared code) is built and
				315	released. Those files are necessary to know the address of shared symbols when
				316	linking the SPE.
				317
				318	************************
				319	How to use code sharing?
				320	************************
				321	Considering the above, code sharing is an optional feature, which is disabled
				322	by default. It can be enabled from the command line with a compile time switch:
				323
				324	- `TFM_CODE_SHARING`: Set to `ON` to enable code sharing.
				325
				326	With the default settings, only the common part of the mbed-crypto library is
				327	shared, between MCUboot and the SPE. However, there might be other device
				328	specific code (e.g. device drivers) that could be shared. The shared
				329	cryptography code consists mainly of the SHA-256 algorithm, the `bignum` library
				330	and some RSA related functions. If image encryption support is enabled in
				331	MCUboot, then AES algorithms can be shared as well.
				332
				333	Sharing code between the SPE and an external project is possible, even if
				334	MCUboot isn't used as the bootloader. For example, a custom bootloader can also
				335	be built in such a way as to create the necessary artifacts to share some of its
				336	code with the SPE. The same artifacts must be created like the case of MCUboot:
				337
				338	- `shared_symbols_name.txt`: Contains the name of the shared symbols. Used by
				339	the script that prevents symbol collision.
				340	- `shared_symbols_address.txt`: Contains the type, address and name of shared
				341	symbols. Used by the linker when linking runtime SPE.
				342	- `shared_code.axf`: GNUARM specific. The stripped version of the firmware
				343	component, only contains the shared code. It is used by the linker when
				344	linking the SPE.
				345
				346	.. Note::
				347
				348	The artifacts of the shared code extraction steps must be preserved to be
				349	able to link them to any future SPE version.
				350
				351	When an external project is sharing code with the SPE, the `SHARED_CODE_PATH`
				352	compile time switch must be set to the path of the artifacts mentioned above.
				353
				354	********************
				355	Further improvements
				356	********************
				357	This design focuses only on sharing the cryptography code. However, other code
				358	could be shared as well. Some possibilities:
				359
				360	- Flash driver
				361	- Serial driver
				362	- Image metadata parsing code
				363	- etc.
				364
				365	--------------
				366
Nicola Mazzucato	afd24bb	2024-02-14 17:27:27 +0000	[diff] [blame]	367	Copyright (c) 2020-2024, Arm Limited. All rights reserved.