This project aims to provide a modular automation toolchain to analyze current state and over-time changes of reproducibility of build artifacts derived from the Android Open Source Project (AOSP). Our framework builds AOSP in its native build system, automatically compares artifacts, and computes difference scores. As reference builds we currently support
- factory images for phones by Google, and
- generic system images as provided by Android CI.
This project enables the broader Android community, as well as any interested third parties, to track differences between AOSP and official Google builds. Furthermore, it can act as basis for future changes improving the reproducibility of Android.
SOAP uses Docker to ensure a consistent environment for the build and analysis of AOSP artifacts and enables support for a wide range of Android versions. Our framework supports two workflows for running the build and analysis stages:
- you can invoke the master shell scripts directly, resulting in the execution of SOAP directly in a terminal, or
- you may setup a Jenkins server which facilitates easier monitoring.
In all cases, make sure you perform the following shared setup:
- Your host environment requires a working Docker installation, refer to the official install instructions for guidance.
- Ensure that you have at least a basic git configuration in your home directory (
~/.gitconfig
), this is required to check out the AOSP source code. - Acquire a copy of this repository.
- Build the docker images by invoking all
build-docker-image.sh
shell scripts found in thedocker
directory in this repository. (However, the working directory needs to be the root of the repository).
You can invoke one of the following master scripts performing the AOSP build and SOAP analysis process (no additional setup steps necessary):
run_device_fixed.sh <AOSP_REF> <BUILD_ID> <DEVICE_CODENAME> <RB_BUILD_TARGET> <GOOGLE_BUILD_TARGET>
: Build an AOSP (Android 7+) device target and compare against the matching Google factory image.run_device-legacy_fixed.sh <AOSP_REF> <BUILD_ID> <DEVICE_CODENAME> <DEVICE_CODENAME_FACTORY_IMAGE> <RB_BUILD_TARGET> <GOOGLE_BUILD_TARGET>
: Build an AOSP (Android 5-6) device target and compare against the matching Google factory image.run-generic_fixed.sh <BUILD_NUMBER> <BUILD_TARGET>
: Build an AOSP generic system image (GSI) and compare against the GSI build from Google.run-generic_latest.sh <BUILD_TARGET>
: Build an AOSP generic system image (GSI) and compare against the GSI build from Google. TheBUILD_NUMBER
parameter is the latest valid build from the Android CI dashboard.
Note: This is is only needed if you intend to run the build workflows in Jenkins instead of invoking the master shell scripts directly.
- Follow the official install instructions for Jenkins.
- Create the parameterized Jenkins pipelines via the GUI, refer to the
.Jenkinsfile
files for the build scripts and be sure to create all parameters required for them. - Adjust the imported pipeline scripts. Specifically, in the environment section perform the following changes:
- Ensure that
RB_AOSP_BASE
points to a location with sufficient space (see Hardware requirements).
- Ensure that
You can then invoke one of the following master scripts performing the AOSP build and SOAP analysis process:
run_device_jenkins_fixed.sh
: Build an AOSP (Android 7+) device target and compare against the matching Google factory image. Parameters are hardcoded in the script, change as needed.run_device-legacy_jenkins_fixed.sh
: Build an AOSP (Android 5-6) device target and compare against the matching Google factory image. Parameters are hardcoded in the script, change as needed.run_generic_jenkins_fixed.sh
: Build an AOSP generic system image (GSI) and compare against the GSI build from Google. Parameters are hardcoded in the script, change as needed.run_generic_jenkins_latest.sh
: Build an AOSP generic system image (GSI) and compare against the GSI build from Google. TheBUILD_NUMBER
parameter is the latest valid build from the Android CI dashboard, all other other parameters are hardcoded.
AOSP_REF
: Branch or tag in the AOSP codebase, refer to source code tags and builds for a list. Specifically refer to the "Tag" column, e.g.android-10.0.0_r40
.BUILD_ID
: Specific version of AOSP, corresponds to a certain tag. Refer to the "Build" column in the same source code tags and builds table, e.g.QQ3A.200705.002
.DEVICE_CODENAME
: Internal code name for the device, see the Fastboot instructions for a list mapping branding names to the codenames. For example, the "Pixel 3 XL" branding name has the codenamecrosshatch
.DEVICE_CODENAME_FACTORY_IMAGE
: Alternative internal code name for device used in factory image list, see the Google factory images for a list. E.g. the "Nexus 10" has theDEVICE_CODENAME
manta
in the build tooling (e.g.lunch
command), but theDEVICE_CODENAME_FACTORY_IMAGE
ismantaray
.RB_BUILD_TARGET
: Our build target as chosen inlunch
(a helper function of AOSP), consisting of a tuple (TARGET_PRODUCT
andTARGET_BUILD_VARIANT
) combined via a dash. The choose a target section provides documentation for these. AOSP offersTARGET_PRODUCT
values for each device with anaosp_
prefix. E.g. a release build of the previously mentioned "Pixel 3 XL" device would be defined asaosp_crosshatch-user
.GOOGLE_BUILD_TARGET
: Very similar toRB_BUILD_TARGET
, except that this represents Google's build target. Note that factory images provided by Google use a non-AOSPTARGET_PRODUCT
variable. E.g. a release build by Google for our running example would be defined ascrosshatch-user
.
BUILD_NUMBER
: Build number used by Google in the Android CI Dashboard.BUILD_TARGET
: Build target consisting of a tuple (TARGET_PRODUCT
andTARGET_BUILD_VARIANT
) combined via a dash. Refer to the Android CI Dashboard for available GSI targets.
Once the build and subsequent analysis finishes, one can find the uncovered differences in ${RB_AOSP_BASE}/diff
in a folder named according to your build parameters. Optionally, one can set the environment variable RB_AOSP_BASE
ahead of the master script invocation to tell SOAP the base location where all (temporary and non-temporary) files should be placed under. If not specified, it defaults to ${HOME}/aosp
. Refer to Hardware requirements for disk space demand.
When navigating the SOAP reports with a browser, start at ${RB_AOSP_BASE}/diff/reports-overview.html
. The links contain the AOSP source tag (or Google build id), build target and build environment for both the reference AOSP build and ours. For each of these we provide the following reports:
- A detailed HTML report showing difference listings for all artifacts that exhibit variations.
- CSV reports summarizing the number of change lines for all artifacts via the tool diffstat. In a post-processing step, these CSV reports are cleaned from expected accountable changes that we deliberately let slip through into the difference reports, which we refer to as diff score.
- The individual CSV reports of each artifact are further aggregated into a single change summary report. Besides accumulated change lines, the individual CSV reports are also used as the basis to calculate a weight score that describes the relative amount of changes with regard to the overall artifact size.
- The final quantitative metrics are also visualized in a hierarchical treemap for improved navigation through detailed difference reports.
Problem:
++ virt-filesystems -a <disk-image>.img
libguestfs: error: /usr/bin/supermin exited with error status 1.
Our scripts use libguestfs
(just like diffoscope internally) to access ext4 file system images without requiring root permissions.
Internally libguestfs boots a small QEMU VM, which requires world read access to the currently active kernel image in /boot/
.
Ubuntu does not permit non-root users to read the local kernel images (see this bug).
A permanent fix (for all kernels installed through future updates) is the creation of following hook under /etc/kernel/postinst.d/statoverride
(based on a solution proposed in the bug report):
#!/bin/sh
version="$1"
# passing the kernel version is required
[ -z "${version}" ] && exit 0
dpkg-statoverride --update --add root root 0644 "/boot/vmlinuz-${version}"
In addition, the currenly installed kernel image can be made world-readable through
sudo chmod +r "/boot/vmlinuz-"*
Note that, in general, it is strongly recommended that you use the permanent fix, otherwise you'll have to fix the kernel permissions after each kernel update.
Our toolchain currently does not support running the snap version of Docker. If you use the snap package, please remove it first and install Docker according to the official install instructions at https://docs.docker.com/engine/install/. You can remove the snap package with
sudo snap remove docker
Problem:
WARNING: Could not write to <tmp-dir>, using /tmp instead
Certain versions of APKtool (we experienced this with version 2.3.4) do not run if their local temp directory does not exist (see this bug for details).
As a solution, create the following directory path for the user running the SOAP scripts:
mkdir -p "${HOME}/.local/share/apktool/framework"
Project by Johannes Kepler University Linz, Institute of Networks and Security. Details can be found at https://android.ins.jku.at/reproducible-builds/. SOAP reports can be found here. This project has originally been started by Manuel Pöll as his bachelor thesis project.
Detailed design decisions, results, and interpretations can be found in our paper and in the initial bachelor thesis:
- Manuel Pöll and Michael Roland: "Automating the Quantitative Analysis of Reproducibility for Build Artifacts derived from the Android Open Source Project", in WiSec '22: 15th ACM Conference on Security and Privacy in Wireless and Mobile Networks, San Antonio, TX, USA, ACM, 2022. DOI 10.1145/3507657.3528537 (accepted for publication)
- Manuel Pöll: "An Investigation into Reproducible Builds for AOSP", Bachelor's thesis, Johannes Kepler University Linz, Institute of Networks and Security, 2020.
This work has been carried out within the scope of Digidow, the Christian Doppler Laboratory for Private Digital Authentication in the Physical World and has partially been supported by ONCE (FFG grant FO999887054 in the program "IKT der Zukunft") and the LIT Secure and Correct Systems Lab. We gratefully acknowledge financial support by the Austrian Federal Ministry for Digital and Economic Affairs (BMDW), the Austrian Federal Ministry for Climate Action, Environment, Energy, Mobility, Innovation and Technology (BMK), the National Foundation for Research, Technology and Development, the Christian Doppler Research Association, 3 Banken IT GmbH, ekey biometric systems GmbH, Kepler Universitätsklinikum GmbH, NXP Semiconductors Austria GmbH & Co KG, Österreichische Staatsdruckerei GmbH, and the State of Upper Austria.
Copyright (C) 2021 Johannes Kepler University Linz, Institute of Networks and Security
Copyright (C) 2020 Manuel Pöll
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at https://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.