← Back

Chiral ruthenium complexes induce apoptosis of tumor cell and interact with bovine serum albumin.

PMID: 22180296
TECHNISCHE UNIVERSITÄT MÜNCHEN TUM School of Life Sciences Studying in vitro-differentiated hepatocytes by proteomics Johannes Krumm Vollständiger Abdruck der von der TUM School of Life Sciences der Technischen Universität München zur Erlangung des akademischen Grades eines Doktors der Naturwissenschaften genehmigten Dissertation. Vorsitzender: Prof. Angelika Schnieke, Ph.D Prüfer der Dissertation: 1. Prof. Dr. Bernhard Küster 2. Prof. Dr. Maximilian Reichert Die Dissertation wurde am 01.02.2022 bei der Technischen Universität München eingereicht und durch die TUM School of Life Sciences am 28.06.2022 angenommen. Abstract Mass spectrometry-based proteomcis is a powerful tool for acquiring comprehensive and unbiased expression information on thousands of proteins. However, high sample amounts are required in order to achieve an adequate depth. Hence, in this study the suitability of several proteomics approaches to start from quantity-limited samples was evaluated. The most promising methodologies were further fine-tuned and their usefulness for low-input samples was proven. Optimizing the in-StageTip method yielded around 10,000 peptides from only 2,000 cells, which is 5 times more than with the original protocol. Only the SP3 approach, which is based on the precipitation of proteins to magnetic beads, was more sensitive. In addition, the latter enables the removal of detergents and demonstrated great compatibility with the chemical Tandem Mass Tag labeling approach. In this regard, the identification and quantification quality of several other magnetic microparticles was assessed, including IMAC-beads for phosphopeptide enrichment. In the second part of this work, hepatocytes generated from human pluripotent stem cells were studied. The in vitro differentiated hepatocyte-like cells are increasingly discussed as potential alternatives for primary hepatocytes, which are scarce, show high donor variability, and quickly change their properties upon culturing. Especially in the field of regenerative medicine and drug discovery, the pluripotent stem cell-derived hepatocytes could be of great value. However, the underlying differentiation process is incompletely characterized. Therefore, quantitative proteomics experiments at multiple time points during the differentiation were employed resulting in the detection of around 9,000 proteins, 12,000 phosphorylation sites, and 800 acetylation sites. The expression dynamics revealed a major protein rewiring between the hepatic endoderm and immature hepatocyte-like cells. While metabolic and cell surface proteins were upregulated, levels of cell cycle-related proteins, epigenetic modifiers, and transcription factors diminished. Furthermore, the high temporal resolution enabled to define novel and specific protein markers for each development stage. In a complementary comprehensive (phospho)proteomic experiment, 2D and 3D pluripotent stem cell-derived hepatocytes were compared against fetal and adult liver samples to elaborate on how well they can recapitulate their in vivo counterpart. Indeed, the 3D model was superior to the monolayer approach in expressing several liver-specific proteins. However, both differentiation derivatives lacked the expression of sufficiently high levels of ADME/Tox proteins, which are crucial for metabolising therapeutic drugs in the liver. Moreover, the phosphorylation data indicated divergent kinase activity, suggesting that the stem cell-derived hepatocytes resemble a rather fetal state. Based on the obtained results a molecular roadmap of hepatocyte differentiation was constructed, which enhances the understanding of the underlying biology and serves as a proteomics resource. Additionally, several starting points for differentiation protocol improvements were inferred from the collective data. In summary, this study demonstrated the versatile application of an optimized SP3 beads-based proteomic workflow, specifically for minute protein quantities. Furthermore, the analysis of protein expression and post-translational modifications was utilized to shed light on i|P age developmental processess and to provide recommendations for protocol modifications of pluripotent stem cell-derived hepatocyte differentiation. ii | P a g e Zusammenfassung Die Massenspektrometriebasierte Proteomik ist eine leistungsfähige Methode zur umfassenden und unvoreingenommenen Expressionsmessung von Tausenden Proteinen. Allerdings ist für das Erreichen einer solch hinreichenden Tiefe eine große Probenmenge nötig. Daher wurden in dieser Arbeit mehrere proteomische Ansätze hinsichtlich ihrer Eignung im Bezug auf Proben mit begrenzter Menge untersucht. Die vielversprechendsten Methoden wurden weiter verfeinert und ihre Nützlichkeit für Proben mit geringem Materialeinsatz wurde nachgewiesen. Durch die Optimierung der in-StageTip Methode wurden rund 10.000 Peptide aus nur 2.000 Zellen gemessen, was einer Verfünffachung gegenüber des originalen Protokolls entspricht. Lediglich der SP3 Ansatz, welcher auf der Präzipitation von Proteinen auf magnetische Kügelchen basiert, war noch sensitiver. Zusätzlich ermöglicht Letzterer die Beseitigung von Detergenzien und zeigte gute Kompatibilität mit der Tandem Mass Tag Markierung. In diesem Zusammenhang wurde die Qualität der Identifizierung und Quantifizierung verschiedener anderer magnetischer Mikropartikel getestet, einschließlich IMAC-Kügelchen die für die Anreicherung von Phosphopeptiden verwendet werden. Im zweiten Teil dieser Arbeit wurden aus menschlichen pluripotenten Stammzellen hergestellte Hepatozyten untersucht. Diese in vitro differenzierten Hepatozytenähnlichen Zellen werden vermehrt als mögliche Alternativen für primäre Zellen diskutiert, da diese rar sind, eine hohe Variabilität zwischen Spendern aufweisen und schnell ihre Eigenschaften in Kultur verlieren. Insbesondere im Bereich der regenerativen Medizin und der Arzneimittelforschung könnten diese Hepatozyten aus pluripotenten Stammzellen von großen Wert sein. Allerdings ist deren zu Grunde liegender Differenzierungsprozess unzureichend charakterisiert. Daher wurde ein quantitatives proteomisches Experiment mit mehreren Zeitpunkten während der Differenzierung durchgeführt was zu 9.000 Proteinen, 12.000 Phosphorylierungsstellen und 800 Acetylierungsstellen führte. Die Dynamik der Expression zeigte starke Veränderungen zwischen dem hepatischen Endoderm und den unreifen Hepatozytenähnlichen Zellen. Während Stoffwechsel und Zelloberflächenproteine anstiegen, nahmen die Expressionslevel von Zellzyklusproteinen, epigentischen Modifikatoren und Transkriptionsfaktoren ab. Darüber hinaus konnten durch die hohe zeitliche Auflösung neue und spezifischen Proteinmarker für jedes Entwicklungsstadium definiert werden. In einem ergänzenden umfangreichen (Phospho)proteom Experiment wurden aus pluripotenten Stammzellen gewonnene 2D- und 3D-Hepatozyten mit fötalen und adulten Leberproben verglichen, um zu untersuchen, wie gut diese ihre in vivo Gegenstücke darstellen können. Tatsächlich war das 3D-Model dem Monolayer-Ansatz bei der Expression mehrerer leberspezifischer Proteine überlegen. Jedoch zeigten beide Differenzierungsderivate keine ausreichend hohe Expression von ADME/Tox Proteinen, welche essentiell für die Verstoffwechselung von therapeutischen Medikamenten in der Leber sind. Zudem deuteten die Phosphorylierungsdaten auf abweichende Kinaseaktivität hin und suggerierten, dass die aus Stammzellen hergestellten Hepatozyten eher ein fötales Stadium darstellen. Auf der Grundlage dieser Ergebnisse wurde eine molekulare Roadmap für Hepatozytendifferenzierung erstellt, welche sowohl das Verständnis der zugrunde liegenden Biologie verbessert, als auch als iii | P a g e proteomische Resource dient. Außerdem wurden aus den gesammelten Daten einige Ansatzpunkte für Verbesserungen des Differenzierungsprotokolls abgeleitet. Zusammenfassend lässt sich sagen, dass diese Arbeit die vielfältige Anwendung eines optimierten, auf SP3 Kügelchen-basierenden Arbeitsablauf für die Proteomik demonstriert, insbesondere für kleinste Proteinmengen. Darüber hinaus wurde die Analyse von Proteinexpression und posttranslationaler Modifikationen genutzt, um Aufschluss über Entwicklungsprozesse zu geben und Empfehlungen für Protokolländerungen bei der Differenzierung von Hepatozyten aus pluripotenten Stammzellen zu geben. iv | P a g e Table of contents Abstract ....................................................................................................................................... i Zusammenfassung ..................................................................................................................... iii Table of contents ........................................................................................................................ v Chapter I: General introduction ................................................................................................... 1 Chapter II: Optimizing sample preparation workflows for proteomics ....................................... 27 Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation .......... 51 Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples................................... 81 Chapter V: General discussion and outlook ............................................................................... 99 References .............................................................................................................................. 107 List of Abbreviations................................................................................................................ 131 List of Figures .......................................................................................................................... 133 Appendix...................................................................................................................................... I Danksagungen........................................................................................................................... VII List of publications ..................................................................................................................... IX v|P ag e Chapter I: General introduction 1 Stem cells and their value for research ................................................................................. 3 1.1 Stem cell properties and classification ........................................................................... 3 1.2 Directed stem cell differentiation................................................................................... 5 2 Stem cell differentiation towards hepatocytes ...................................................................... 8 2.1 Liver: Physiology and embryonic development .............................................................. 8 2.2 Hepatocyte differentiation and applications ................................................................ 10 3 Proteomics for studying developmental processes ............................................................. 15 3.1 Bottom-up sample preparation .................................................................................... 15 3.2 Basics of bottom-up proteomics .................................................................................. 18 3.3 Mass spectrometry data analysis ................................................................................. 21 4 Objectives and Outline ....................................................................................................... 24 1|P a ge Chapter I: General Introduction 2|P a ge Chapter I: General Introduction 1 Stem cells and their value for research 1.1 Stem cell properties and classification Stem cells are rapidly dividing cells which are capable of self-renewal over a long period of time and have the potential to differentiate into various different cell types. Depending on their differentiation potency, they can be classified into four groups (I-Figure 1; as reviewed in [1]). Totipotent stem cells Totipotent stem cells comprise the highest differentiation potential of all stem cells. They are capable of developing into any cell type and are therefore the only cells that can form an entire organism (reviewed in [2]). In the human body a totipotent cell, the zygote, is formed through the fusion of a spermatozoon with an oocyte. During further development, the zygote undergoes cell division which results in a decrease of potency. After approximately 4-5 days, the zygote specifies into the inner cell mass (ICM) and the extra-embryonic cell lineage. While the extra-embryonic cell lineage matures further to the placenta, the ICM gives rise to the embryonic lineage and possesses only pluripotent characteristics. The exact time and cell stage at which totipotency is lost is still controversial and challenging to study as the number of totipotent cells and their accessibility is limited (reviewed in [3]). Pluripotent stem cells The second most potent cells are pluripotent stem cells (PSCs). They are capable of differentiating into each of the three germ layers of the embryonic lineage, but are not able to form extraembryonic structures (reviewed in [4]). Human embryonic stem cells (hESCs) are one example of pluripotent cells. They can be isolated from the ICM and cultured in vitro as first shown by Thompson et al. [5] as well as Reubinoff et al. [6]. However, the hESCs research is challenging as cells are scarce and their use is strictly regulated. Some of these challenges were overcome by the discovery of induced pluripotent stem cells (iPSCs) in 2006, which was a scientific breakthrough that was rewared with the Nobel Price in 2012. Yamanaka and colleagues were able to regain pluripotency from terminally differentiated somatic cells by the transduction of four transcription factors: Oct4, Sox2, Klf4, and c-Myc [7, 8]. While in the beginning iPSCs were mostly generated from fibroblasts, nowadays a variety of sources exist, such as blood cells or keratinocytes [9-11]. Another source are cells from the urinary system which are excreted in the urine. This is of special interest, because it is a noninvasive method with almost unlimited capacities [12]. Urine-derived iPSCs showed the potential to differentiate into cardiomyocytes [13] and neural progenitors [14] indicating a promising and valuable resource for iPSCs. In the original reprogramming protocol of Yamanaka and colleagues retroviral transduction was utilized to deliver the four crucial transcriptions factors [7]. As this method is prone for introducing genetic alterations, several alternatives were developed (reviewed in [15, 16]). For example, transduction with an adenovirus which enables iPSC generation of fibroblasts without transgenic integration [17]. One of the most commonly used systems for generating iPSCs at the moment is the Sendai virus vector [18], which is non-integrating and capable of infecting a broad range of host cells with a high efficiency [19]. Another alternative is the non-viral piggyback transposon system which is able to integrate the 3|P a ge Chapter I: General Introduction reprogramming factors into somatic cells [20] and appears to be a promising method, but has shown lower efficiency than virus-based methods so far [21]. In general, non-integrating and nonviral vectors with high reprogramming efficiency are desirable in order to make this method applicabale for medical administration. As iPSCs can be harvested and derived from patients, not only are the ethical concerns lower, but also the rejection of the immune system is decreased. Despite the great potential to renew or replace damaged patient tissues, no iPSC-based therapy has yet made its way into routine clinical applications, although several are in clinical trials [22]. For their routine application, the remaining challenges such as carcinogeneity [23] and genomic instability [24] have to be overcome. Multipotent stem cells Multipotent stem cells have less potency compared to PSCs, but still possess important stem cell characteristics such as high self-renewal capacity and the ability to specialize into several different cells. One example are mesenchymal stem cells, which are adult stem cells that can be obtained from various different origins, like the bone marrow or the umbilical cord [25, 26]. Mesenchymal stem cells can, for example, differentiate into cells from the mesodermal lineage such, adipocytes [27], osteocytes [28], or cartilage [29]. Furthermore, some studies postulated the differentiation of mesenchymal stem cells into the endodermal [30, 31] and ectodermal [32] lineage. However, this differentiation outside of the mesenchymal lineage is widely debated [33, 34]. Nevertheless, mesenchymal stem cells are considered as a promising source for regenerative medicine for example for cartilage repair or bone regeneration (as reviewed in [35]). Unipotent stem cells Apart from multipotent stem cells, unipotent stem cells comprise the second class of adult stem cells. They are characterized by the least differentiation potency and are only able to generate one specific cell type. For example, male germline stem cells are unipotent precursors which exclusively differentiate into sperm cells. However, studies have shown that these unipotent stem cells can be dedifferentiated upon culturing with growth factors to gain pluripotency [36, 37]. Compared to iPSCs and ESCs, pluripotent cells obtained from germline stem cells suffer from less ethical concerns, a decreased risk of tumorigenesis, and diminished immune rejection (reviewed in [38]). 4|P a ge Chapter I: General Introduction I-Figure 1: Schematic of stem cell classification and their differentiation potency. Modified with permission from Berdasco et al. [39] and created with BioRender.com. 1.2 Directed stem cell differentiation The potency of stem cells to specify into different cell lineages can be used and controlled in a process called directed differentiation. This process enables to mimic and study some of the fundamental developing steps of embryogenesis in vitro. Knowing the fundamental developmental mechanism, allows to spot errors in this process and explain the resulting malfunctions or diseases. Besides studying the underlying basic biology, directed cell differentiation offers a promising approach for several clinical applications like regenerative medicine or personalized medicine as well as for drug discovery. Directed differentiation of pluripotent stem cells in vitro In order to direct cell fate, the knowledge of essential developmental pathways, key transcription factors, or other crucial proteins is required. Based on this expertise, recombinant proteins and small molecules, which are both powerful tools, can be utilized to manipulate cell differentiation. Although the application of recombinant proteins has been successful in recent years, it is associated with high costs and batch-to-batch variations. Since chemical molecules are usually more stable, cheaper, and non-immunogenic, they might be the go-to tool in the future. In order to accomplish directed differentiation from PSCs, the first step comprises germ layer induction using growth factors [40]. Once the desired germ layer is formed, additional supplementation of specific proteins and small molecules enable further cell lineage specification. This process usually needs extensive optimization, as the right concentration, duration, and timing 5|P a ge Chapter I: General Introduction of the supplemented reagents is essential. Many protocols are developed for just one specific pluripotent cell line and the adaption to a different cell line can be tedious. Moreover, further modifications are usually required when the differentiation protocol was developed in a different organism. Although in vitro differentiated cells can resemble many functions of their in vivo counterparts, they very often represent rather immature characteristics. One reason for this is the varying differentiation capacity observed between cell lines as well as between clones from the same cell line, which leads to a heterogenous cell population rather than a very specific cell type [41]. Another reason is that the cell development in vitro usually only takes a fraction of the time compared to the developmental process in a living organism. Organoid models and their applications Traditionally, in vitro differentiation was performed in a petri dish and grown in two-dimensional (2D) monolayer. However, this experimental setup does not entirely recapitulate the in vivo situation, as for example cell-cell interactions are vastly diminished and thus, only a limited view of the developmental process is represented. To overcome some of these challenges, threedimensional (3D) organoid models have been developed, which are self-organized structures composed of one or more cell types. They resemble the in vivo conditions more closely and have been superior to 2D models in terms of cell-cell communication, viability, differentiation, cell polarization, and drug metabolism, just to name a few (as reviewed in [42, 43]). Stem cells have been used to generate functional organoids from all three germ layers [44]. For example, the development of a brain organoid system, called cerebral organoids, enabled to elucidate on human brain development, which has been difficult to assess before. By using organoids derived from PSCs, not only discrete brain regions were differentiated but also features of microcephaly were mimicked and a potential cause of this disease was derived [45]. Antonica and colleagues were able to generate functional thyroid follicles from ESCs in vitro, which had the capacity to organify iodide [46]. Upon transplantation into mice, thyroid hormone deficiency was overcome showing the potential impact of this powerful tool for regenerative medicine in the future [46]. Personaliced medicine is another research field with great capacity for the organoid technology. For example, it has been used for drugs targeting cystic fibrosis [47]. Here, iPSCs were generated from CF patients and further differentiated into cholangiocytes which matured through organoid formation. These organoids not only mimicked the CF disease, but further reproduced drug effects in vitro. In a second study, rectal organoids from biopsies of CF patients were derived [48]. The drug responses of this in vitro model positively correlated with data from clinical trials demonstrating the ability of this technology for predicting drug effects of individual patients. While personaliced medicine is applied to individual patients, drug discovery usually makes use of high-throughput screening of biologically active molecules. In pharma industry these screenings are often based on primary cells or animal models, but the organoid culture system might support or even substitute some of the current pipelines. To this regard, StemoniX® offers a platform of commercially available brain and heart organoids derived from iPSCs and developed for highthroughput applications. Further, a platform for colon organoids in a 384-well as well as 1536-well formats have been introduced for drug screening [49]. However, high-throughput applications have been challenging so far, because organoids are formed by self-organization which is a stochastic manner that is difficult to control. This decreases the reproducibility leading to batch6|P a ge Chapter I: General Introduction to-batch variations and has to be overcome in order to make large-scale screening on organoid models reasonable. Since the 3D culture system is still in its infancy, further optimization is required to exploit its full potential and to make it widely applicable. Transdifferentiation of somatic cells Besides the dedifferentiation of somatic cells to iPSC and their subsequent directed differentiation, transdifferentiation of somatic cells offers an alternative for generating specific cell types (reviewed in [50]). Here, somatic cells are directly reprogrammed into a different cell type using small molecules, transcription factors, or a combination of both. This was first shown in 1987, when Davis and colleagues converted fibroblast into muscle cells by the transfection of MyoD [51]. Then, in 2008 the first in vivo application was published where the combination of Ngn3, Pdx1, and Mafa resulted in the reprogramming of pancreatic exocrine cells into functional β-cells [52]. The directed transdifferentiation with small molecules offers a fast method without the requirement of viral transduction to generate specific cells types, while the risk of tumorigenesis is low. However, this method is limited by low differentiation efficiencies as well as the small number of cases where transdifferentiation was successful so far. Noteworthy, besides the artificially induced transdifferentiation, reprogramming and lineage transition has been observed naturally in zebrafish, where atrial cardiomyocytes converted into functional ventricular cardiomyocytes [53]. Post-transclational modifications and their function in regulating pluripotency and differentiation As shown by Yamanaka and colleagues the introduction of transcriptions factors enables the reprogramming of somatic cells into iPSCs, which possess the capacity to specify into several different cell types [7, 8]. This shows how crucial gene expression for the pluripotent state is. However, studies have shown varying repogramming and differentiation efficiencies among PSCs, which indicates that transcript levels are not solely decisive for cell fate decisions [41]. This seems obvious as many signaling pathways related to cell differentiation are regulated by posttranslational modifications (PTMs) (reviewed in [54]). For example, several studies have elucidated on the connection between pluripotency and PTMs, such as ubiquitination [55], sumoylation [56], glycosylation [57], and methylation [58]. Various different types of PTMs exist, however the majority rely on enzymatic modifications of amino acid side chains or the protein’s termini. By manipulating the biophysical properties of proteins, PTMs are capable of influencing gene transcription, enzymatic activity, protein translocation, protein degradation, or proteinprotein interactions, just to name a few [59]. Phosphorylation is not only one of the most commonly observed PTMs, but it is also involved in controlling various signaling pathways [60]. Phosphorylation is enzymatically controlled by kinases and phosphatases, which add or remove phosphate groups from hydroxyl groups of serine, threonine, and tyrosine amino acid residues. Transfer of this phosphate group changes the net charge of amino acids, which often results in a conformational change and altered protein characteristics. Many signaling pathways known for regulating cell differentiation and pluripotency are controlled through phosphorylation, e.g. via receptor tyrosine kinases such as FGFR. A large scale study performed by Swaney and colleagues detected more than 10,000 7|P a ge Chapter I: General Introduction phosphorylation sites (P-sites) in ESC and among them several on the two critical transcription factors for pluripotency SOX2 and OCT4 [61]. In a different study more than 50% of the detected P-sites were observed to be modified already after 1 hour of differentiation showing how rapid and vast this modification is [62]. To this regard, Yang and colleagues performed a multi-omics survey of ESCs transitioning from the naïve state to the primed state and found that changes on the phosphoproteome level precede changes on epigenome, transcriptome, and proteome [63]. This shows that phosphorylation possessess a crucial role in cell development, but further research is necessary in order to grasp the full picture. The acetylation of the lysine side chain ε-amino group is another reversible PTM with significant effects on protein characteristics. While lysine/histone acetyltransferases catalyze the transfer of acetyl groups from acetyl-CoA to lysine residues, lysine/histone deacetylases (KDAC/HDAC) are capable of reverting this process [64]. Although acetylation was first discovered only on histone proteins [65], acetylation on many non-histone proteins has been reported (as reviewed in [64]). The addition of an acetyl group removes the positive charge of the lysine residue which can have a significant impact on protein characteristics. For example, elevated histone acetylation is associated with increased chromatin accessibility and therefore higher gene transcription rates [66]. On the other hand, low levels of histone acetylation result in decreased chromatin accessibility and transcriptional hypoactivity [54]. Considering that in general pluripotency is associated with a high rate of transcriptional activity while differentiation is accompanied with decreased activity [67], this PTM is highly associated with cell development. To this regard, several studies have shown positive effects on the reprogramming efficiency of somatic cells using HDAC inhibitors which target histone proteins [21, 68, 69]. On the contrary, HDAC inhibitors have also been shown to support endoderm differentiation [70], improve the bone formation from mesenchymal stem cell [71], induce neural cell differentiation [72, 73], and support the expansion of hematopoietic stem cells [74]. This process gets even more complex when considering the timing component of this PTM. For example, high levels of acetylation support the acinus fate of pancreatic progenitors, whereas low levels promote ductal specification [75, 76]. Although the understanding so far is rather rudimentary, it could already be shown how closely pluripotency and acetylation levels are interwoven and we are just beginning to understand their connection. In summary, although the addition and modulation of transcription factors have shown to induce pluripotency and regulate cell differentiation, several studies have shown a considerable impact of PTMs in these processes. However, their influence is incompletely understood and a more detailed perception may be helpful for appreciating the differentiation better and improve future applications. 2 Stem cell differentiation towards hepatocytes 2.1 Liver: Physiology and embryonic development The liver is located in the right upper part of the abdomen and is the largest internal organ of the human body with approximately 1.6 kg. It consists of two unequally sized lobes and it is tightly connected to the blood system via the hepatic artery as well as the portal vein. 8|P a ge Chapter I: General Introduction Biological functions of the liver Around 70-80% of the liver mass can be accounted to hepatocytes, which is the main parenchymal cell type fulfilling most of the vital organ functions. One key feature of the liver is the regulation of the blood glucose level (as reviewed in [77]). In order to do this, hepatocytes surround the portal vein, which passes through the liver, and absorb nutrients coming from the gastrointestinal tract. Under carbohydrate-rich conditions, glucose is converted into glyocogen or fatty acids and stored. As the glucose level in the blood decreases, glycogen is enzymatically broken down to maintain the blood glucose level constant and to nourish other organs, such as the brain [77]. Once the glycogen storage is exhausted, the liver is able to convert precursors, such as lactate, glycerol, alanine, or glutamine, into glucose in a process called gluconeogenesis [78]. Apart from the carbohydrate metabolism, the majority of plasma proteins, such as serum albumin, are synthesized here [79]. The liver is also an important organ for lipid metabolism, as it produces cholesterol and degrades fatty acids. Excessive fatty acids which are not internally used for energy supply can be converted into ketone bodies by liver cells and distributed to other tissues for energy supply. The liver also produces bile and secretes it via the biliary tract into the gallbladder for storage. Another important feature of hepatocytes is the detoxification of molecules such as ammonia, a toxic product of amino acid catabolism, which can be converted into urea through the urea cycle and subsequently excreted. Furthermore, hepatocytes are capable of absorbing and metabolizing bioactive compounds from the blood. This is of special interest for drug discovery, as this process often leads to toxic intermediates which is one of the main causes for drugs failing in preclinical and clinical trials [80, 81]. Additionally, the absorption significantly affects a drugs half life and thus its efficacy. Apart from hepatocytes, the liver is composed of non-parenchymal cells including kupffer, stellate, and sinusoidal endothelial cells. The kupffer cells are macrophagic cells, which are capable of taking up particles or pathogens from their environment by phagocytosis. They can stimulate cytokine production, which makes them a crucial part of the innate immune system, but also a key player in disease states like inflammation or steatosis [82]. The hepatic stellate cells reside in a quiescent state and their functions are not fully understood. However, it is known that under chronical stress conditions, such as viral infections or alcohol abuse, stellate cells are activated and differentiate into myofibroblast, which are the main driver for liver fibrosis [83, 84]. A key regulator of stellate cell activation are the third type of non-parenchymal cells, the sinusoidal endothelial cells. An in vitro study showed that they are capable of keeping stellate cells in a quiescent state making them an important gatekeeper for liver fibrosis [85]. Moreover, sinusoidal endothelial cells comprise immunological functions, such as the elimination of pathogens, similar to kupffer cells [86]. Furthermore, the non-parenchymal liver cells maintain and regulate liver growth and participate in the control of hepatocyte proliferation [87]. Embryonic development of hepatocytes Embryonic development is initialized by the fertilization of a male and a female gamete resulting in a zygote. Through several rounds of mitotic cell division, the single fertilized egg forms a group of cells, the blastomere, during the first days. After 5-7 days, a blastocyst is formed which consists of an outer layer, the trophoectoderm which will later develop into the placenta, and the ICM, which will give rise to the embryo. This blastocyst implants into the uterus and starts the 9|P a ge Chapter I: General Introduction gastrulation. During this process, cells migrate through the process of epithelial-mesenchymal transition (EMT) from the epiblast into the primitive streak and form the ectoderm as well as the mesendoderm, which is the precursor of the definitive endoderm (DE) and the mesoderm (I-Figure 2). While organs like skin and brain are formed from the ectoderm, the mesoderm developes, for instance, into muscle cells and kidneys [88]. On the other hand, the endoderm forms a primitive gut consisting of anterior foregut, posterior foregut, midgut, and hindgut regions. From the posterior foregut hepatic progenitors, the hepatoblasts, are derived. These hepatoblasts form the liver bud, which is at this developmental stage the main producer of fetal blood cells [89]. Later the hepatoblasts can differentiate to biliary endothelial cells, which are progenitors for the bile ducts that are used to transport bile salts generated from the liver into the gallbladder. The majority of hepatoblasts, however, develop into immature hepatocytes which undergo a maturation process that even continues after birth. While hepatocytes are obtained from the endodermal lineage, the non-parenchymal cells are of mesodermal origin (reviewed in [89]). I-Figure 2: Hepatocyte cell lineage specification. Through gastrulation the fertilized zygote forms the three germ layers ectoderm, endoderm, and mesoderm. Then the foregut formation is enhanced in order to differentiate hepatoblasts, which are progenitors with the capacity to develop hepatocytes and biliary epithelium. The stepwise hepatic development is highlighted in red (adapted from [89]). 2.2 Hepatocyte differentiation and applications The liver possesses the capacity to fully re-grow even after 70% was removed, which is unique among organs in the human body [90]. Despite this regenerative potential, organ donation is the only cure for severe liver failure. As organ donors are scarce, the artificial production of liver cells is recognized as an alternative source. Hence, several different methods for hepatocyte generation have been developed over recent years. This chapter will highlight some of these 10 | P a g e Chapter I: General Introduction methods with a special emphasis on stem cell approaches and will draft a picture of the versatile applications of differentiated hepatocytes not only in medicine but also for basic research as well as drug discovery. Hepatocytes derived from pluripotent stem cells PSCs are a widely used source for generating hepatocyte-like cells in vitro. In the beginning, embryonic bodies, which are self-aggregated, heterogeneous, 3D structures of ESCs, were employed to generate hepatocyte-like cells [91]. However, this technique suffers from low differentiation efficiency and reproducibility. Gaining more knowledge about crucial differentiation conditions, including signaling pathways, transcription factors, and the extracellular matrix, paved the way to a directed differentiation of PSCs towards hepatocytes. Several protocols have been developed in recent years and are commonly divided into three steps: DE differentiation, hepatoblast specification, and hepatocyte maturation (reviewed in [92]). As DE is the common origin of several endodermal tissues, such as pancreas, thyroid, and liver, the faithful formation of DE is crucial for further differentiation. In vivo TGFβ signaling is activated during gastrulation in order to suppress ectodermal specification and induce the mesendodermal lineage formation. Mouse experiments showed that the mesendodermal lineage formation is highly regulated by the TGFβ superfamily member Nodal together with downstream Smad2 signaling [93]. While high levels of Nodal lead to DE generation, low levels support the specification towards the mesodermal lineage [94]. These Nodal signals can be mimicked in vitro with activin A, which binds to the same receptor protein and activates TGFβ signaling leading to an increased expression of DE markers like SOX17 and FOXA2 [95]. Other groups have shown that DE differentiation can also be supported by the canonical Wnt pathway [96, 97]. Therefore, several protocols use a combination of activin A and WNT3a to trigger both pathways [98-100]. However, the duration of supplementation has to be controlled as prolonged Wnt signaling during primitive gut tube formation favours hindgut over foregut specification [101]. Hence, during this time Wnt inhibitors are expressed to maintain foregut identity [102]. Adding to the elaborated timely interplay, Toivonen and colleagues demonstrated that extended WNT stimulation for more than 1 day resulted in decreased competence of stem cells to form pancreas progenitors, while hepatocyte-like cells could still be obtained after 5-7 days [103]. In a different approach recombinantly expressed proteins are substituted by small molecules, which are more stable and chemically defined. Siller and colleagues used the GSK-3 inhibitor CHIR99021 to activate the canonical Wnt pathway and promote DE induction, which led to comparable differentiation efficiencies [104]. During the second step, hepatoblasts, the precursors of hepatocytes, are specified from the DE. In vivo this process is promoted by the secretion of cytokines from the mesodermal tissues in close proximity to the endoderm. Jung and colleagues showed that expression of fibroblast growth factors (FGF) 1, 2, and 8, in the surrounding mesoderm triggered liver development from foregut endoderm in mice [105]. In addition, bone morphogenetic protein (BMP) was shown to be essential for hepatic specification from the endoderm [106]. Multiple different protocols have been developed to mimic this developmental process in vitro. Although they vary in the exact administration of the recombinant protein and its concentration as well as the duration, they commonly activate the FGF and BMP signaling pathways [99, 107, 108]. Other protocols 11 | P a g e Chapter I: General Introduction circumvent the application of recombinant proteins by using small molecules, such as sodium butyrate or DMSO to enhance hepatoblast formation [100, 104, 109]. Once hepatoblasts are derived from stem cells, they are solely capable of differentiating into hepatocytes or cholangiocytes. Cholangiocyte commitment can be supported with the supplementation of activin A, FGF-10, and retionoic acid, which results in the expression of early biliary markers, such as KRT19 and SOX9 [110]. On the contrary, hepatocyte maturation is promoted by the combination of hepatocyte growth factor (HGF), oncostatin M (OSM), fibroblast growth factor (FGF), and dexamethasone. OSM is an interleukin-6 family member which is highly expressed in hematopoietic cells in the fetal liver [111]. While its supplementation promotes hepatocyte maturation of hepatic progenitors in vitro, the differentiation towards hematopoietic lineage is suppressed [112-114]. Like OSM, HGF attenuates cholangiocyte differentiation and promotes upregulation of hepatic proteins, such as albumin [112]. As an alternative, the HGF mimetic N-hexanoic-tyrosine-isoleucine-(6) aminohexanoic amide can be used, which showed high affinity towards HGF and a comparable induction of c-Met phosphorylation [104, 115]. Lastly, dexamethasone is a glucocorticoid with antiphlogistic characteristics. However in the context of liver development, it enhanced the expression of HNF4 and C/EBPα, which are key transcription factors triggering hepatocyte maturation in vitro [116]. As reviewed by Palakkan and colleagues various protocols for hepatocyte maturation exist [92], which all use different combinations and conditions of the afore mentioned recombinant proteins or small molecules. Besides the utilization of recombinant proteins and small molecules, cell-cell and cell-matrix interactions are essential drivers for cell fate regulation and proliferation [117]. Especially for hepatocytes as they are epithelial cells with pronounced adhesion properties and strong interactions with the surrounding extracellular matrix (ECM). Proteins of the ECM interact with cellular receptors, such as integrins, which enables them to manipulate the cell via downstream signal transduction [118]. There are several artificial ECM alternatives that mimic in vivo properties and allow for more physiological differentiation [119]. Among them, matrigel and laminin have been shown to be useful [120, 121]. Applications of plutipotent stem cell-derived hepatocytes Hepatocytes derived from directed differentiation are capable of mimicking various functions of their in vivo counterparts. To this end, Takebe and colleagues were the first to demonstrate that in vitro generated liver organoids have the capacity to fulfill organ functions in vivo [122]. The liver organoids were formed by self-aggregation of iPSC-derived hepatic endoderm (HE), endothelial cells, and mesenchymal cells, and further matured upon transplantation into mice. Moreover, the liver organoids were vascularized and exhibited various hepatic characteristics which allowed to extend the life of liver damaged mice [122]. Additionally, Takebe and colleagues showed that liver organoids can entirely be generated from PSCs in a clinical relevant scale which highlights their great potential for regenerative medicine [123]. In addition to clinical applications, stem cell-derived hepatocytes are also an attractive model system in the field of drug development. Firstly, they are considered as a potential alternative for assessing the toxicity of bioactive drugs [124, 125]. So far, primary human hepatocytes (PHH) are the ‘gold standard’ in such assays, but they come with several limitations, such as they are scarce, show low proliferation rates in vitro, and quickly lose hepatic features in cell culture. To overcome 12 | P a g e Chapter I: General Introduction some of these limitations, 3D culture system generated from human ESCs [126] as well as human iPSCs [127] have been developed and applied to test toxicity of xenobiotics. The second promising application is drug screening (as reviewed in [128]). Jing and colleagues used the CRISPR/Cas9 technology to generate iPSCs that mimic the mitochondrial dysfunction of mtDNA depletion syndrome 3 [129]. The mutated iPSCs were plated in 96-well plates and differentiated into hepatocytes that were further used to assess the effects of 2,400 drugs on the metabolism. Based on this large-scale screen, 15 interesting hits were found of which one was confirmed to increase mitochondrial activity in vivo. In a similar study, Choi and colleagues employed patient-derived iPSCs with an alpha-1 antitrypsin deficiency [130]. They differentiated these cells into hepatocytes in a 96-well format and measured the effects of more than 3,100 drugs using an immunofluorescent readout. Alternative strategies for hepatocyte differentiation Oval cells are hepatic progenitors which supposedly origin from the canal of Hering, which is the connection between bile canaliculus and the bile ducts in the liver [131]. They are assumed to be not terminally differentiated hepatoblasts, which possess the capability to mature into hepatocytes or bile endothelial cells. Although they have some differentiation potency, they are most likely only progeny of stem cells as they are not able to self-renew [131, 132]. Since they are scarce and their isolation is challenging, the application for research is limited. Awan and colleagues showed that oval cells can be derived from adult bone marrow stem cells, which were able to take over some of the functions of co-cultured injured hepatocytes [133]. Furthermore, upon transplantation the oval-like cells were capable of rescuing some functions of fibrotic livers in mice, showing their potential for future applications [133]. Multipotent stem cells are an additional type of cell with the capacity to differentiate into hepatocytes. They can, for example, be non-invasively obtained from umbilical cord blood and showed the potential to differentiate into hepatocyte-like cells [134]. Lin and colleagues proved that cells from the umbilical cord can be transplanted into rats with liver damage and improve liver recovery [135]. They further demonstrated the production of crucial hepatic proteins from these cells, such as albumin and hepatocyte growth factor. A third alternative technique for hepatocyte generation is transdifferentiation. During this process terminally differentiated somatic cells are converted into a different cell type without going through an intermediate pluripotent state. For example, Shen and colleagues showed that the treatment of pancreatic cells with dexamethasone led to the conversion of exocrine cells into hepatocytes [136]. The transcription factor C/EBPβ proved to be a key player in this transformation and is suspected to regulate the differentiation of the two endoderm-derived cell types. Hepatocytes can also be obtained from the non-parenchymal stellate liver cells. As mentioned in the previous subchapter (2.1 Liver: Physiology and embryonic development), the exact function of stellate cells is inconclusive, but during liver fibrosis they get activated and transform into myofibroblasts. Song and colleagues illustrated that through the overexpression of the transcription factors FOXA3, GATA4, HNF1A, and HNF4A, myofibroblasts could be differentiated into hepatocyte-like cells in vitro [137]. They further evaluated their findings in an in vivo mouse model by converting the fibrogenesis-inducing myofibroblasts into hepatocyte-like cells resulting in alleviated liver fibrosis. In 2011, two groups showed independently that mouse 13 | P a g e Chapter I: General Introduction fibroblasts can be directly reprogrammed into hepatocyte-like cells, which were able to restore liver functions of diseased mice. While Huang and colleagues overexpressed Gata4, Hnf1α, and Foxa3 [138], Sekiya and Suzuki used combinations of Hnf4α, Foxa1, Foxa2, and Foxa3 [139]. However, transdifferentiation was incomplete as immature markers, like Afp and CK19, were upregulated and some cytochrome P450 proteins were not expressed [138]. Furthermore, this resulted in only a partial rescue making additional optimizations inevitable. In summary, different strategies to obtain hepatocytes in vitro exist and several studies demonstrated exciting results for applications in the clinics as well as for drug discovery. If limitations, such as immature hepatic characteristics, heterogeneous cell populations, or varying reproducibility between batches are improved, in vitro generated hepatocytes can be a promising alternative to primary hepatocytes. 14 | P a g e Chapter I: General Introduction 3 Proteomics for studying developmental processes When studying developmental processes, very often transcriptomics or low-throughput proteinbased approaches are employed. For transcriptomics analysis, mRNA is extracted from cells and further amplified resulting in great sensitivity and sequencing depth. However, since mRNA is just the precursor of proteins and the correlation of mRNA to protein levels can diverge [140], such studies do not provide the full picture of biological processes in the cell. Hence, studying proteins, the executing molecules of the cell, can provide more insight by adding another level of information. For this type of analysis, very often methods like fluorescent activated cell sorting (FACS), immunostainings, or western plots are utilized. However, these approaches heavily rely on antibodies, which constrain the analysis in a way that it is low-troughput, requires specific antibodies, and prior hypotheses to select the proteins of interest. An alternative, which overcomes some of the limitations, is mass spectrometry-based proteomics. This method allows the indentification and quantification of global protein levels in a hypothesis-free way with great depth and sensitivity. Experiments of this kind can either be performed as ‘bottom-up’ or ‘topdown’ approaches. The latter injects intact proteins into the mass spectrometer, which leads to a high sequence coverage and a potential full protein characterization [141]. In contrast, for the ´bottom-up´ technology, proteins are chemically or enzymatically digested and mass-to-charge (m/z) ratios of the resulting peptides are acquired. While this methodology is more sensitive and provides a higher proteome coverage compared to ‘top-down’ measurements, the protein identification possessess some challenges. As the identification is inferred from peptides, and many peptides or peptide sequences are shared among proteins, the assignment of a peptide to one distinct protein can be ambiguous. Nevertheless, the advantages of ‘bottom-up’ proteomics prevail and technological developments during the last years yielded more than 17,000 identified proteins in the first draft of the human proteome [142, 143], which shows how powerful this method is and what impact it might have for future scientific research questions. Besides analysing the proteome, the high accuracy and sensitivity of mass spectrometers allows the measurement of PTMs of which many are connected to cell differentiation and development [144]. The following paragraphs provide details on classical ‘bottom-up’ workflows that have been applied throughout this work. 3.1 Bottom-up sample preparation Before the sample can be introduced into the mass spectrometer, it undergoes several sample preparation steps, such as protein extraction, protein digestion, and the removal of interfering substances (I-Figure 3). Depending on the sample type and aim of the experiment, different preparation steps are performed and some of them are described in more detail with a special focus on the methods used in this work. Protein extraction and digestion In general, proteomic sample preparation starts with protein extraction, which can vary depending on the type of sample (e.g. tissues, body fluids, cell lines) and the downstream analysis that is performed. Besides methods that use mechanical force to break cells (e.g. bead mills), non- 15 | P a g e Chapter I: General Introduction mechanical methods can be applied (reviewed in [145]). An example for a cheap and straightforward approach, is chemical lysis. Detergents (e.g. sodium dodecyl sulphate (SDS)) are very commonly used chemicals, because they contain hydrophilic and hydrophobic moieties, allowing the detergent to integrate into the phospholipid bilayer of the cell, which causes the membrane to break. Furthermore, detergents disrupt non-covalent interactions resulting in protein denaturation [146]. On the downside, detergents impair enzymatic digestion, hamper chromatographic separation, and suppress the ionization [147, 148], which requires their removal during the sample preparation for mass spectrometry-based proteomics. Chaotropic reagents, such as urea or guanidine, are alternatives for chemical lysis. They interact with water molecules surrounding the cell membrane, which leads to a decrease in hydrophobicity and subsequently in its disruption. Similar to detergents, urea breaks non-covalent interactions causing protein denaturation [149]. After their successful extraction, proteins are digested for bottom-up proteomics (reviewed in [150]). For this, disulfide bridges are reduced and the free cysteins are alkylated. Besides chemical digestion using acids or other chemicals like cyanogen bromide, peptides can be generated enzymatically. Trypsin is a widely used protease, because of its high efficiency and sequence specificity cleaving peptide bonds c-terminally at lysine and arginine residues. As lysine and arginine are highly abundant in the human proteome, the resulting peptides consist on average of around 14 amino acids [151]. Additionally, the positive charges of the two basic amino acids make tryptic peptides ideal for mass spectrometry (MS) as it is based on the manipulation of charged ions. Although trypsin digestion can be considered the ‘gold standard’ for proteomics, other sequence specific proteases, such as Glu-C, Lys-C, Asp-N, or chymotrypsin, are alternatively used and have been proven advantageous in some cases [152, 153]. Besides the large selection of proteases, the setting in which proteins are digested can be varied according to the sample type. Traditionally, enzymatic digestion for bottom-up proteomics was performed in-solution or in-gel. While in-gel digestion is robust and capable of removing any interferring contaminants, it is very time-consuming and needs many manual handling steps [154, 155]. On the contrary, the insolution protocol can easily be automated and is scalable, but interferring substance from the lysis buffer cannot easily be removed making further peptide cleanup necessary. This is especially of essence for workflows based on detergents, as their removal is tedious and usually leads to sample loss. While detergents often comprise multiple advantages, such as an increased yield of membrane proteins, they hamper digestion efficiency and impair the MS measurements. Filterbased methods like S-Trap [156] or the filter-aided sample preparation [157] facilitate the removal of SDS and provide a technology for protein digestion in a single spin column. Similar to this approach is the in-StageTip (iST) method [158], which combines all processing steps from cell lysis to the elution of purified peptides in a single pipette tip filled with C18 material. While this methodology enables to process minute amounts of samples [159], detergents cannot be removed. An additional alternative is the utilization of magnetic beads like the single-pot, solidphase-enhanced, sample preparation (SP3, [160]) method or the protein aggregation capture approach [161]. Here, proteins are precipitated onto magnetic particles facilitating the removal of unwanted contaminants, including detergents, by washing steps before proteins are digested. This technology proved to be advantageous for low-input samples [159, 160] and is additionally easily automatable, which makes it interesting for high-throughput experiments [162]. As 16 | P a g e Chapter I: General Introduction outlined, several options for protein extraction and digestion exist, but they all have their advantages and disadvantages. A robust, scalable, inexpensive, automatable, straightforward, and universal method with minimal sample loss is yet to be found. Peptide fractionation Digestion of the proteome results in a peptide mixture of tremendous complexity which makes data acquisition challenging. In order to decrease this complexity, additional sample fractionation strategies can be performed, which separate peptides based on their physicochemical properties. For this, usually high-performance liquid chromatography (HPLC) [163] or Stop-and-go-extraction tips (StageTips) [164] are used. In both methods the separation is based on the interaction of peptides between the mobile and the stationary phase (as reviewed in [165]). Reversed-phase chromatography is a fractionation method, which allows analyte separation based on hydrophobicity [166]. Here, the analyte is dissolved in the polar mobile phase and is separated via hydrophobic interactions with the non-polar stationary phase. This is a widely used chromatography approach in the field of proteomics, as it is highly orthogonal to the low pH reversed-phase chromatography commonly used for on-line fractionation preceeding the MS measurement. However, several other chromatographic methods, such as ion-exchange chromatography [167], hydrophilic interaction liquid chromatography [168], or a mixed mode [169, 170] can be applied as alternatives. Although thorough fractionation increases the number of protein and peptide identifications, this comes with the cost of increased measurement time. Phosphopeptide enrichment As many of the developmental signalling pathways are regulated by phosphorylation, the investigation of phosphorylation patterns and the inference of kinase activities are of great interest to elucidate biological function. While protein depth can be substantially increased by peptide fractionation, an additional enrichment step is required in order to achieve comprehensive phosphoproteome coverage, which is due to the substoichiometric abundance of phosphorylated peptides in nature (as reviewerd in [171]). Immobilized metal affinity chromatography (IMAC) is one widely utilized enrichment strategy [172]. Here, metal ions, such as Fe3+, are chelated either with iminodiacetic acid or nitrilotriacetic acid and form a coordinative bond with the negatively charged phospho groups. As this binding step is performed at low pH, the unspecific binding of negatively charged carboxyl side chains is reduced. The phosphorylated peptides are further eluted by the addition of competing alkaline solvents or phosphate buffers [173]. The Fe3+-IMAC enrichment can either be performed with beads [174] or in column format [175] and has recently been optimized for high-throughput and low input experiments on the AssayMap Bravo pipetting robot [176, 177]. In the last years a slightly modified setting using magnetic beads has been developed and is commercially available from ReSyn Biosciences (Pty) ltd (Edenvale, Gauteng, South Africa). Here, phosphopeptides are bound to metal(IV) ions, like Ti4+ or Zr4+, which are chelated to phosphonate groups and connected via a linker to the magnetic microparticles. This setup has been proven to be especially attractive for small protein amounts and is in addition easily automatable enabling high-throughput enrichments [161, 178]. 17 | P a g e Chapter I: General Introduction Acetylation enrichment Besides phosphorylation, acetylation of lysine residues is another PTM capable of altering protein features like the regulation of enzymatic activity [179, 180]. In addition, histone acetylation affects chromatin accessibility and therefore gene transcription, which has been shown to comprise an important role for developmental processes [181-184]. However, studying acetylation is challenging due to its substoichiometric nature, which makes an enrichment step inevitable. In general, bead-conjugated antibodies targeting a specific acetyl-lysine motif are used for this purpose. Coupling this enrichment strategy with MS enables a global survey of acetylation levels demonstrating the widespread prevalence on a variety of different proteins [185, 186]. Despite the need of large input amounts, Svinkina and colleagues demonstrated that with an optimized proteomic sample preparation workflow more than 10,000 acetylated peptides can be identified [187]. I-Figure 3: Classical bottom-up proteomics workflow. Proteins are extracted from cells using lysis buffer and are further enzymatically digested to peptides. Optionally, peptides are enriched for phosphorylation or acetylation groups, for example. The peptides are separated on- or off-line using one- or multidimensional separation strategies. The resulting peptide mixture is ionized and injected into the mass spectrometer where m/z ratios are measured. For identification and quantification of peptides/proteins, the acquired mass spectra are matched against a protein database. Modified with permission from Steen and Mann [188] and created with BioRender.com. 3.2 Basics of bottom-up proteomics Once sample preparation is finished, a clean peptide solution is injected into the mass spectrometer where m/z ratios are acquired. These m/z ratios can further be matched against an in-silico database, which leads to peptide identification and enables to infer protein information. Technological advances have generated a multitude of different online liquid chromatography (LC) systems, ionization methods, and mass analyzers. The next paragraphs will focus on the Orbitrap Fusion Lumos (Thermo Fisher) as this machine was solely used for this study (I-Figure 4). Online fractionation Before the sample is introduced into the mass spectrometer, peptides are separated using an online LC system. Reversed-phase HPLC is the most widely used separation technique, as it ensures high peak capacity and solvent compatibility [189, 190]. This approach is based on the direct interaction between hydrophobic peptide residues with the non-polar stationary phase, such as octadecyl alkane chains on silica beads. The analyte is dissolved in the acidic mobile phase, which puts a net positive charge on most tryptic peptides, a prerequisite for the generally applied positive MS mode. Moreover, it results in protonated hydroxyl groups of the silica beads, which 18 | P a g e Chapter I: General Introduction increases the peptide separation. One disadvantage of this approach is that polar peptide moieties are not retained by the stationary phase. To overcome this problem, ion-pairing reagents, such as formic acid (FA), are added to the mobile phase increasing the separation by indirect ion pairing effects [166, 191]. Peptides are eluted from the column by an increasing concentration of organic solvent in the mobile phase and further transferred to the mass spectrometer. Due to the high sensitivity and ionization efficiency, nanoflow LC has been widely used in the last years [192]. However, as the newly developed MS machines are more sensitive, micro-flow LC is gaining more popularity. At the expense of more sample material, this approach showed very promising results concerning throughput and robustness [193, 194]. Electrospray ionization Peptides eluting from the reversed-phase column are passed through a thin capillary, the emitter, on their way into the mass spectrometer. Since the mass spectrometer acquires m/z ratios of ion, peptides have to be transferred into the gas phase. One mild and commonly used ionization method for this is the electrospray ionization, which efficiently produces very stable ions (as reviewed in [195]). While the high voltage applied between the emitter and the mass spectrometer pulls the charged molecules from the liquid to the counter electrode, the surface tension pulls the liquid back to the emitter in order to decrease the surface area. At a specific voltage the Taylor Cone is formed, which is a pointed cone where droplets are released towards the counter electrode. During the flight, solvents are evaporated until the repulsive forces of the remaining charges exceed the surface tension (Rayleigh limit) of the droplet. This results in the Coulomb fission causing the burst of the droplet and the formation of multiple nanodroplets containing charged peptides [196]. Although the principle of this effect was already described in 1964 [197], the exact mechanism of the final peptide ionization has not been resolved yet and can only be approximated by two models. The ion evaporation model postulates a decrease of droplet size due to solvent evaporation until the charged analyte can be expelled from the droplet [198, 199]. On the contrary, the charged residue model assumes that each nanodroplet contains only one analyte and upon complete solvent evaporated, the remaining charge is transferred to the peptide [200]. Ions enter the high-vacuum of the MS through a heated capillary, which further supports the complete solvent evaporation. Precursor m/z acquisition Once the ions are delivered into the MS, they are steered through electromagnetic fields on their path to the mass analyzers where their m/z values are determined. Several mass analyzers have been developed, each with different properties regarding mass resolution, mass accuracy, sensitivity, scan speed, dynamic range, and m/z range, as reviewed by Savaryn and colleagues [201]. Depending on the research question and sample type, specific mass analyzers are favoured and very often combined in one machine. The Fusion Lumos, which was used for this study, incorporates a quadrupole, a linear ion trap, and an orbitrap mass analyzer (I-Figure 4). The quadrupole consists of four parallel metal rods, where the two opposing rods are electrically connected. By setting different Alternating Current and Direct Current voltages, the trajectories of ions travelling through the quadrupole can be stabilized or destabilized [201]. For this reason the quadrupole is very often used as a prefilter, which allows the selection of ions with specific 19 | P a g e Chapter I: General Introduction m/z ratios. The linear ion trap follows a very similar principle, but is further capable of storing and fragmenting ions as well as determining m/z values [202]. It also consists of four rod-shaped electrodes of which two have exit slits for ions to leave the ion trap and enter the detector. The third mass analyzer of the Fusion Lumos is an orbitrap, which consists of two barrel-like outer electrodes and a coaxial inner spindle-like electrode [203]. Before ions enter the orbitrap, they are collected in the C-Trap and then injected as ion packages where they oscillate around the inner electrode. As the axial movement is proportional to the m/z value, the frequency of this oscillation can be recorded as an image current and further used to deduce m/z values by Fourier Transformation [204]. Tandem mass spectrometry Although the determined m/z values from the MS1 scan are very accurate, this information alone is not enough to unambiguously identify peptides, because peptides with a different amino acid sequence but the same mass cannot be distinguished. In order to overcome this problem, two consecutive MS spectra are acquired in the so-called tandem MS. For mass spectrometers operated in data-dependent acquisition (DDA) an MS1 scan measures the m/z values and intensities of intact peptide percusors ions first. Then, the mass spectrometer decides automatically in real-time which precursors are selected for further fragmentation and MS2 spectrum acquisition. For a predefined number or time span, MS2 spectra are acquired based on their MS1 abundance, which limits the reproducibility due to the stochastic nature of precursor selection for fragmentation. In contrast, for data-independent acquisition (DIA) m/z windows are grouped and subsequently fragment ions are analyzed independent of the MS1 scan. During the fragmentation process, precursors collide with inert gas (e.g. helium, nitrogen) breaking the peptide bonds. The fragmented precursors are further used to record MS2 spectra which reveal the amino acid sequence. Combining the m/z values from the precursors with their amino acid sequence allows a very accurate peptide identification. 20 | P a g e Chapter I: General Introduction I-Figure 4: Schematic of the Orbitrap Fusion™ Lumos™ Tribrid™ mass spectrometer (reprinted with permission from [205]. Copyright (2013) American Chemical Society). 3.3 Mass spectrometry data analysis Depending on the acquisition mode, the measurement time and the mass analyzer used for data generation, usually thousands of MS1 and MS2 spectra are recorded. In order to retrieve peptide information from these spectra, search engines with specific algorithms are used. Furthermore, this peptide information is used to identify and quantify proteins. Protein identification As discussed above, peptide identification solely based on MS1 spectra is ambiguous. Hence, acquiring MS2 spectra are inevitable to gain (partial) amino acid sequences as an additional layer of information for peptide identification. One method to deduce peptide information from MS2 spectra is de novo sequencing, which uses the delta mass between fragment ions to determine the discrete amino acid sequence [206]. While this approach is suitable for detecting previously unknown peptides, it requires the complete amino acid sequence, which is not achieved in most mass spectrometry-based experiments. In addition, this method is restricted to less complex samples, because a widely accepted approach for validating results from large-scale datasets is still lacking [207, 208]. However, the validation is an essential step for proteomics experiments, as usually thousands of spectra are recorded which have to be controlled with respect to their correct identification. Some of these challenges are overcome by an alternative method, which is based on database matching performed by softwares like Maxquant [209]. Here, the implemented search engine Andromeda [210] compares the obtained fragment spectra to an in silico digested protein database of theoretical spectra. For this, the noise of fragment masses is reduced, the charge state is deconvoluted, and de-isotoped. Furthermore, search parameters, such as the 21 | P a g e Chapter I: General Introduction applied protease or the mass tolerance, are specified to decrease the search space. Then, the search engines calculate a score describing the probability of a true identification and usually the highest scoring hit is reported. The correct annotation of the peptide spectrum match (PSM) is crucial for bottom-up proteomics, as protein identification is inferred from the experimentally acquired peptides. To control for correct identification, the experimental spectra are additionally searched against a database of reversed or scambled peptide sequences in a target-decoy approach [211]. Each hit to this decoy database is false by definition and enables the calculation of a false discovery rate (FDR) which is very often set to 1% to keep the number of wrong identifications low. As a peptide sequence cannot always be uniquely assigned to just one protein, but rather multiple, MaxQuant reports them as razor peptides. Due to this ambiguous identification, usually protein groups instead of proteins are reported, which demonstrates the protein-inference problem of bottom-up proteomics. Another drawback of this method for identification is that only sequenced organisms can be used, because a comprehensive database is required for sufficient peptide matching. While protein identification is usually based on several peptides, in many cases one or only a few MS2 spectra are used to identify a phosphopeptide and to localize the P-site. As in theory each serine, threonine, or tyrosine can be modified, the search space is vastly elevated, which increases the possibility for false identification as well as localization. Hence, phosphorylation localization tools have been developed to this end [212, 213]. In addition, very often the data analysis is limited to so-called class I P-sites, which possess a localization probability >0.75 [212]. Protein quantification In addition to protein identification, protein abundance information is usually of interest for research questions. Depending on the experiment and sample type, several different quantification methods exist and can be classified into label-free or label-based methods. In labelfree approaches protein identification is based on MS2 spectra, while MS1 spectra are facilitated for quantification [214]. One quantification method is spectral counting, where the number of peptides for each protein is summed up and used to derive protein abundance. A more frequently used method uses the peak intensity of peptides, which correlates with their abundance [215]. However, a big drawback of label-free methods is that all samples from one experiment are only combined at the last step, the data processing. Hence, sample preparation and MS measurement has to be performed for each sample individually, which is not only time consuming but also error prone. In addition, due to the stochastic nature of DDA approaches missing values across many samples are unavoidable. In order to increase reproducibility and throughput, several techniques of multiplexing samples have been developed (reviewed in [216]). While some strategies are based on the incorporation of stable isotope labels with amino acids in cell culture, other approaches use chemical labeling, such as tandem mass tags (TMT). The latter technique uses succinimide chemistry to label the amino termini and lysine residues of peptides and thus enable the parallel quantification of up to 18 samples within a single experiment [217]. As TMT reagents have the same nominal mass, the MS1 spectra complexity is not increased. Upon precursor selection and subsequent fragmentation, the TMT reporter ions are cleaved off allowing for relative quantification as well as peptide identification based on MS2 scans. However, the precursor selection suffers from the coisolation of coeluting precursors and thereby leads to ratio 22 | P a g e Chapter I: General Introduction distortion of the TMT reporter ions [218, 219]. In order to overcome this problem, several ideas have been discussed in the past, such as decreased isolation window width [220], gas-phase purification [221], or utilization of complement reporter ions [222]. In this work, the quantification accuracy was increased by an additional MS3 scan, which can be performed by the Tribrid Fusion Lumos machine and has been shown to reduce ratio distortion of coisolating precursors [223]. Here, an MS2 spectrum was acquired for peptide identification and the most abundant fragments were further used for another round of fragmentation. These generated fragments were then recorded in an MS3 scan for quantification of reporter ions. Although this additional round of isolation leads to ratios closer to the reality, it suffers from decreased sensitivity. The MultiNotch MS3 measurements counteracts this pitfall with the synchronous precursor selection (SPS) of several of the most abundant MS2 fragments, which increases the MS3 intensity and thereby the sensitivity [224]. 23 | P a g e Chapter I: General Introduction 4 Objectives and Outline Mass spectrometry-based proteomics is the most widely used technique for analyzing protein levels of complex samples. However, relatively large protein quantities are required to achieve comprehensive measurement depth, preventing its usage in some applications. While developments on the mass spectrometer-side in recent years have led to enhanced sensitivity of the devices itself, the sample preparation turned out to be one of the major bottlenecks for analyzing small protein quantities. Therefore, methods that can cope with minute protein quantities are warranted. The study of in vitro cell differentiation is an area of research that could benefit from such improved experimental conditions on quantity-limited samples, as differentiation experiments are cost and labor intensive. Moreover, few comprehensive and global proteomic characterizations of directed cell differentiations have been performed. Thus, the aim of this work was to exploit the insights that can be gained from the temporal dynamics of global protein levels and their phosphorylation and acetylation during hepatocyte differentiation. At the beginning, different sample preparation methods were bechmarked and their suitability for low input material was assessed. This evaluation revealed the SP3 bead approach as a versatile sample preparation method that can easily be implemented without the necessity of expensive equipment (Chapter II). In a second part, deep proteome profiling was utilized to gain insights in the differentiation process of pluripotent stem cell-derived hepatocytes (Chapter III). The high temporal resolution of this experiment enabled the identification of novel stage-specific marker proteins and elucidated the molecular changes occuring during the differentiation process. In addition, the study of phosphorylation and acetylation dynamics allowed the inference of kinases and deacetylase activity as key drivers of various signaling pathways. Finally, the same experimental setup was adopted to facilitate the comparison of in vitro generated hepatocytes to fetal and mature liver samples for the characterization of current differentiation protocols (Chapter IV). 24 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics 1 Summary ............................................................................................................................ 29 2 Introduction ....................................................................................................................... 30 3 Material and methods ........................................................................................................ 32 4 Results and discussion ........................................................................................................ 38 4.1 Optimizing sample preparation for low input amount and TMT-labeling ...................... 38 4.2 Benchmarking different beads for fullproteome analysis ............................................. 42 4.3 Magnetic beads for phosphopeptide enrichment ......................................................... 45 27 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics The following chapter includes data generated from Karl Kristian Krull during his research internship “Influence of surface modification in SP3 technology for functional proteome analysis” and his Master Thesis “Phosphoproteomics for low input amount using paramagnetic IMAC beads”, which he conducted under the author’s supervision at the Chair of Proteomics and Bioanalytics at the Technical University of Munich. 28 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics 1 Summary Albeit mass spectrometry-based proteomics has been successfully used to analyse a variety of different biological samples, an extraction and sample preparation method that can be universally applied is missing. In this regard, several different approaches for sample processing were benchmarked with a special emphasis on methods suitable for small quantities as well as phosphopeptide enrichments. Optimizing the in-StageTip workflow resulted in the identification of more than 10,000 peptides from 2,000 human cells, which is 5 times more than with the standard protocol. Only the use of magnetic carboxyl-coated SP3 showed better performance with about 40% higher number of peptide identifications. Additionally, the SP3 approach was compatible with TMT labeling, while decreasing processing time and handling steps compared to the standard in-solution workflow. Of note, alternative magnetic microparticles, such as amine and hydroxyl-modified beads, performed similar as SP3. In contrast, beads based on a HILIC-like protein binding decreased the number of identified peptides, had a bias towards more hydrophilic peptides, and showed lower reproducibility. In addition, protocols for phosphopeptide enrichments using magnetic Ti-IMAC and Zr-IMAC beads were optimized leading to the detection of 20% more phosphorylation sites than in the manufacturer’s protocol. Based on these results a novel workflow, called OnePot, was developed which facilitates protein clean-up, digestion, and phosphopeptide enrichment in one reaction tube using magnetic IMAC beads. Compared to the Agilent AssayMAP® Bravo system the OnePot approach performed particularly well for enrichments of low input amounts. 29 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics 2 Introduction In bottom-up proteomics, protein abundances are indirectly inferred from the measurement of peptide solutions. Despite of multiple different methods for obtaining clean peptide solutions from protein extracts, their applicability depends on the experimental setup as well as the sample type. For example, different extraction buffers and additives can hamper enzymatic digestion, can interfere with fractionation columns, and can impair peptide labeling and enrichment. Hence, lysis reagents often need to be removed or diluted below a threshold to minimize their negative impact. In-solution digestion is a classical workflow for generating peptides, but is susceptible to contaminants and requires additional desalting steps which are time consuming and result in sample loss. Especially the latter is crucial for sample-limited experiments. In this regard, Kulak and colleagues designed an iST approach, which allows cell lysis, protein digestion, and desalting in a single reaction vessel [158]. While this approach decreases sample loss, it is not capable of removing detergents and is therefore not compatible with several lysis buffers. Another approach that has gained increasing interest in recent years is SP3 [160]. This method is based on the precipitation of proteins to magnetic carboxyl-coated beads using high concentrations of organic solvents. Once proteins are precipitated, detergents and other interfering substances can be removed. As the sample loss is small, this approach offers great potential for sample-limited experiments [159, 160, 225]. ReSyn Biosciences (Pty) ltd offers a commercially available alternative to SP3 beads, which feature a variety of different surface modifications. Compared to SP3 beads, they provide an increased surface area due to their bigger diameter as well as their hyper-porous characteristics which supposedly enhances their capacity of protein capturing. Apart from the variety of workflows targeting the fullproteome, multiple different approaches for phosphopeptide enrichment exist. About 30% of all proteins are estimated to be phosphorylated [226], which alters their biological characteristics, including activity, subcellular localization, or protein-protein interactions. As this PTM suffers from low stoichiometry and abundance, an enrichment step is in general necessary. One commonly applied strategy is IMAC, where chelated metal ions form coordinative bonds with the phosphate groups of peptides. Traditionally, metal(III) cations such as Fe3+ or Ga3+ have been used. However, several studies have introduced metal(IV) cations as alternatives with potentially superior enrichment selectivity and enhanced phosphopeptide binding [227-229]. This led to the development of the noncovalent immobilization of Ti4+ and Zr4+ cations to magnetic beads via phosphonate linkers. Nevertheless, a big drawback of phosphoenrichment is the requirement of high sample amount. To this end, several platforms and approaches for low input quantities were designed over the last years. For example, the EasyPhos approach has enabled the identification of more than 20,000 phosphopeptides from only 200 µg of protein by consecutively performing protein digestion and phospho enrichment in a 96-well-plate format [230, 231]. Chen and colleagues have published the column-based Phospho-SISPROT method, which utilizes spintips for protein digestion, IMACenrichment, and subsequent desalting allowing to process samples in the low µg range [232, 233]. Furthermore, the liquid handling robot AssayMAP® Bravo Agilent offers a high-throughput system with great potential for minimizing sample quantities [177]. The following experiments were designed to optimize and evaluate the commonly applied insolution, iST, and SP3 approaches for low amounts of starting material. Furthermore, magnetic 30 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics beads functionalized with several different chemical groups were benchmarked. Lastly, the enrichment of phosphorylated peptides using magnetic Ti4+ and Zr4+ IMAC-beads was tested. Utilizing the IMAC beads for protein precipitation, digestion, and phospho enrichment, led to the development of an integrated workflow performed in a single tube which revealed promising results for sample-limited experiments. 31 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics 3 Material and methods Protein extraction Cells cultured in suspension were transferred into a falcon tube and centrifuged. The medium was discarded and the cell pellet was washed twice with phosphate-buffered saline (PBS) (w/o CaCl2/MgCl2). Then, the cell pellet was lysed with lysis buffer containing 8 M urea, 40 mM Tris/HCl (pH 7.6), 1x EDTA-free protease inhibitor (cOmplete™, Roche), and 1x phosphatase inhibitor mix (prepared in-house according to the Phosphatase Inhibitor 1, 2, and 3 from Sigma). The lysate was sonicated for 15 cycles of 30 sec ON/OFF using a Bioruptor® (diagenode) and the protein concentration was determined with the Pierce™ Coomassie assay (Thermo Scientific™). Alternatively to the urea lysis, washed cells were lysed with 2% SDS in 50 mM Tris/HCl (pH 8.5) and subsequently heated at 95°C for 10 min. In order to hydrolyse nucleic acids, a final concentration of 2% trifluoroacetic acid (TFA) was added to the boiling lysate for 1 min. The pH was quenched back using 3 M Tris until a neutral pH was reached and the lysate was additionally sonicated for 15 cycles of 30 sec ON/OFF using a Bioruptor® (diagenode). Finally, the protein concentration was determined using the Pierce™ BCA protein assay (Thermo Scientific™) and the lysate was stored at -80°C until further use. In-solution workflow Urea cell lysate was thawed and the respective protein amount was transferred to an Eppendorf tube. Next, proteins were reduced with 10 mM DTT for 45 min at 37°C and subsequently alkylated using CAA for 30 min at room temperature (RT). For an efficient digestion, the sample was diluted to a urea concentration below 1.6 M using 40 mM Tris/HCl (pH 7.6).Trypsin was added at a 1:50 enzyme-to-protein ratio and incubated at 37°C overnight. On the next day, the digestion was stopped by adding FA to a final concentration of 1%. The digested peptides were further transferred onto self-packed StageTips for desalting as described previously [164]. Shortly, C18 material (Octadecyl Extraction Disks, 3M Empore™) was packed into a 200 µL pipette tip and equilibrated. The acidified peptides (pH 2-3) were loaded and contaminants were removed with two washes of 0.1% FA. Peptides were eluted with 50% acetonitrile (ACN) in 0.1% FA and dried down. For input amounts >100 µg protein, acidified peptides were loaded onto 50 mg SepPak columns (Water Corp.) and desalted. Contaminants were washed away with 0.1% FA, peptides were eluted with 0.1% FA in 50% ACN, and dried down. Depending on the experimental setup, peptides were further labeled with TMT (Thermo Scientific™). For labeling, the desalted peptides were reconstituted in 20 µL of 50 mM HEPES buffer (pH 8.5) as described previously [234]. Then, 5 µL 11.6 mM TMT reagent was added and incubated for 1 h at RT shaking at 400 rpm. The reaction was quenched using 2 µL of 5% hydroxylamine and subsequently all TMT channels were pooled. The labeled samples were further desalted using SepPak cartridges as described above and dried down. iST workflow The sample preparation workflow was based on the previously published study by Kulak and colleagues [158]. As described in the results section some modifications were made in order to optimize the workflow for low sample quantities. First, StageTips were constructed as described 32 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics in the previous section by packing 5 C18 disks in a pipette tip. While the original protocol recommended the use of 200 µL pipette tips, enhanced performance was observed utilizing 10 µL pipette tips in combination with lowering the digestion volumes. The StageTip was primed with 90 µL ACN followed by equilibration with 90 µL 100 mM Tris (pH 8.5). The column was heat-sealed and covered with 5 µL 100 mM Tris (pH 8.5) to avoid complete drying. Then, cells were washed twice with PBS (w/o CaCl2/MgCl2), counted in a Neubauer champer, and applied onto the StageTip (volume should be adjusted to 5 µL). For cell lysis, 10 µL urea buffer (8 M urea, 100 mM Tris, pH 8.5) was added and incubated for 10 min on ice. Proteins were reduced with a final concentration of 10 mM DTT for 45 min at 37°C and alkylated with 55 mM CAA for 30 min at RT. The urea concentration was decreased below 1.6 M using 25 mM Tris (pH 8.5) before trypsin was added with an enzyme-to-protein ratio of 1:50 and incubated at 37°C and 600 rpm overnight. On the next day, the sample was acidified with FA (1% final concentration) and the heat-sealed pipette tip was cut open. The digested peptides were loaded slowly to the C18 material by centrifugation, washed with 0.1% FA, and subsequently eluted with 0.1% FA in 60% ACN. SP3 workflow The following workflow was modified from the original protocol of Hughes and colleagues [160, 235]. Magnetic hydrophilic (GE45152105050250) and hydrophobic (GE65152105050250) carboxylate-modified Sera-Mag™ SpeedBeads (50 mg/ml) were removed from the fridge and kept at RT for 10 min. The beads were vortexed and mixed with a 1:1 ratio. They were washed 3 times with water using a magnetic rack for immobilization. Before use, bead pellet was reconstituted with water to a final concentration of 100 mg/ml. Urea lysate (chapter 4.1) or SDS lysate (chapter 4.2), respectively, was thawed and the desired protein amount was aliquoted into reaction tubes. For up to 50 µg protein lysate 2µl of the prepared bead stock was transferred to the lysate, whereas 10 µl of the bead stock was applied for protein amounts above 50 µg. Next, ACN was added for protein aggregation to a final concentration of 75% and incubated for 20 min shaking at 800 rpm. The beads were separated using a magnetic rack and the supernatant was discarded. Beads were washed twice with 80% ethanol and once with 100% ACN to remove contaminants. Subsequently, digestion buffer (50 mM Tris/HCl, pH 8.5 or pH 7.6, 2 mM CaCl2) and DTT (10 mM final concentration) were added and incubated for 45 min at 37°C. Afterwards cysteine residues were alkylated with 55 mM CAA for 30 min at RT in the dark. For the overnight digestion, trypsin was added in a 1:50 enzyme-to-protein ratio and incubated at 37°C shaking at 800 rpm. On the next day, beads were immobilized on the magnetic rack and the supernatant containing the digested peptides were transferred to a new vial. Beads were washed once with 1% TFA in water and the supernatant was pooled with the previous supernatant. Peptides were stored at -80°C until further use. For TMT labeling the following modifications were introduced to the protocol. After reduction and alkylation, proteins were reaggregated using ACN (final concentration of 75%). The beads were washed again twice with 80% ethanol and once with 100% ACN to remove the acidic CAA. Next, beads were reconstituted in 100 mM HEPES buffer (pH 8.5) and trypsin was added with a 1:50 enzyme-to-protein ratio. On the next day, the supernatant was removed and dried down. The dried peptides were further reconstituted in 20 µl water (HEPES concentration should be above 40 mM) and the labeling was initiated by adding 5 µl of 11.6 mM TMT reagent as reported 33 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics previously [234]. After incubating the samples for 1 h at RT and 400 rpm, the reaction was terminated using 2 µl of 5% hydroxylamine. Subsequently, all TMT channels were pooled and further desalted using SepPak cartridges as described above. Protein aggregation capture workflow Samples were prepared according to the manufacturer’s protocol (ReSyn Biosciences). In short, magnetic beads were removed from the fridge and kept at RT for 10 min before they were transferred to an Eppendorf tube. Beads were washed twice with 70% ACN and reconstituted with water to a final concentration of 20 µg/µl. SDS lysate was thawed and combined with the bead stock with a 5:1 bead-to-protein ratio. For protein aggregation, ACN was added to a final concentration of 75% and incubated for 20 min at RT shaking at 600 rpm. The subsequent washing steps were performed without removing the reaction tubes from the magnetic rack in order to keep the bead pellet intact. Beads were washed twice with 95% ACN followed by two washes with 70% ethanol. Next, beads were reconstituted in digestion buffer (50 mM Tris/HCl, pH 8.5, 2 mM CaCl2), reduced with 10 mM DTT for 45 min at 37°C, and alkylated with 55 mM CAA for 30 min at RT. Trypsin was added with an enzyme-to-protein ratio of 1:50 and incubated overnight at 37°C and 800 rpm. On the following day, beads were separated and the supernatant was transferred to a new reaction vessel. Remaining peptides were eluted from the beads with 1% TFA and pooled with the previous supernatant. Peptides were stored at -80°C until further use. HILIC workflow Samples were prepared according to the manufacturer’s protocol (ReSyn Biosciences). Briefly, beads were removed from the fridge and kept at RT for 10 min. Next, beads were transferred to a reaction tube, washed twice with equilibration buffer (15% ACN, 100 mM NH 4Ac, pH 4.5), and reconstituted in 2x binding buffer (30% ACN, 200 mM NH 4Ac, pH 4.5) to a final concentration of 20 µg/µl. The bead stock was subsequently mixed with the SDS lysate to obtain a 10:1 bead-toprotein ratio. An equivalent volume of binding buffer was added to the sample to reach a final volume of 50 µl. The solution was incubated at RT for 20 min shaking with 600 rpm. Then, the beads were washed twice with 95% ACN and reconstituted in digestion buffer (50 mM Tris/HCl, pH 8.5, 2 mM CaCl2). Cysteine residues were reduced and alkylated with DTT and CAA, respectively, as described above and trypsin was added to a 1:50 enzyme-to-protein ratio. The digestion was performed overnight at 37°C. On the following day, beads were immobilized on a magnetic rack and peptides were stored at -80°C until further use. StageTip fractionation The high pH fractionation was conducted as described previously [236]. In short, peptides in solution were acidifed (pH 2-3) and loaded onto self-packed StageTips like mentioned above. Peptides were washed once with 0.1% FA before 50 µl of 25 mM ammonium formate (pH 10) was added. The flow-through was reapplied and then transferred to a new vial or plate as this was pooled with one of the fractions later. Next, peptides were step-wise eluted with 5%, 10%, 15%, 17.5%, and 50% ACN in 25 mM ammonium formate. The 5% and 50% fractions as well as the 17.5% fraction and the flow-through were combined. All 6 fractions were frozen and dried down. For dried samples that were not in solution, the protocol was similar. The only difference was that the 34 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics dried peptides were directly resuspended in 25 mM ammonium formate and loaded onto StageTips. Phosphoenrichment using magnetic IMAC-beads The enrichment was performed according to the manufacturer’s protocol (ReSyn Biosciences) with some minor modifications. Briefly, Ti-IMAC and Zr-IMAC beads were removed from the fridge and kept at RT for 10 min. Depending on the experiment, equal amounts of beads were mixed or used separately, respectively. The magnetic beads were transferred to a reaction tube and equilibrated three times for 1 min with loading buffer (80% ACN, 5% TFA). The desalted peptides were reconstituted in 100 µl loading buffer, centrifuged for 5 min at 10,000 xg, and applied to the prepared bead solution. A bead-to-peptide ratio of 2:1 was used for quantities between 100 µg and 200 µg, while a 4:1 and 8:1 ratio was applied for 50 µg and 25 µg, respectively. This is half the bead amount recommended by the manufacturer, but this reduction revealed enhanced performance. For phosphopeptide binding, samples were incubated for 30 min at RT and 1,000 rpm. The beads were immobilized with a magnetic rack and the unbound fraction was discarded or further analyzed as the fullproteome fraction. Unbound and unspecifically bound sample was removed by washing once with loading buffer followed by two consecutive washes with wash buffer 1 (80% ACN, 1% TFA) and wash buffer 2 (10% ACN, 0.2% TFA). Subsequently, phosphopeptides were eluted by adding 100 µl of 1% NH4OH and shaking for 10 min at 800 rpm. This step was repeated two more times to ensure complete elution. The three fractions were pooled and combined with 50 µl of neutralization buffer (50% FA). OnePot workflow Ti-IMAC and Zr-IMAC beads were mixed with equal amounts and washed three times with water to remove the storage solution. Cell lysate was combined with the beads and ACN was added to a final concentration of 75% for protein aggregation. The sample was incubated for 20 min at RT shaking with 800 rpm and subsequently separated on a magnetic rack. The unbound fraction was removed and the beads were washed twice with 80% ethanol and once with 100% ACN. Next, digestion buffer (25 mM TEAB, pH 8, 2 mM CaCl2) was added, proteins were reduced with 10 mM DTT for 45 min at 37°C, and alkylated with 55 mM CAA for 30 min at RT. For protein digestion, trypsin was added to an enzyme-to-protein ratio of 50:1 and, depending on the experiment, incubated overnight or for 3 h at 37°C. For the subsequent phosphopeptide enrichment, 160 µl binding buffer (90% ACN, 6% TFA) was added to the 20 µl of digest and incubated for 45 min at RT and 800 rpm. Then, the beads were separated on the magnetic rack and the unbound fraction was discarded or analysed as the fullproteome. To remove unspecifically bound peptides, beads were washed with loading buffer (80% ACN, 5% TFA) followed by two washing steps with wash buffer 1 (80% ACN, 1% TFA) and wash buffer 2 (10% ACN, 0.2% TFA). Next, phosphopeptides were eluted by adding 100 µl of 1% NH4OH for 10 min at RT and 800 rpm. This step was repeated two more times and the fractions were pooled together with 50 µl of neutralization buffer (50% FA). Samples were stored at -80°C until further use. 35 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics Phosphoenrichment using the Agilent AssayMAP® BRAVO platform The standard programs from the AssayMAP® for cartridge charging (IMAC Cartridge Customization v1.0) and enrichment (Phosphopeptide Enrichment v2.0 App) were performed as recommended by the manufacturer. In short, AssayMAP® Fe(III)-NTA cartridges were stripped using 100 mM EDTA, washed with 0.1% TFA, and charged using 50 mM FeCl 3 with 100 mM acetic acid. The freshly charged cartridges were further primed with 80% ACN in 0.1% TFA and subsequently used. Desalted peptides were reconstituted in 200 µl loading buffer (30% ACN, 0.1% TFA) and loaded onto the cartridges. After washes with 0.1% TFA and 0.1% TFA in 99.9% ACN, phosphopeptides were eluted with 1% NH4OH. Peptides were dried down and stored for further processing. Data acquisition and processing A nanoflow LC-MS setup was used for data acquisition by coupling a Dionex Ultimate 3000 UHPLC+ system to a Fusion Lumos Tribrid mass spectrometer (Thermo Fisher Scientific). Peptides were loaded onto an in-house packed trap column (75 µm x 2 cm, 5 µm C18 resin; Reprosil PUR AQ, Dr. Maisch) and washed for 10 min with 0.1% FA and 5% DMSO. Subsequently, peptides were separated using an analytical column (75 µm x 40 cm, packed in-house with 3 µm C18 resin; Reprosil PUR AQ) with a flow rate of 300 nl/min and an increasing ACN gradient. Measurements were performed in DDA and positive ionization mode. The label-free fullproteome samples were acquired with a linear 50 min gradient from 4% to 32% LC buffer B (0.1% FA, 5% DMSO in ACN) in LC buffer A (0.1% FA, 5% DMSO in MS-grade water). The MS1 scan were recorded with 60,000 resolution in the orbitrap. The automatic gain control (AGC) target was set to 3e5 ions and the maximum injection time (maxIT) to 25 sec. The top 20 MS1 precursors were fragmented using higher-energy collisional dissociation (HCD) and MS2 spectra were subsequently acquired with 15,000 resolution in the orbitrap. Therefore, the AGC target was set to 1e5 charges and the maxIT to 25 sec. The label-free phosphoproteome was recorded with a two-step 80 min gradient. While during the first 50 min a linear gradient from 4-18% LC buffer B was applied, for the last 30 min a linear gradient from 15-27% buffer B was used. The MS2 method was similar to the label-free fullproteome method, except that the resolution for the MS2 scan was 30,000 and the maxIT was set to 120 sec. For the TMT6-plex labeled full proteome, a linear 50 min gradient from 8% to 34% buffer B (0.1% FA, 5% DMSO in ACN) in LC buffer A (0.1% FA, 5% DMSO in MS-grade water) was used. Full scan MS1 spectra were recorded at 60,000 resolution and a scan range from 360-1300 m/z in the orbitrap. The AGC target was set to 4e5 charges and a maxIT of 20 ms was used. Isolated precursor from the MS1 scan were fragmented via collision-induced dissociation (CID) and acquired with 15,000 resolution in the orbitrap. The AGC target was set to 5e4 charges and the maxIT to 22 ms. The MS3 spectra were obtained via SPS-MS3, which simultaneously selects 10 MS2 fragments that are further fragmented via HCD and read out in the orbitrap with 15,000 resolution. For this, the AGC target was set to 1e5 charges and the maxIT to 50 ms. A few minor changes were made for the TMT11-plex labeled FP samples. MS2 spectra were acquired in the ion trap (rapid mode) with an AGC target of 2e4 charges and a maxIT of 60 ms. SPS-MS3 spectra were recorded with 50,000 resolution, an AGC target of 1.2e5 charges and a maxIT of 120 ms. 36 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics The TMT-labeled phoshopeptides were acquired with a linear 80 min gradient from 4% to 32% LC buffer B. MS1 spectra were recorded with 60,000 resolution in the orbitrap, an AGC target of 4e5 charges, and maxIT of 20 ms (TMT6 samples) or 50 ms (TMT11 samples), respectively. MS1 precursors were isolated and fragmented via CID for subsequent MS2 spectra acquisition in the orbitrap. For this, 15,000 resolution, and AGC target of 5e4 charges and a maxIT of 22 ms was used. The following MS3 scan was recorded at 15,000 resolution in the orbitrap for the TMT6labeled samples and with 50,000 resolution for the TMT11-labled samples. Database searching Raw files were searched with the Maxquant software with its search Engine Andromeda [210, 237] (version [1.6.2.3]) against the UniProtKB human reference list (downloaded 22.07.2013). Default setting were applied, unless stated otherwise. The enzyme trypsin was specified as protease allowing for up to two missed cleavages. Carbamidomethylation of cysteine was set as a fixed modification, while oxidation of methionine, and N-terminal protein acetylation were defined as variable modifications. In addition, serine, threonine, and tyrosine phosphorylation was set as variable modification for the phosphoproteome. A target-decoy approach was used to adjust the data to 1% PSM and protein FDR. For the TMT-labeled samples, MS3-based reporter ion quantification was enabled and the corresponding TMT correction factors were added. Data processing Information about the peptide IDs and quantification were obtained from the summary.txt and evidence.txt, which are both output files from the Maxquant software (version [1.6.2.3]). Entries with reverse hits or potential contaminants were excluded from the analyses. Protein information was based on the proteingroups.txt from which ‘only identified by site’, reverse hits, and potential contaminants entries were removed. Phosphosites from the phosphoproteome analyses were retrieved from the phosphosite.txt. Again, reverse hits and potential contaminants were excluded and entries with a localization probability <0.75 were removed. To calculate the phosphopeptide selectivity, reverse hits and potential contaminants were removed from the evidence.txt and the ratio of phosphorylated/unphosphorylated entries was calculated. For data processing and visualization, mostly Microsoft Excel, Perseus software suite [238, 239] (v.1.6.2.3), GraphPad Prism 5, and RStudio (version [4.0.2]) were used. 37 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics 4 Results and discussion A multitude of different sample preparation methods in the field of (phospho)proteomics have been developed over the last years. In this chapter several approaches were evaluated and optimized with a special emphasis on low input quantities. 4.1 Optimizing sample preparation for low input amount and TMT-labeling At first, the iST approach was benchmarked against an in-solution workflow, which is very common in bottom-up proteomics. For the standard in-solution workflow, cells were harvested and lysed in urea buffer, before the resulting protein was transferred into a new Eppendorf tube. Proteins were digested and subsequently desalted in separate StageTips. On the contrary, the iST approach was entirely performed in a single StageTip. For this, StageTips were prepared and cells were directly transferred onto the C18 disks. Then, lysis buffer was added and proteins were digested, desalted, and subsequently eluted from the same StageTip. To assess the performance on low starting amounts, 10,000 HL60 cells were processed in parallel with the in-solution workflow either using standard 1.5 ml or 0.5 ml low-binding tubes as well as with the iST protocol. Each method was performed in dublicate and proteins were subsequently acquired with MS. Utilizing the 0.5 ml low-binding tubes resulted in an increase of about 1,800 peptides and 150 proteins compared to the standard 1.5 ml tubes (II-Figure 1A). However, results from the iST approach still demonstrated superior performance, as more than 2,000 peptides and 200 proteins were identified here. Hence, these experiments demonstrated that minimizing the reaction tube surface area and the manual sample handling steps, were advantageous for low input amounts, as this decreased sample loss during the preparation process. Based on these promising results, the next step was to optimize and minimize sample loss of the iST protocol. For this, the digestion volume was decreased from 250 µl to 55 µl and 10 µl pipette tips instead of the 200 µl were used for sample preparation. With this optimized protocol, approximately 5 times more peptides and twice as many proteins were obtained than in the standard iST protocol using only 2,000 cells as starting material (II-Figure 1B). With more than 9,000 peptides and 2,000 proteins, the optimized protocol yielded similar numbers as the first experiment (II-Figure 1A) despite applying only a fifth of the starting material. The experiment was performed in dublicate and demonstrated good quantitative reproducibility based on the peptide abundance (II-Figure 1C). To assess the detection limit, a downscaling of input amount was conducted next. For this, 2,000, 1,000, 500, and 100 cells were processed with the optimized workflow. Although the reduction from 2,000 to 1,000 cells led to a substantial decrease, still around 1,500 proteins were detected (II-Figure 1D). However, for less input material no considerable number of peptides/proteins were detected, which implies that the lower detection limit of this approach was reached. 38 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics II-Figure 1: Optimizing the iST workflow. (A) Barplots depicting the numbers of peptides and proteins identified with the standard in-solution workflow (green) in 1.5 ml or 0.5 ml tubes, respectively, as well as the iST approach (orange). The experiment was performed in technical dublicate and each bar represents one replicate. (B) Number of peptides and proteins identified with the standard and the optimized iST protocol. (C) Scatterplots showing the correlation between replicates of peptides identified with the standard (n=743 peptides, left panel) or the optimized iST protocol (n=6,149 peptides, right panel). (D) Barplots showing the number of peptides and proteins identified with decreasing input amounts using the optimized iST protocol. Bars represent the mean of three replicate and the error bars depict the standard deviation. Next, the optimized iST protocol was compared with the previously published SP3 workflow [160], which utilizes paramagnetic beads for protein aggregation followed by washing steps and subsequent digestion. For this, 2,000 HL60 cells were processed separately in triplicate with both protocols and applied to a mass spectrometer. Approximately 40% more peptides and 30% more proteins were identified with the SP3 approach (II-Figure 2A). Furthermore, the number of peptides with one or two missed cleavage sites, which constitutes a relevant issue of the iST method, was greatly decreased (II-Figure 2B). One reason for the increased digestion efficiency in the SP3 method might be the improved removal of urea and other reagents potentially diminishing trypsin activity. Another reason for enhancement might be the lower digestion volume enabling a higher trypsin concentration and a higher digestion efficiency due to increased trypsin-protein interactions. While the number of identified peptides was higher for SP3 compared to iST, both approaches were similarly reproducible (II-Figure 2C). Furthermore, the peptide quantification showed comparable R2 values when analysing raw intensities of the respective replicates (II-Figure 2D). 39 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics II-Figure 2: Comparison of iST and SP3 workflows. (A) Number of identified peptides and proteins from 2,000 HL60 cells with the optimized iST and the SP3 approach. Bars represent the mean and error bars the standard deviation of the three replicate. (B) Stacked barplot showing the percentage of 0, 1, and 2 missed cleavage sites of the three replicate derived with the iST or SP3 protocol, respectively. (C) Peptide overlap between replicates (R1-R3) of both approaches. (D) Scatterplot and coefficient of determination comparison between the peptides of two replicates acquired from the iST and SP3 workflow. As the SP3 protocol was superior for label-free samples from minute quantities, its performance and reproducibility with TMT labelling was assessed in a next step. This method offers the advantage of increasing sample material by multiplexing up to 18 samples [217], which is desirable for comprehensive phosphopeptide enrichment as well as for deep-fractionation. To validate this workflow, triplicate of 160 µg protein lysate were processed with the SP3 or a standard in-solution protocol, respectively. For the in-solution protocol, protein lysate was digested overnight in Tris/HCl buffer, transferred onto SepPak cartridges, desalted, and dried down. The dried peptides were reconstituted in HEPES buffer and subsequently TMT-labeled. In contrast, the SP3 workflow reduced the number of manual handling steps as well as the processing time. First, proteins were precipitated on beads and washed. Then, contrary to the in-solution protocol, proteins were digested in HEPES buffer. The digested peptides were dried down and dissolved with water for subsequent TMT labelling. As interfering substances were washed away before the digestion, the additional SepPak desalting step from the in-solution protocol was superfluous for the SP3 approach. The two methods were not only benchmarked based on the fullproteome, but also on phosphoproteome level. Due to the sub-stoichiometric nature of phosphopeptides, their detection is very challenging and usually accompanied by special enrichment strategies. Here, the TMT-labeled peptides were enriched for phosphopeptides via a Fe3+-IMAC column, while the flowthrough, which contained the unphosphorylated fullproteome, was collected, and further fractionated using StageTips. The fullproteome as well as the phosphoproteome were acquired with a nanoflow system coupled to an Orbitrap Fusion Lumos, which was operated in MS3 mode with a 60 minute or 90 minute gradient, respectively. The SP3 approach outperformed the in- 40 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics solution protocol on the number of detected peptides, proteins, and P-sites (II-Figure 3A). However, the labeling efficiency (data not shown) for the in-solution experiment was only around 50%. On the contrary, the SP3 workflow led to more than 90% labeling, which is in the expected range based on previous TMT experiments. A reason for the unexpected low labelling efficiency might be the use of a malfunctioning TMT aliquot. Since the half-life of TMT reagent in buffer is only a few minutes, old aliquots can lead to decreased labeling efficiency if they were not stored adequately or for an extended period of time. This was presumably the case, as only the reporter intensities of the second replicate (R2) was considerably decreased (II-Figure 3B). This observation was further supported by comparing the reproducibility between replicates. While the reporter intensities of replicate 1 and 3 correlated nicely (R2 = 0.96), the correlation to R2 was strikingly decreased (II-Figure 3C). In contrast, the SP3 approach yielded comparable levels of reporter ion intensities (II-Figure 3B) as well as high quantitative reproducibility between replicates (II-Figure 3C). II-Figure 3: TMT labelling with the SP3 workflow. (A) Overlap of proteins, peptides, and P-sites identified with the in-solution and the SP3 approach. (B) Boxplots showing the log2 intensities of each TMT channel. Numbers depict the median. (C) Multi-scatter plot and coefficient of determination between replicate based on the reporter intensities of the identified proteins with both sample preparation workflow. 41 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics In summary, although optimizing the iST protocol increased the number of detected peptides 5 times compared to the standard iST protocol, the SP3 approach was superior for low input quantities in a label-free setting. In addition, this approach comprises advantages like the removal of detergents, scalability, and automation capability [162]. Furthermore, the coupling to subsequent TMT-labeling revealed high reproducibility indicating the high application area of this workflow from low input amount to very deep full- and phosphoproteome coverage. 4.2 Benchmarking different beads for fullproteome analysis The following results are largely based on data obtain from Karl Kristian Krull during his internship “Influence of surface modification in SP3 technology for functional proteome analysis” and his master thesis ”Phosphoproteomics for low input amount using paramagnetic IMAC beads” conducted under the author’s supervision at the Chair of Proteomics and Bioanalytics at the Technical University of Munich. In light of the promising results obtained with the SP3 protocol, several other magnetic beads from ReSyn Biosciences (Pty) ltd were tested as alternative approaches. Compared to the carboxyl-coated SP3 beads, ReSyn Biosciences offers beads with various different surface modifications, such as amine, hydroxyl, or carboxyl groups. These magnetic microparticles are based on a patented hyper-porous matrix, which increases the surface area leading to enhanced protein-bead interactions according to the manufacturer. In order to assess the performance regarding proteomic sample preparation, amine and hydroxyl-modified ReSyn Biosciences beads were compared to the previously established SP3 protocol. Therefore, 20 µg of K652 cell lysate was processed in triplicate according to the respective protocols. In short, lysates were incubated with the beads and interfering substances were removed before the proteins were digested overnight. The resulting peptides were loaded onto a StageTip and separated into 6 fractions. Subsequently, each fraction was measured with a 1 h gradient on a Lumos Fusion mass spectrometer and the acquired data was further searched with Maxquant against a human database. In total around 78,000 peptides were identified with the SP3 and amine beads, whereas slightly less was detected with the hydroxyl beads (II-Figure 4A). The high overlap of 70% implies that the mechanism of protein capture is similar for all three approaches. This is in line with the observation that the identified peptides showed no significant differences in length, hydrophobicity, charge, or molecular mass between the beads types (data not shown). Comparable robustness was demonstrated as approximately 6,700 proteins and 65,000 peptide IDs were obtained in each condition (II-Figure 4B), which was additionally confirmed by the high reproducibility (II-Figure 4C). The high analogy of beads supported the finding of Batth and colleagues which have demonstrated that the protein-bead interaction is solely based on protein precipitation and is thus independent of the functional group immobilized on the microparticles [161]. Nevertheless, the peptide intensities acquired with SP3 and hydroxyl beads were slightly more robust with R2 values around 0.9 compared to the amine beads (II-Figure 4D). Interestingly, the R2 value of SP3 beads was comparable to the previous experiment where only a fraction of the input material had been used (II-Figure 2D). This again emphasizes the robustness of the SP3 bead method for sample-limited experiments. 42 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics II-Figure 4: Evaluating different magnetic beads as alternatives for SP3 beads. (A) Venn diagram showing the overlap of all peptides identified with SP3, amine, and hydroxyl beads. (B) Barplots showing the mean numbers of identified peptides (left panel) and proteins (right panel). Error bars depict the standard deviation of three replicates. (C) Same as (A) but showing the overlap between replicates. (D) Representative scatterplots showing the correlation of log2 raw intensity of peptides between two replicates. In the next step, the commercially available HILIC beads from ReSyn Biosciences were compared with SP3 beads. According to the manufacturer’s protocol, this bead type uses a HILIC-like binding mechanism where proteins are trapped between the bead surface and an aqueous layer. With an ACN concentration of 15% and a pH of 4.5 the binding conditions are clearly different to the SP3 protocol, where an ACN concentration of 70% at a neutral pH is recommended. Complementary to the previous experiment, 20 µg of cell lysates were processed in triplicate and digested according to the SP3 or HILIC protocol, respectively. The resulting peptides were further separated into 6 fractions via high pH StageTips fractionation and subsequently measured with a 1 h gradient. Alike the previous experiment (II-Figure 4B) around 65,000 peptides and 6,800 proteins were identified from each replicate with the SP3 approach (II-Figure 5A). In contrast, approximately 10-30% less protein and peptide numbers were obtained with the HILIC beads (IIFigure 5A). Additionally, the median peptide intensity was 2-4 fold decreased compared to the SP3 beads, indicating that HILIC binding was less efficient and potentially explaining the ID drop. Furthermore, only 50% of peptides were robustly identified in all three replicates, which was the 43 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics least among all tested magnetic bead types (II-Figure 5B, upper panel). The overlap of peptides identified with SP3 and HILIC beads was also low with only 55% (II-Figure 5B, lower panel), which suggests that the HILIC mode enhances the binding of different proteins. In order to elucidate on this in more detail, the hydrophobicity of peptides identified exclusively with either method were analysed. The GRAVY score distribution, which is a measure for hydrophobicity, suggested that peptides identified with the HILIC beads were slightly more hydrophilic (II-Figure 5C). This was further supported by the observation that peptides identified with the HILIC beads had the tendency to elute earlier in the gradient and thus at a lower ACN concentration compared to the SP3 approach (II-Figure 5D). This is contrary to Moggridge and colleagues [240], who reported a small bias towards hydrophobicity with HILIC beads. II-Figure 5: Comparison of SP3 beads and HILIC beads. (A) Number of identified peptides (upper panel) and proteins (lower panel). Barplot show the mean of triplicate and error bars depict the standard deviation. (B) Venn diagrams showing the overlap of identified peptides between replicates from the HILIC approach (upper panel) or between peptides identified with SP3 and HILIC beads (lower panel). (C) Density plot showing the hydrophobicity distribution of peptides exclusively identified with HILIC or SP3 beads, respectively. (D) Peptide elution profile along the 1 h MS gradient. In summary, magnetic beads functionalized with carboxyl, amine, or hydroxyl groups performed similarly well regarding peptide identification, quantification, and reproducibility. This is in line with previous findings postulating that proteins are captured through precipitation rather than a specific binding to the functionalized group [161]. Only the HILIC beads performed slightly worse, which is probably due to the different binding mode and buffer conditions. Noteworthy, ReSyn Biosciences beads are 5-10 times bigger and settled quicker in the reaction tube compared to the SP3 beads, which slightly simplified manual handling steps. Since ReSyn Biosciences beads did not show any compelling benefits, the well-established SP3 beads were further used for digesting proteins. 44 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics 4.3 Magnetic beads for phosphopeptide enrichment Besides the application for fullproteome analysis, magnetic beads can also be employed to enrich phosphorylated peptides. Due to the very low abundance of phosphopeptides, an enrichment is inevitable for a comprehensive analysis. To this regard, ReSyn Biosciences developed magnetic microparticles with Ti4+ and Zr4+ cations coordinated to phosphonate groups which were evaluated in the following experiments. Besides the performance of each individual bead type, a combination of both beads was tested in order to increase the phosphoproteome coverage. Triplicate of 100 µg desalted peptides were applied to Zr-IMAC, Ti-IMAC, or a 1:1 mixture of both beads, respectively. Samples were processed according to the manufacturer’s protocol and further desalted using StageTips before half of the input amount was injected into a Fusion Lumos mass spectrometer. With a 1 h gradient around 7,400 P-sites were detected with the Ti-IMAC and the mixed approach, whereas 7,100 were acquired with the Zr-IMAC beads (II-Figure 6A). The high overlap of IDs indicates a high complementarity between Ti-IMAC and Zr-IMAC explaining why the mixed approach did not result in additional ID gain. All three methods demonstrated high robustness with the identification of 5,700 P-sites on average (II-Figure 6B). Moreover, the phosphopeptide selectivity was highly comparable between the different bead types (II-Figure 6C), albeit it was with 77% slightly lower than alternative enrichment methods, such as Fe-IMAC columns, Ti-IMAC tips, or TiO2 beads, which commonly yield more than 90% selectivity [173, 175]. In addition, also the multiplicity of phosphopeptides and the amino acid composition was similar among the three methods (II-Figure 6D and 6E). In order to evaluate the quantitative reproducibility, the distribution of coefficient of variations (CV) within each triplicate was analyzed (II-Figure 6F). The median CV was between 12% and 14% for all approaches which is comparable to other enrichment strategies [173, 175]. II-Figure 6: Benchmarking different magnetic IMAC-beads for phospho enrichment. (A) Overlap of phosphopeptides detected with Ti-IMAC, Zr-IMAC, or a 1:1 mix of both beads. (B) Number of P-sites identified with either of the three approaches. Barplot represents the average and error bars depict the standard deviation of the triplicates. (C) Same as (B) but for the phosphopeptide selectivity. (D) Stacked barplot showing the ratios of phosphopeptide multiplicity. (E) Same as (D) but for the amino acid 45 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics composition of the identified phosphopeptides. (F) Distribution of the coefficient of variation within each triplicate. In order to optimize the enrichment protocol, different bead-to-peptide ratios were tested. For this, dublicate of 100 µg desalted peptides were processed using a 4:1 bead-to-peptide ratio, as recommended by the manufacturer, and additionally with 2:1 and 8:1 ratios. Again, half of the enriched phospopeptides were measured in a 1 h gradient yielding in total around 9,300 P-sites of which almost 50% were detected with all three approaches (II-Figure 7A). Interestingly, decreasing numbers of identified P-sites indicate that the enrichment is diminished with higher protein-to-bead ratios (II-Figure 7B and 7C). This is potentially caused by the increased binding capacity resulting from the higher bead amount, which in turn enhanced binding and enrichment of non-phosphorylated peptides and therefore decreased enrichment selectivity (II-Figure 7C). However, a high selectivity is crucial as non-phosphorylated peptides are usually more abundant and have higher stoichiometry compared to their modified counterparts. As DDA favours the detection of highly abundant peptides, the identification of phosphorylated peptides would therefore be decreased [241]. This effect is even more prevalent in complex lysates where the acquisition speed is not sufficient to sequence all present peptides [241]. Furthermore, studies have suggested that negatively charged phospo groups suppress the ionization efficiency, which would also favor the identification of their unmodified cognates [242]. In addition to the enhanced identification numbers, the quantitative reproducibility was considerably increased with lower bead-to-peptide ratios (II-Figure 7D). Hence, these results suggest the application of a lower beadto-peptide ratio as recommended by the manufacturer. This is in line with a previous study which demonstrated that decreasing the amount of TiO2 beads was beneficial [243]. Although lowering the bead-to-peptide ratio enhanced phosphopeptide enrichment, low selectivity remained a challenge. In order to investigate the enrichment in more detail, samples were analyzed after each processing step. For this, dublicate of 100 µg desalted peptides were enriched with a bead-to-peptide ratio of 2:1 and washed as recommended by the manufacturer with two different washing buffers (washing buffer 1: 80% ACN, 1% TFA; washing buffer 2: 10% ACN, 0.2% TFA). Samples from the unbound fraction after IMAC-bead incubation (load), after each of the two washing steps, and the eluate were analyzed. As expected, the vast majority of P-sites was detected in the eluate fraction (II-Figure 7E, left panel). However, more than 1,000 P-sites were lost during the second washing step. To retain these phosphopeptides, the washing procedure was modified such that washing buffer 1 was used twice and washing buffer 2 was omitted. This decreased the number of P-sites identified in the second washing step, but did not result in higher IDs in the eluate (II-Figure 7E, middle panel). A third enrichment approach was based on the publication of Tsai and colleagues [244] in which they postulated that full deprotonation of phosphopeptides and thus efficient binding to the Fe-IMAC column was only achieved at a pH above 3. As this comes at the cost of unspecific peptide binding, 6% acetic acid was added to compete and hamper the unspecific binding. However, in this setting the approach did not only identify the lowest number of P-sites but further led to decreased selectivity (II-Figure 7E and 7F, right panels). These results demonstrate that the washing steps vastly influence the enrichment and further optimization is required to reach the full potential of this workflow. 46 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics In summary, Ti-IMAC beads performed slightly better than Zr-IMAC beads and mixing both beads did not increase the phosphoproteome coverage. Samples prepared in parallel showed high reproducibility, however some variation was observed between samples prepared on separate days. While the phosphopeptide selectivity was around 70% (II-Figure 7C) in one experiment, 90% was reached in a conformable experiment (II-Figure 7F) utilizing the same lysates and protocol. Bead stability is likely a reason for the day-to-day variation. In general further optimizations are required in order to reach the robustness and IDs of the well established Fe-IMAC column [173]. II-Figure 7: Optimizing conditions for phospho enrichment with magnetic IMAC beads. (A) Overlap of phosphopeptides identified with bead-to-peptide ratios of 2:1, 4:1, and 8:1. (B) Barplot showing the number of detected P-sites with either approach. Each bar displays one of the two replicates (R1 and R2). (C) Same as (B) but for the phosphopeptide selectivity. (D) Scatterplots of the log2 transformed phosphopeptide abundance of the two replicates R1 and R2. (E) Barplots showing the number of P-sites obtained after the phosphopeptide binding (load), the washing steps (wash 1 and wash 2), and the elution (eluate). Each bar represents a technical replicate. (F) Same as (E) but for the phosphopeptide selectivity. (G) Bar charts displaying the number of peptides obtained with IMAC or SP3 beads, respectively. Bars represent the average of triplicate and error bars depict the standard deviation. Although the magnetic IMAC beads revealed some challenges, this enrichment strategy was further tested for low input material because the previous experiment with magnetic SP3 beads performed particularly well for minute sample quantities (II-Figure 2). Extensive sample handling is one major reason for sample loss and thus unfavorable for sample-limited experiments. Hence, 47 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics the aim was to develop a workflow that can be performed in a single reaction tube with a minimal number of manual steps. This led to the development of the “OnePot” workflow, which allows both protein digestion and phosphopeptide enrichment with magnetic beads (II-Figure 8A). During the initial step proteins from cell lysate were precipitated on IMAC beads and digested according to the SP3 protocol. Changing the buffer conditions enforced the interaction of phosphopeptides with the metal(IV) cations from the IMAC beads, while unspecifically bound peptides were removed through several washing steps. Finally, phosphopeptides were eluted and transferred onto StageTips for desalting or fractionation before they were injected into the mass spectrometer. The initial protein capture and digestion was successfully performed with IMAC beads and yielded similar peptide IDs to the SP3 approach (II-Figure 7G). This result was in agreement with previous experiments, which revealed that protein precipitation is independent of the bead surface (IIFigure 4). Nonetheless, this was a crucial processing step for the OnePot method to be successful. As the phospho enrichment of desalted peptides was also demonstrated (II-Figure 6 and II-Figure 7), only evidence for the efficient combination of both steps was lacking. Therefore, 50 µg to 200 µg of protein lysate were processed to evaluate the novel OnePot protocol. For comparison, the same starting material was digested with SP3 beads, desalted, and subsequently applied to the Agilent AssayMAP® Bravo robot. This liquid handling platform enables phosphopeptide enrichment with Fe(III)-NTA cartridges and has been successfully implemented on minute sample amounts [177]. Despite high reproducibility, the numbers of identified P-sites as well as the phosphopeptide selectivity were considerably lower in the OnePot approach than after the Bravo enrichment (II-Figure 8B). Interestingly, the IDs from the OnePot approach were not correlating with the input amount, which would be expected and was observed from the Bravo samples. As the IMAC beads were not designed for an overnight digestion at a basic pH, these conditions were potentially compromising the bead stability and thus the enrichment efficiency. In order to optimize the workflow, the digestion time was decreased to only 3 hours similar to the protocol reported by Leutert and colleagues [245]. This resulted in around 50% more P-sites and a selectivity of up to 60%, while the standard deviation remained low indicating high robustness (IIFigure 8C). Still, the Bravo workflow acquired more IDs for quantities above 100 µg, whereas the OnePot approach was superior for the lower input amounts (II-Figure 8C). The number of unique phosphopeptides obtained from 25 µg was almost twice as high as in a previous study that used the Bravo system [177] and still 15% more than Leutert and colleagues who developed the R2-P2 workflow [245], a robotic platform for phospho enrichment using magnetic beads. Noteworthy, the experimental setup as well as the mass spectrometers were different in these studies which makes a one-to-one comparison difficult. While the low standard deviation from the OnePot confirmed high robustness over the complete range of sample amount, the high variation of the Bravo system (II-Figure 8C) was unexpected as this method worked reproducibly in multiple different in-house experiments and studies before [176, 177]. To conclude, even though the OnePot revealed promising results from small sample quantities, a similar day-to-day variation was observed as described before (II-Figure 6 and II-Figure 7). This challenge has to be overcome in order to exploit the full potential of this workflow and to make it widely applicable. However, a benefit of this approach is the potential automatization, which would increase sample throughput as well as robustness. In principal automatization should be 48 | P a g e Chapter II: Optimizing sample preparation workflows for proteomics easily implementable as the protein digestion [162] as well as the phosphopeptide enrichment [245, 246] have been performed with pipetting robots before. II-Figure 8: Comparing phosphopeptide enrichment of the OnePot approach with the Agilent Bravo platform. (A) Schematic workflow of the OnePot. (B) Number of identified P-sites (upper panel) and phosphopeptide selectivity (lower panel). Bar represents the average of the triplicate and error bars display the standard deviation. (C) Same as (B) but with 3 h digestion instead of overnight. 49 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation 1 Summary ............................................................................................................................ 53 2 Introduction ....................................................................................................................... 54 3 Material and methods ........................................................................................................ 55 4 Results and discussion ........................................................................................................ 60 4.1 Proteomics to study the temporal expression changes during hepatocyte differentiation .......................................................................................................................................... 60 4.2 Protein changes suggest wide-ranging metabolic switch from hepatic endoderm to immature hepatocytes ...................................................................................................... 62 4.3 Phosphorylation changes regulating cell cycle precede dynamics on proteome level ... 64 4.4 Biological replicate confirms hepatocyte-specific protein changes ............................... 69 4.5 Widespread changes in protein families accompany hepatocyte differentiation .......... 72 4.6 Temporal protein profiling reveals novel stage-specific markers .................................. 75 4.7 New insights into hepatocyte differentiation by WNT signaling .................................... 78 51 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation The following chapter is largely based on the publication “High temporal resolution proteome and phosphoproteome profiling of stem cell-derived hepatocyte development” [247] published in Cell Reports on March 29, 2022. Authors contributions for [247]: Conceptualization, J.K., B.T., and B.K.; methodology, J.K., K.S., A.B., M.B., R.Y., A.K., H.T., B.T., J.G.C., and B.K.; software, J.K., P.S., and M.W.; validation, J.K., P.S., M.W., M.B., A.K., and B.K.; formal analysis, J.K., P.S., M.W., and B.K.; investigation, J.K., K.S., A.B., R.Y., and M.B.; resources, J.K., K.S., A.B., M.W., B.T., J.G.C., and B.K.; data curation, J.K., P.S., and M.W.; writing – original draft, J.K. and B.K.; visualization, J.K., P.S., and M.W.; supervision, B.T., J.G.C., and B.K.; project administration, J.K. and B.K.; funding acquisition, B.T. and B.K. 52 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation 1 Summary Pluripotent stem cell differentiation provides a novel resource of generating hepatocytes in a standardized and expandable way. To elucidate on the complex biological processes occuring during the in vitro differentiation, deep proteome and phosphoproteome datasets with high temporal resolution of two independent iPSC lines were acquired. This led to the quantification of more than 9,000 proteins, which offered sufficient depth to separate the different intermediates along the maturation with PCA analysis. A massive metabolic switch towards higher utilization of oxidative phosphorylation and fatty acid degradation was observed between the hepatic endoderm and immature hepatocyte-like stage. Simultaneously, multiple proteins associated to DNA replication showed a congruent drop in abundance. Interestingly, this drop on protein level was preceded on cell cycle checkpoint phosphorylation. Furthermore, statistical analysis revealed the differential expression of 78 stage-specific novel protein markers, which demonstrated the high regulation of multiple WNT-related activators and inhibitors. The high temporal resolution led to a detailed roadmap through hepatocyte differentiation, which does not only show alteration of biological processes, but is furthermore able to propose potential key regulators and elucidate on the activities of multiple kinases. These datasets enable to appreciate the timely sequence of biological processes and suggest starting points for future protocol improvements in order to increase the maturation of in vitro hepatocytes. 53 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation 2 Introduction Stem cells possess the ability to regenerate and to differentiate into various different cell types, which make them an attractive model system for regenerative medicine, developmental processes, and various diseases. Depending on their differentiation potential they can be classified into totipotent, pluripotent, multipotent, and unipotent cells [1]. One widely used model system for directed cell differentiation are ESCs, but they come with restricted accessibility and ethical concerns. In 2006 Yamanaka and colleagues discovered that somatic cells can regain pluripotency by transducing the transcription factors Oct4, c-Myc, Klf4, and Sox2 [7, 8]. This discovery led to the generation of iPSCs as an alternative to ESCs. Since iPSCs can be easily generated, expanded, and have less ethical contraints, the stem cell research was highly simplified, thereby, paving the way for a much broader application in medical research. Based on this knowledge, tissue-specific cell types from each germ layer can nowadays be derived from iPSC by the addition of various different supplements [248-251]. For hepatocyte differentiation, iPSCs are initialy differentiated towards the DE lineage. This is achieved by the supplementation of activin A from the TGF-ß family and by activation of the WNT pathway [103], either with the addition of WNT3A or the small molecule CHIR99021, which inhibitis glycogen synthase kinase 3 [252]. The DE is a progenitor stage for multiple endoderm-derived organs like the pancreas, the liver or the intestines. Next, hepatocyte differentiation is promoted by the addition of hepatocyte growth factor, dexamethasone, and oncostatin M. Although, stem cell research helped to understand basic concepts of directed differentiation towards specific cell types, the in vitro models mostly result in a heterogenous mix of cells. In order to improve the thruthful mimicry of fully mature hepatocytes, several studies have tried to characterize the differentiation of stem cell-derived hepatocytes in detail. However, such studies were based on a limited number of selected proteins [123, 253, 254] or solely on mRNA expression [255]. Camp and colleagues for example have used single-cell transcriptomics to shed light on the cell lineage progression and to derive information about the heterogeneity of cell populations during hepatocyte differentiation [100]. However, as the correlation between mRNA and protein expression is not fully understood [140], such studies have limitations in fully recapitulating the differentiation process. Since proteins and not mRNAs are the key players controlling most of the biological functions, a thorough investigation of the global proteome is inevitable to understand hepatocyte differentiation in more detail. Few studies focusing on protein expression have been performed so far, but insights have either been limited to early embryogenesis [62, 63, 256, 257] or have not provided a comprehensive proteome depth and data analysis [258, 259]. Besides protein expression, studies of PTMs, such as phosphorylation or acetylation, are of special interest as they are key regulators of many enzymes and control various signaling pathways. In recent years the depth and robustness of such experiments has been increased but their potential to understand and improve in vitro differentiations has not been exploited. To elucidate to this end, deep proteome, phosphoproteome, and acetylome data was acquired from multiple developmental stages along hepatocyte differentiation of two iPSC lines from different donors. 54 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation 3 Material and methods 2D Hepatocyte differentiation Hepatocyte-like cells were generated from two different human iPSC cell lines, TkDA3-4 [260] and Ff-I01 [123]. Both cell lines were tested for mycoplasma before the experiment was conducted. The TkDA3-4 cells were differentiated like stated previously [100]. Briefly, TkDA3-4 cells were cultivated in standard feeder-free conditions in mTeSR1 (StemCell Technologies) on laminin 511E8 (iMatrix-511, Nippi)-coated dishes, dissociated using Accutase® (Sigma-Aldrich) and seeded in RPMI 1640 (Gibco™) medium with 1% B27™ (Gibco™), 50ng/mL WNT3a (R&D Systems), and 100 ng/mL activin A (Sigma-Aldrich) on laminin 511-E8 (iMatrix-511, Nippi)-coated dishes. For the first day after seeding, 10 µM ROCK Inhibitor Y-27632 (Fujifilm Wako Pure Chemical) was supplemented. Between day 6 and day 13, cells were cultivated in KnockOut™-DMEM (Gibco™) with 1% (vol/vol) DMSO (Sigma-Aldrich), 20% (vol/vol) KnockOut™ Serum Replacement, 1 mM GlutaMax™, 1% (vol/vol) non-essential amino acids, and 0.1 mM β-mercaptoethanol (all Gibco™). Lastly, cells were cultured in hepatocyte culture medium (Lonza) without EGF and supplemented with 20 ng/mL hepatocyte growth factor and 20 ng/mL oncostatinM (both R&D Systems) until day 21. The medium was exchanged daily during the differentiation process and the cell morphology was monitored by microscopy. For qPCR and proteomics analysis, samples at day 0 (iPSC), day 6 (definitive endoderm, DE), day 10 (hepatic endoderm, HE), day 13 (immature hepatocyte, IH), and day 21 (mature hepatocytes, MH) were harvested and further processed. The Ff-I01 cells were differentiated like described earlier [253] and were kindly provided from our collaborators from the research group of Dr. Keisuke Sekine (Department of Regenrative Medicine, Yokohama City University, Japan; Laboratory of Cancer Cell System, Tokyo, Japan). In short, cells were cultivated in StemFit™ Basic03 (Ajinomoto) medium with 80 ng/ml bFGF (Fujifilm Wako Pure Chemical) and dissociated for differentiation using Accutase® (Sigma-Aldrich). Dishes were coated with laminin 511-E8 (iMatrix-511, Nippi) and cells were seeded in the presence of ROCK inhibitor Y-27632 (Fujifilm Wako Pure Chemical). For the first 6 days, cells were cultivated in RPMI 1640 (GIBCO™) supplemented with 20% StemFit™ For Differentiation and 100 ng/mL activin A (both Ajinomoto). Additionally, 2 µM CHIR99021 (Cayman Chemical) was added for the first 3 days and 0.5 mM sodium butyrate (Sigma-Aldrich) was supplemented from day 1 to day 4, which resulted in almost 100% CXCR4 positive cells [253]. From day 6 to day 13, cells were cultured in StemFit™ Basic03 (Ajinomoto) medium supplemented with 1% DMSO (Sigma-Aldrich), 0.1 mM β-mercaptoethanol, 0.5% L-glutamine, and 1% non-essential amino acids (all Gibco™). The medium was exchanged daily during the first 13 days of differentiation. For the final 8 days, cells were cultured in DMEM medium (GIBCO™) supplemented with 5% StemFIT™ For Differentiation (Ajinomoto) and 0.1 µM dexamethasone (Sigma-Aldrich). The medium was exchanged every second day. At day 0 (iPSC), day 6 (DE), day 10 (HE), day 13 (IH), and day 21 (MH) samples were taken for further qPCR and proteomics analysis. RNA extraction and qPCR analysis RNA from the harvested cells was isolated using the RNeasy Kit (QIAGEN) by following the manufacturer´s protocol. The concentration was determined via Nanodrop2000 and around 150 ng of RNA were further used for cDNA synthesis using the iScript™ cDNA Synthesis Kit (Bio- 55 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation Rad). For each reaction, 500 ng of cDNA were combined with the SensiMix SYBR Kit (Bioline) and data was acquired with the QuantStudio 3 Real-Time PCR System (Thermo Fisher Scientific). Raw data was normalized to the housekeeping gene GAPDH (TkDA3-4) or to the S18 RNA (Ff-I01) before the averaged 2(-Δ CT) was calculated. The RNA extraction and qPCR measurement for the Ff-I01cells was performed by our collaborators from the research group of Dr. Keisuke Sekine (Department of Regenrative Medicine, Yokohama City University, Japan; Laboratory of Cancer Cell System, Tokyo, Japan). For the heatmap visualizations, the calculated values were z-scored across all five time points. A list of all used primers can be found in 0-Table 1. Protein extraction, digestion, and labeling For cell harvest, medium was removed and cells were washed with PBS before they were detached with Accutase. Then, cells were pelleted and washed twice with PBS (w/o CaCl 2/MgCl2, SigmaAldrich) and resuspended with lysis buffer containing 8 M urea, 40 mM Tris/HCl (pH 7.6), 1x EDTAfree protease inhibitor (cOmplete™, Roche), and 1x phosphatase inhibitor mix (prepared in-house according to the Phosphatase Inhibitor 1, 2, and 3 from Sigma-Aldrich). Cell lysates were frozen at -80°C, thawed on ice, and centrifuged at 20.000 xg at 4°C for 20 min before the protein concentration was determined via the Pierce™ Coomassie Bradford solution (Thermo Scientific). From each time point, 100 µg (for TkDA3-4 cells) or 70 µg (for Ff-I01 cells) protein aliquots were reduced with 10 mM DTT for 45 min at 37°C, alkylated for 30 min at RT, and subsequently diluted below 1.6 M urea using 40 mM Tris/HCl (pH 7.6). For subsequent pre-digestion, trypsin was added at a 1:100 enzyme:substrate ratio and incubated at 37°C shaking with 700 rpm. After 3 h, the same amount of trypsin was additionally added to a final 1:50 ratio, and proteins were digested overnight. The reaction was stopped by adding FA to a final concentration of 1%. Next, peptides were desalted using self-packed StageTips as described previously [164]. For this, C18 material (Octadecyl Extraction Disks, 3M Empore ™) was packed into a 200 µL pipette tip and the acidified peptides were loaded. After washing twice with 0.1% FA, peptides were eluted with 50% ACN in 0.1% FA, and dried down. TMT labeling was performed as described previously [234]. In short, dried peptides were reconstituted in 20 µl 50 mM HEPES (pH 8.5) and 5 µl of 11.6 mM TMT reagent was added and incubated for 1 h at RT shaking at 400 rpm. The reaction was stopped by adding 2 µl of 5% hydroxylamine (Sigma-Aldrich) and all channels were pooled. The reaction vessels were rinsed with 10% FA in 10% ACN and combined to the pooled samples. The labeled peptides were dried down and further desalted using 50 mg SepPak columns (Water Corp.). After loading, peptides were washed with 0.07% TFA and eluted with 0.07% TFA in 50% ACN. Phosphopeptide enrichment, immunoprecipitation, and off-line fractionation Phosphopeptides were enriched using a Fe-IMAC column (Thermo Fisher Scientific) connected to an Aekta HPLC system (GE Healthcare Life Sciences) as described previously [173]. Samples were loaded in IMAC loading buffer (0.07% TFA in 30% ACN) onto the column, while the unbound full proteome flow-through was collected and dried for further analysis of the non-phosphorylated fullproteome. The bound phosphopeptides were eluted from the column with an increasing gradient of elution buffer (0.315% NH4OH) and dried. Phosphopeptides were separated into 6 fractions using a micro-column format (StageTips with 5 discs of C18 material, 3M Empore™) and 56 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation an increasing ACN concentration (5%, 7.5%, 10%, 12.5%, 15%, 17.5%, and 50%) [236]. To obtain 6 fractions, the 5% fraction was combined with the 50% fraction and the 17.5% fraction with the flow-through. The desalted and fractionated phosphopeptides were dried and stored at -20°C until they were measured on the mass spectrometer. The non-phosphorylated IMAC flow-through from the TkDA3-4 cells was further enriched for acetylated peptides via immunoprecipitation according to the manufacturer´s protocol with some modifications. PTMScan® Acetyl-Lysine Motif beads were aliquoted (1/8 of antibody kit per enrichment) and washed with PTMScan® IAP (immunoaffinity purification, both Cell Signalling Technology) buffer and ice-cold PBS. The dried peptides were reconstituted in 1 ml IAP buffer, mixed with the antibody beads, and incubated at 4°C for 1 h on an end-over-end rotator. Beads were pelleted at 2,000 xg and the unbound full proteome was retained for downstream analysis. The beads were further washed with IAP buffer and PBS. The acetylated peptides were eluted with 0.15% TFA and desalted using StageTips (3 disks of C18 material, 3M Empore™). Desalted peptides were dried and stored at -20°C until they were measured on the mass spectrometer. The full proteome fractions were further fractionated via Trinity (TkDA3-4 cells) or high-pH reversed-phase fractionation (Ff-I01 cells) as described previously [170]. For Trinity fractionation, samples were reconstituted in 10 mM NH4OAc (in water, pH 4.7) and loaded onto an Acclaim AmG C18 column (2.1x150 mm, Thermo Scientific) connected to a Dionex Ultimate 3000 HPLC system (Thermo Fisher). Peptides were eluted with an increasing gradient of elution buffer (10 mM NH4OAc in ACN) and 32 fractions were collected. For high-pH reversed-phase fractionation, samples were reconstituted in 25 mM NH4HCO3 (pH 8) and loaded onto a C18 column (XBridge BEH130, 3.5 µm, 2.1x150mm, Waters Corp.) coupled to a Dionex Ultimate 3000 HPLC system (Thermo Fisher). Peptides were eluted with an increasing ACN concentration and 96 fractions were collected and further pooled to 48. Fractions were dried and stored at -20°C until measurement. Data-dependent LC-MS acquisition A nanoflow LC-MS setup was used for data acquisition by coupling a Dionex Ultimate 3000 UHPLC+ system to a Fusion Lumos Tribrid mass spectrometer (Thermo Fisher Scientific). Peptides were loaded onto an in-house packed trap column (75 µm x 2 cm, 5 µm C18 resin; Reprosil PUR AQ, Dr. Maisch) and washed for 10 min with 0.1% FA and 5% DMSO. Subsequently, peptides were separated using an analytical column (75 µm x 40 cm, packed in-house with 3 µm C18 resin; Reprosil PUR AQ) with a flow rate of 300 nl/min and an increasing ACN gradient. Measurements were performed in DDA and positive ionization mode. For the TMT6-plex labeled full proteome, a linear 50 min gradient from 8% to 34% buffer B (0.1% FA, 5% DMSO in ACN) in LC buffer A (0.1% FA, 5% DMSO in MS-grade water) was used. Full scan MS1 spectra were recorded at 60,000 resolution and a scan range from 360-1300 m/z in the orbitrap. The AGC target was set to 4e5 charges and a maxIT of 20 ms was used. Isolated precursor from the MS1 scan were fragmented via CID and acquired with 15,000 resolution in the orbitrap. The AGC target was set to 5e4 charges and the maxIT to 22 ms. The MS3 spectra were obtained via SPS-MS3, which simultaneously selects 10 MS2 fragments that are further fragmented via HCD and read out in the orbitrap with 15,000 resolution. For this, the AGC target was set to 1e5 charges and the maxIT to 50 ms. The following minor changes were made for the TMT11-plex labeled FP 57 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation samples. MS2 spectra were acquired in the ion trap (rapid mode) with an AGC target of 2e4 charges and a maxIT of 60 ms. SPS-MS3 spectra were recorded with 50,000 resolution, an AGC target of 1.2e5 charges, and a maxIT of 120 ms. For phoshopeptide analysis, a linear 80 min gradient from 4% to 32% LC buffer B was used. MS1 spectra were recorded with 60,000 resolution in the orbitrap, an AGC target of 4e5 charges, and maxIT of 20 ms (TMT6 samples) or 50 ms (TMT11 samples), respectively. MS1 precursors were isolated and fragmented via CID for subsequent MS2 spectra acquisition in the orbitrap. For this, 15,000 resolution and AGC target of 5e4 charges and a maxIT of 22 ms was used. The following MS3 scan was recorded at 15,000 resolution in the orbitrap for the TMT6-labeled samples and with 50,000 resolution for the TMT11-labeled samples. Acetylated peptides were eluted in a linear 100 min gradient from 6% to 34% LC buffer B. The MS1 full scan was recorded in the orbitrap with 60,000 resolution within a scan range of 360-1,300 m/z, an AGC target of 4e5 charges, and a maxIT of 20 ms. MS1 precursors were fragmented via CID and subsequent MS2 spectra were acquired in the orbitrap with 15,000 resolution, an AGC target of 1e5 and a maxIT of 200 ms. For the SPS-MS3 spectra, 10 precursors were selected simultaneously and recorded in the orbitrap with 15,000 resolution, an AGC target of 1.2e5 charges, and a maxIT of 300 ms. Database searching Raw files were searched with the Maxquant software with its search Engine Andromeda [210, 237] (version [1.6.2.3]) against the UniProtKB human reference list (downloaded 22.07.2013). Default setting were applied, unless stated otherwise. The enzyme trypsin was specified as protease allowing for up to two missed cleavages. Carbamidomethylation of cysteine was set as a fixed modification, while oxidation of methionine, and N-terminal protein acetylation were defined as variable modifications. In addition, serine, threonine, and tyrosine phosphorylation was set as variable modification for the phosphoproteome as well as lysine acetylation for the acetyl-IP. A target-decoy approach was used to adjust the data to 1% PSM and protein FDR. MS3-based TMT reporter ion quantification was enabled and the corresponding TMT correction factors were added. Fullproteome data processing For yielding protein quantification, the Maxquant output proteingroups.txt was used. Reversed hits and proteins that were only identified by site were removed. Then, the reporter ion intensities were log2 transformed and normalized to the averaged median of all 6 or 11 TMT channels. Batch effects between replicates were removed with ComBat [261] from the ´sva´package (version [3.30.1]) in RStudio (version [4.0.2]). Differentially expressed proteins were determined by analysis of variance (ANOVA) test with multiple testing correction according to the BenjaminiHochberg (BH) procedure. A protein with a fold change >2 and an FDR <0.05 at one or more time points was definied as significant. Significantly changing proteins were row-wise z-scored and hierarchically clustered with the Perseus software [238]. Kyoto Encyclopedia of Gene and Genomes (KEGG) enrichment analysis was performed with the ´clusterProfiler´ [262] package (version [3.16.0]) using an FDR (BH corrected) threshold of 0.05. The GeneRatio is defined as the ratio of k/n, where k is the size of the overlap of the input proteinset with the specific KEGG 58 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation pathway and n is the size of the overlap of the input proteinset with all possible KEGG pathways. The principal component analysis (PCA) was performed with the ‘factoextra’ package (version [1.0.7]) and plotted with the ‘ggplot2’ package (version [3.3.2]). Transcription factor-target relationships were derived from TRRUST [263] and analysed for differentially expressed protein. Phosphoproteomics and acetyl-IP data processing Phosphoproteome quantification was deduced from the Maxquant output phosphosite.txt. Entries with a localization probability <0.75 were removed as well as reverse hits. Reporter ion intensities were log2 transformed and normalized with the correction factors of the corresponding full proteome data set. Batch effect correction was again removed with the ComBat package. Congruent to the full proteome, differentially expressed P-sites were defined by an FDR <0.05 (ANOVA with BH correction) and a fold change >2. Kinase-substrate relationships were predicted using the networkin web-tool [264]. The acetyl-IP data was processed like the phosphoproteome except that the quantitative analysis was performed on the acetyl(K)sites.txt. scRNA-seq data processing Transcriptomics data was derived from a previously published work [100] that studied hepatocyte differentiation with the same TkDA3-4 cell line. The transcriptomes of 425 single cells were acquired at the iPSC (n=80), DE (n=70), HE (n=113), IH (n=81), and MH (n=81) stages along this process. The normalized log2(FPKM) expression of all single cells at one time point were averaged to make it comparable to the bulk proteomics dataset. 59 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation 4 Results and discussion In order to elucidate biological processes during in vitro hepatocyte differentiation, temporally resolved protein expression, phosphorylation, and acetylation dynamics were acquired using quantitative MS. 4.1 Proteomics to study the temporal expression changes during hepatocyte differentiation Hepatocyte differentiation was studied based on a previously published protocol [100] using human iPSCs (TkDA3-4 cells) in a 2D approach. First, stem cells were supplemented with activin A and WNT3a to form DE (III-Figure 1A), which is the common origin of multiple organs such as the liver, pancreas, and the intestines. After 6 days, the differentiation medium was modified and cells were further specified towards the hepatic endoderm (HE) for two more days. After additional 5 or 13 days, immature (IH) and mature hepatocytes (MH), respectively, have formed. To appreciate the differentiation process, samples from each developing stage were taken and used for subsequent proteomic sample preparation (III-Figure 1B). Although the workflow optimized previously (chapter II) would have been the preferred choice, it was not used for this experiment because the evaluation had not been completed when the experiment was started. After cell lysis, equal amounts of protein were employed for in-solution digested and the resulting peptides were labeled with a unique isobaric TMT reagent. Next, the samples were pooled and used for studying PTMs to expand the understanding and to add more information to the protein expression data. For this, phosphorylated peptides were enriched using a Fe-IMAC column and acetylated peptides via immunoprecipitation. To increase acquisition depth, the remaining fullproteome was fractionated before peptide spectra were recorded on a Lumos Fusion mass spectrometer. With this experimental setup around 9,000 proteins, 12,000 P-sites, and 800 acetylation sites (Ac-sites) were confidently quantified with a high overlap between replicates, which facilitates a detailed investigation (III-Figure 2A). Overlapping proteins which were identified in both replicates were further used for PCA (III-Figure 2B). Close clustering of replicate indicated high reproducibility of the differentiation as well as sample preparation. Moreover, a clear time-dependent separation along PC1 and PC2 was observed confirming sufficient depth of the dataset to study the underlying molecular mechanisms of the differentiation process. 60 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation III-Figure 1: Workflow for the proteomics experiment. (A) Microscopic characterization of the hepatocyte differentiation process from human iPSC. DE, definitive endoderm; HE, hepatic endoderm; IH, immature hepatocyte-like; MH, mature hepatocyte-like. d0-d21 denote the time in days and the information below the axis depict cell culture media and supplements. RPMI: RPMI medium; B27: B27™ supplement; ActA: activin A; KO-DMEM: KnockOut™ DMEM medium; Glu: GlutaMax™; KSR: KnockOut™ Serum Replacement; NEAA: non-essential amino acids; DMSO: dimethylsulfoxid; HCM: HCM™ Hepatocyte Culture Medium; Dex: dexamethasone; OSM: oncostatin M; HGF: hepatocyte growth factor; FBS: fetal bovine serum. (B) Proteomics workflow from protein extraction in urea buffer to digestion and labeling with TMT. The pooled TMT channels were enriched for phosphorylated and acetylated peptides, before the fullproteome was deep-fractionated and spectra were acquired via LC-MS3. To evaluate the differentiation efficiency, a set of typical hepatocyte markers was quantified via qPCR and their expression was referenced by the averaged single-cell transcriptomes of a previous study [100] sharing the same experimental setup (III-Figure 2C). The temporal expression of all 61 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation selected proteins revealed high congruence confirming the successful differentiation and high quality of the proteomics data. III-Figure 2: Quality control and differentiation check. (A) Venn diagrams showing the number of identified proteins, P-sites, and Ac-sites of both independent replicates. (B) PCA of the proteomics experiment. (C) Zscored normalized heatmaps of the quantities of known stage-specific protein markers. Left panel: proteomics data; middle panel: previously published single-cell RNA-seq data [100]; right panel: qPCR data. 4.2 Protein changes suggest wide-ranging metabolic switch from hepatic endoderm to immature hepatocytes After the proteomics data quality was confirmed, the global protein dynamics were investigated in more detail. For this, the dataset was filtered for differentially expressed proteins (ANOVA: FDR <0.05, fold-change <2) in order to detect the most decisive changes. More than half of the quantified proteins, in total 4,956, showed significant expression changes at any of the investigated time points, which suggests broad protein rearrangements during the differentiation. To classify the apparent protein dynamics, hierarchical clustering was performed (III-Figure 3A, left panel), which allowed the discrimination of 10 distinct expression patterns (III-Figure 3A, right panel). With 1,859 or 2,114 members, most significantly regulated proteins were observed in cluster 3 or 10, reflecting an increase or decrease of protein expression between HE and IH, respectively. A KEGG analysis further suggested that mostly proteins related to metabolism were enriched in cluster 3, whereas cell cycle-related proteins appeared overrepresented in cluster 10 (III-Figure 3B). 62 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation III-Figure 3: Global proteomics analysis. (A) Z-scored heatmaps of hierarchical clustering from all 4,965 differentially expressed proteins (ANOVA, BH corrected: FDR <0.05 and fold change >2). Right panel shows the dynamics of all 10 distinct expression profiles (n depicts the number of proteins in this cluster). (B) KEGG enrichment of clusters defined in (A). As multiple metabolic pathways were upregulated in cluster 3, a more detailed analysis was conducted. All 13 detected TCA cycle proteins had a synchrounous expression pattern, which showed a concerted upregulation between HE and IH (III-Figure 4A). Besides providing multiple important intermediates for biomolecules, the TCA cycle generates NADH, which is further submitted into the oxidative phosphorylation pathway and used for energy production. In this regard, a particularly interesting protein is the succinate dehydrogenase (SDHA/B), as it participates in the TCA cycle as well as the electron transport chain. The simultaneous upregulation of the succinate dehydrogenase together with multiple other proteins connected to the electron transport chain (III-Figure 4B) indicates a concerted metabolic switch towards the increased utilization of energy obtained from the oxidative phosphorylation. In addition, the three pyruvate dehydrogenase subunits responsible for converting pyruvate into acetyl-CoA, which is further fed into the TCA cycle, were upregulated from HE to IH (III-Figure 4C). Moreover, proteins associated with the mitochondrial ribosome, which mainly translates proteins from the electron transport chain, demonstrated elevated levels, whereas proteins related to the cytoplasmic ribosome decreased (III-Figure 4D). The peroxisome pathway, which possesses a key role in fatty acid metabolism, was another highly enriched pathway in cluster 3. 48 proteins associated with the peroxisome were detected in this cluster with very similar expression patterns (III-Figure 4E, right panel). Interestingly, mRNA levels of the transcription factor (TF) peroxisome proliferatoractivated receptor alpha (PPARA) and the number of significantly regulated PPARA downstream targets followed the same trend (on protein expression level) pointing to an important role of this 63 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation TF in the metabolic switch (III-Figure 4E, left panel). To conclude, the numerous examples provide evidence for a metabolic switch between HE and IH. This switch includes the increased oxidation of fatty acids and higher utilization of oxidative phosphorylation. As the liver fulfills multiple metabolic functions, this metabolic switch also reflects the successful differentiation of hepatocyte-like cells. The highly synchronous switch combined with very pronounced expression changes within only 5 days was hereby striking, further suggesting the HE/IH transition as the key stage of in vitro hepatocyte differentiation. III-Figure 4: Metabolic switch between HE and IH. (A) Z-scored expression of proteins related to the tricarboxylic acid (TCA) cycle. (B) Temporal protein expression of the electron transport chain complexes IIV. Data are normalized to iPSC and represent the median log2-fold-change (n depicts the number of proteins in each complex). (C) Temporal protein expression of the pyruvate dehydrogenase complex. Data are normalized to iPSC, represent the average log2-fold-change of both replicates, and error bars denote the range. (D) Heatmap showing the z-scored expression of proteins associated with the mitochondrial ribosome (left panel) and the cytoplasmic ribosome (right panel). (E) Left panel in black: temporal mRNA expression of PPARA. Data is normalized to iPSC and depicts the average of all cells quantified with scRNAseq [100]. Left panel in gray: shows the number of significantly (ANOVA, BH corrected: FDR <0.05 and foldchange >2) regulated PPARA targets. Right panel: heatmap of z-scored proteins associated with the peroxisome. 4.3 Phosphorylation changes regulating cell cycle precede dynamics on proteome level Furthermore, the KEGG enrichment revealed significant changes of multiple cell cycle-related proteins in cluster 10 (III-Figure 3B). A very consistent and concerted downregulation of the 37 identified proteins related to the DNA replication was observed between HE and IH (III-Figure 5A, 64 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation right panel). DNA replication changes were temporally coordinated with the metabolic switch, albeit with opposing direction. Single-cell transcriptomics data from the aforementioned study [100], using the same experimental setup, additionally confirmed decreasing levels of DNA replication during the differentiation (III-Figure 5A, left panel). However, the timing was different as mRNA levels were already decreased at the DE stage. This temporal difference of mRNA and protein expression was also apparent for the TCA cycle and the majority of differentially expressed transcription factors (III-Figure 5B). Noteworthy, this comparison indicated a more explicit as well as robust temporal behavior on protein level enabling an improved data interpretation. In this regard, the global correlation of mRNA and protein levels was analysed in more detail revealing a rather weak correlation at each of the 5 different developmental stages (III-Figure 5C). However, the experimental setup was not designed a priori for direct comparison of mRNA and protein levels because the analytes were extracted from different experiments, introducing additional variance due to the different differentiation batches. Nevertheless, the poor correlation is in accordance with previous studies reporting on the discrepancy of mRNA and protein level [140, 153]. Additionally, the ratios of proteins and transcripts between adjacent time points was calculated in order to see if similar conclusion can be drawn from either experiment. Interestingly, these data correlated even worse suggesting that the timely expression of mRNA and protein was quite different (III-Figure 5D). However, when looking at the ratio of the two stages that are furthest apart (iPSC and MH) the correlation was better (III-Figure 5E). This implies that the general trend can be recapitulated on protein and transcript level, however their temporal dynamics are discrepant. 65 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation III-Figure 5: Correlation of protein and mRNA expression levels. (A) Z-scored heatmaps of proteins related to DNA replication from the the previously published [100] transcriptomics (left panel) and proteomics (right panel) data. (B) Same as (A) but for TCA cycle proteins (upper panel) as well as transcription factors (lower panel). (C) Global correlation of mRNA and protein levels at each time point. (D) Scatterplot showing the log2-fold change between two adjacent time points on protein and mRNA level. The blue lines display the trendline, while the red lines depict the angle bisector. (E) Same as (D) but for iPSC and MH. Since the cell cycle is mostly controlled via kinases and phosphatases at the two cell cycle checkpoints G1/S and G2/M, these were analyzed in more detail to elucidate on the substantial changes of the DNA replication. The retinoblastoma-associated protein 1 (RB1) is one key regulator for the G1/S checkpoint. Phosphorylation of RB1 inhibits the interaction with members of the E2F transcription factor family, which usually translocate into the nucleus to control protein expression, ultimately leading to G1/S progression (III-Figure 6A, left panel). On the contrary, 66 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation unphosphorylated RB1 binds E2Fs, resulting in impaired transclocation of the TF family and thereby impaired cell cycle progression. As the mRNA and protein levels of the DNA replication machinery revealed discrepant results, a phosphoproteomic experiment was conducted to elucidate the cell cycle alteration. The phosphoproteome analysis revealed a decrease of more than 80% of the two RB1 P-sites responsible for E2F binding, serine 807 and 811, between iPSC and DE. Noteworthy, the protein level of RB1 decreased similarly to the phospho data, although to a lesser extent. The effect of less RB1 abundance and phosphorylation levels was further confirmed by reduced levels of E2F1 downstream target proteins, which indicated a lower E2F1 activity due to scavenging of this TF by unphosphorylated RB1 (III-Figure 6B). On the other hand, the G2/M checkpoint is mostly controlled by the cyclin-dependent kinase 1 (CDK1), whose activity is directly regulated by phosphorylation. High CDK1 activity enhances cell cycle progression, whereas low activity leads to its impairment (III-Figure 6A, right panel). While the CDK1 protein levels were constant between iPSC and DE stage, the phosphorylation at threonine 161, which induces kinase activity, dropped significantly (III-Figure 6C). In addition, cyclin B1 (CCNB1) levels were reduced, which is essential for CDK1 activity as it forms an active complex with the kinase. In agreement with lower kinase activity, phosphorylation of the downstream CDK1 substrates were diminished during the course of differentiation (III-Figure 6D). In detail, all significantly changing P-sites (ANOVA, BH corrected: FDR <0.05 and fold-change relative to iPSC >2) were annotated with the Networkin tool [265], which uses an algorithm that combines the information from known substrates and motif-based predictions to retrieve kinasesubstrate relationships. To conclude, the detailed study of P-sites elucidated temporal dynamics of biological changes. For example, the combined analysis of protein expression and phosphorylation revealed that the changes at cell cycle checkpoints regulated by phosphorylation preceded the changes in protein expression involved in DNA replication. In general, alteration of the cell cycle are not unexpected during differentiation, as high proliferation has been previously reported as a feature of immature undifferentiated cells [266-268]. However, deciphering the discrete timing and mechanism of these processes can help to recapitulate the differentiation better and might be useful for the study of altered cell proliferation. 67 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation III-Figure 6: Cell cycle-related protein and phosphorylation changes. (A) Left panel: Model showing the regulation of the G1/S checkpoint. Hyperphosphorylation of RB1 impairs interaction with E2F, which leads to cell cycle progression. In contrast, unphosphorylated RB1 promotes interaction with E2F, which inhibits further G1/S progression. Right panel: Model showing CDK1-dependent G2/M checkpoint regulation. Phosphorylation at threonine 161 and binding of cyclin B (CCNB) activates CDK1, which enhances G2/M progression, while inactive CDK1 leads to the opposite effect. The y-axes show protein or phosphorylation, respectively, levels of DE relative to iPSC. Bars represent the average of two replicates and error bars denote the range. Asteriks show the significance (ANOVA, BH corrected: *FDR <0.05, ** FDR <0.01). (B) Number of significantly downregulated (ANOVA, BH corrected: FDR <0.05, fold-change <-2) E2F1 downstream targets relative to iPSC. (C) Temporal protein expression of CDK1 and ERK1/2 with their corresponding activity inducing P-sites. Data are normalized to iPSC, represent the average log2-fold-change of two replicates and error bars depict range. (D) Number of significantly (ANOVA, BH corrected: FDR <0.05, fold-change >2) up/downregulated substrates of CDK1 relative to iPSC. (E) Same as (D) but with ERK1/2 substrates. Another interesting finding was the diametrical opposing trend of ERK protein expression and phosphorylation. While the protein levels of ERK1 and ERK2 increased at IH, the phosphorylation at tyrosine 204 and 187 decreased substantially (III-Figure 6C). These two P-sites are positively 68 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation correlated with kinase activity, and indeed a reduction of downstream substrate phosphorylation could be confirmed by the networkin analysis (III-Figure 6E). In summary, the two depicted examples illustrated differences between mRNA, protein, and phosphorylation levels. mRNA and protein levels can be divergent due to complex regulation of transcription, mRNA stability, translation, protein stability, and protein turn-over [269]. The addition of PTMs adds another layer of depth to comprehend the actual functionality of concrete proteins. The observed discrepancies of transcriptome, proteome, and phosphoproteome highlight the value of the herein generated datasets and the depth of information that can be retrieved. Hence, the global proteomic and phosphoproteomic datasets are important additions to the previously published single-cell transcriptomics experiment and allow detailed analysis of multiple biological processes during hepatocyte differentiation. 4.4 Biological replicate confirms hepatocyte-specific protein changes To evaluate the concepts derived from TkDA3-4 cells, hepatocytes were differentiated using a second iPSC line (Ff-I01 cells), reprogrammed from a different donor and differentiated with a previously published protocol [253]. Hepatocyte differentiation with the Ff-I01 cells was performed with the collaborating research group of Keisuke Sekine. Compared to the previous experiment, the number of time points was increased to 12 to improve the temporal resolution (III-Figure 7A). Samples from day 0 (iPSC), day 6 (DE), day 10 (HE), day 13 (IH), and day 21 (MH) were applied for direct comparison to the previous experiment with TkDA3-4 cells. With the same proteomic sample preparation as depicted earlier around 9,000 proteins and 12,000 P-site were identified (III-Figure 7B), which is comparable to the previous experiment (III-Figure 2A). The success of the differentiation was again confirmed by the expression of well-established hepatocyte markers via proteomics and qPCR (III-Figure 7C). In addition, the PCA analysis indicated high reproducibility and a good time-dependent separation along PC1 and PC2 (III-Figure 7D). 69 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation III-Figure 7: Experimental setup and quality control of Ff-I01 iPSC cell line. (A) Number of samples and time points (d=days) harvested for proteomic experiment. (B) Venn diagrams of identified proteins and P-sites in all three replicates. (C) Heatmap of z-score normalized markers quantified via proteomics (left) or qPCR (right). (D) PCA of the three replicates at iPSC (d0), DE (d6), HE (d10), IH (d13), and MH (d21). Significantly changing proteins (n=5,105; ANOVA, BH corrected: FDR <0.05, fold-change >2) at one or more time points were selected and used for hierarchical clustering revealing 10 distinct expression profiles (III-Figure 8A, right panel). The majority of proteins was present in cluster 2 and 8 with a linear increasing or decreasing trend over time. To study the proteins in more detail, a KEGG analysis was performed for each cluster (III-Figure 8A, left panel). In cluster 8 multiple proteins related to metabolism, e.g. TCA cycle (III-Figure 8B), were enriched with a very congruent expression pattern. This upregulation is in line with the metabolic switch revealed from the previous experiment. At the same time, Cluster 2 revealed an overrepresentation of cell cyclerelated processes. While proteins related to DNA replication decreased rather constantly over time (III-Figure 8C), the cell cycle checkpoints revealed substantial alterations on protein and phosphorylation levels already at the DE stage. For instance, phosphorylation of RB1 serine residues essential for binding to the E2F transcription factor family dropped to 10% relative to the iPSC stage (III-Figure 8D, upper panel). Moreover, the protein levels of cyclin B (CCNB1) were reduced, which are both indications of a diminished cell cycle progression. Noteworthy, although the activating P-sites of CDK1 and ERK1/2 were not detected, the downregulation of downstream substrates indicated decreasing activity (III-Figure 8E). In general, the proteome and phosphoproteome analysis confirmed the previous results obtained from TkDA3-4 cells, establishing the observed changes as common characteristics of in vitro hepatocyte differentiation. However, the massive proteomic remodeling between HE and IH was less pronounced in the Ff-I01 compared to the TkDA3-4 differentiation. One reason for this could 70 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation be the slightly modified protocol, as the HE samples were only taken at day 10 (Ff-I01 cells) and not at day 8 (TkDA3-4 cells). This extended HE differentiation in the first protocol might have further fostered the expression changes. Furthermore, since two independent iPSC lines with different baseline protein expression were used, some differences might be also due to cell line specificities. III-Figure 8: Proteome and phosphoproteome analysis of Ff-I01 cells during hepatocyte differentiation. (A) Left panel: KEGG enrichment analysis of clustered proteins. Top three pathways are shown. Right panel: Expression profiles after hierarchical clustering (n=number of proteins in this cluster). (B) Dynamic expression (z-scored) of TCA cycle-related proteins. (C) Z-score normalized heatmap of proteins associated with DNA replication. (D) Upper panel: Model showing the regulation of the G1/S checkpoint. Hyperphosphorylated RB1 inhibits E2F interaction, which leads to cell cycle progression. In contrast, unphosphorylated RB1 interacts with E2F, which inhibits G1/S progression. Lower panel: Model showing CDK1-dependent G2/M checkpoint regulation. Phosphorylation at threonine 161 and binding of cyclin B (CCNB) activates CDK1, which enhances G2/M progression, while inactive CDK1 leads to the opposite effect. 71 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation The y-axes show protein or phosphorylation, levels of DE relative to iPSC. Bars represent the average of two replicates and error bars denote the range. (E) Number of significantly regulated (ANOVA, BH corrected: FDR <0.05; fold change >2) substrates of CDK1 and ERK1/2 relative to iPSC. Substrate predictions are based on Networkin. In order to address if the metabolic switch fostered the differentiation into hepatocytes or vice versa, the dynamics of several metabolic pathways as well as some selected IH markers were studied. To increase the temporal resolution of the metabolic switch, the TkDA3-4 experiment (IIIFigure 9, left panel) was repeated and samples were taken every 12 h between the HE and IH stage. Of note, the increase in maturation marker preceded the metabolic switch (III-Figure 9, middle panel). The timely separation was further supported by the differentiation of Ff-I01 cells (III-Figure 9, right panel) and has not been described before. Indeed, the timing of this switch is of interest considering that energy production of PSCs relies on aerobic glycolysis [270]. This phenomenon to maintain an anaerobic metabolism even in the presence of sufficient oxygen levels has previously been described in cancer cells and is called the Warburg-effect [271]. Despite elaborated research, the exact function of this effect remains unclear. With a decrease in pluripotency a metabolic switch from the less-energy efficient anaerobic glycolysis would be directly expected. However, the switch to aerobic glycolysis was only observed after 8-13 days. III-Figure 9: Metabolism vs. maturity. Temporal expression of selected markers for immature hepatocytes (blue) and metabolism pathways (orange) from TkDA3-4 cells (left panel), from TkDA3-4 cells with 12 h samples between HE and IH (middle panel), and from the Ff-I01 cells (right panel). Lines represent the median expression (z-scored) of all proteins related to the corresponding pathway. 4.5 Widespread changes in protein families accompany hepatocyte differentiation To gain a more detailed knowledge of functional processes that are happening during the differentiation, the previously selected differentially expressed proteins (ANOVA, BH corrected: FDR <0.05; fold-change >2) from both cell line experiments (TkDA3-4 and Ff-I01 cells) were further divided into classes with distinct biological functions. Firstly, cell surface proteins (n=492 for TkDA3-4, n=518 for Ff-I01), derived from the cell surface protein atlas [272], and epigenetic modifiers (n=350 for TkDA3-4, n=383 for Ff-I01), retrieved from the Epifactor database [273], were hierarchically clustered (n=8 clusters) to detect proteins with similar expression profiles. From each protein class the temporal profiles of the cluster comprising the most proteins is illustrated in III-Figure 10A. Within the group of cell-surface proteins, lysosomal proteins were 72 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation overrepresented in both biological replicates (III-Figure 10B and 0-Figure 1A). Since the lysosome is responsible for degradation of multiple biopolymers, like fatty acids or carbohydrates, its upregulation might be associated with the co-occuring metabolic switch. The impact of different lysosome levels were linked to metabolic changes in quiescent and primed hematopoetic stem cells before [274] and were suspected to be involved in epidermal differentiation [275]. There are additional studies claiming that the lysosome not solely executes metabolic functions, but is rather an organelle recognized with an emerging role in cell signaling [276, 277]. However, additional experiments are required to unravel the exact functions of lysosomes for hepatocyte differentiation. Noteworthy, from the 726 detected cell surface proteins, 97 were differentially expressed at at least one time point (0-Table 2). For example, the two fibroblast growth factor receptors (FGFR) 1 and 4 were significantly upregulated at the HE stage suggesting increased utilization of FGF signaling, which was previously proposed as an important pathway for early hepatocyte development [105, 278]. This list of stage-specific cell surface proteins can serve as resource for controlling future hepatocyte differentiations and for purifying specific cell populations, e.g. via fluorescence-activated cell scanning (FACS) sorting. With 88% (TkDA3-4) or 71% (Ff-I01), the vast majority of proteins annotated as epigenetic modifiers substantially decreased during the differentiation suggesting a global epigenetic change of the cells (III-Figure 10A). One example of an epigenetic modification is the acetylation of histone, thereby regulating gene expression [279]. Acetylation, however, is not limited to histones, but is a PTM on lysine residues that can be actively added or removed by enzymes and can thereby alter biological functions. One group of deacetylating enzymes are the NAD+dependent Sirtuins (SIRTs), which have previously been associated with cell differentiation and metabolic remodeling [280]. Especially SIRT1 and SIRT2 have been thoroughly investigated in this regard [281-288]. Interestingly, these two proteins showed strikingly opposite dynamics in their protein expression (III-Figure 10C, left panel), which was further confirmed with higher temporal resolution (0-Figure 1B). To elucidate more on the deacetylase activity during the differentiation, an immunoprecipitation (IP) was performed to enrich acetylated peptides. Given that SIRT2 was the only significantly increasing deacetylase from HE to IH, increasingly downregulated Ac-sites in this time span were presumed potential SIRT2 substrates. Indeed, 93 significantly decreasing Acsites (ANOVA, BH corrected: FDR <0.05; fold-change <-2) were identified between HE and IH. Among these was the previously reported SIRT2 substrate phosphoglycerate mutase PGAM-K100, which has important roles in energy production [285]. In addition, several other metabolic enzymes were deacetylated, including ALDOA-K13, ENO1-K262, and PKM-K115 (III-Figure 10C, right panel), thereby indicating novel Ac-sites to the SIRT2 substrates previously identified by Cha et al. [281]. To conclude, SIRT1 and 2 were identified as potential key regulators for the observed metabolic switch. Moreover, novel putative SIRT2 substrates together with their dynamics during hepatocyte differentiation were identified. In order to verify the biological function of these substrates and to attribute them to SIRT2, additional experiments are required. For example, a knockout or a specific inhibition of the activity could elucidate more on the influence of SIRT2 as a key regulator. 73 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation III-Figure 10: Expression profiles of cell surface proteins and epigenetic modifiers. (A) Temporal expression of the majority of cell surface proteins and epigenetic modifiers. Median and standard deviations are shown as lines and range (n=number of proteins with this profile). Solid lines represent the TkDA3-4 experiment and dashed lines the Ff-I01 experiment. (B) Z-scored expression of lysosomal proteins (TkDA3-4 experiment) depicting a subset of the ´cell surface´group in (A). (C) Left panel: temporal expression of SIRT1 and SIRT2 as log2-fold-change relative to iPSC. Right panel: log2-fold-change of protein (FP, dark orange) and corresponding Ac-site (ac-IP, light orange) of known and putative SIRT2 substrates. Data in both plots represent the average of two replicates, the error bars denote the range, and asteriks show the significance (ANOVA, BH corrected: *FDR <0.05; **FDR <0.01; ***FDR<0.001). The liver synthesizes large quantities of multiple biomolecules, such as bile, proteins, and lipids, with vital functions for the body. In order to secret these molecules, hepatocytes need to express high levels of transporter proteins, which makes this an interesting class of proteins to study in more detail. Therefore, from the list of previously selected differentially expressed proteins, transporter proteins were hierarchically clustered to detect common expression patterns. With 62% (179 of 289 detected) in TkDA3-4 and 65% (219 of 337 detected) in Ff-I01 cells, respectively, the expression of most transporter proteins showed a consistent increase over time (III-Figure 11A). Among these, multiple ATP-binding cassette (ABC) transporters were detected, which are in general highly expressed in the liver. This protein class is tightly connected to metabolism, efflux, and drug resistance in the liver making primary hepatocytes a frequently used cell system for preclinical drug toxicity tests [289]. The upregulation of ABC transporters is additional evidence of a successful differentiation towards hepatocytes (III-Figure 11B, left panel and 0-Figure 1C, left panel) and a prerequisite for the suitability of these cells for toxicity tests. Interestingly, expression of the ABCC1 transporter, which positively regulates efflux of the antioxidant glutathione, decreased along differentiation. However, the decrease of ABCC1 might be a consequence of the metabolic switch to oxidative phosphorylation for energy consumption during hepatocyte differentiation. The change to an aerobic energy production causes elevated levels of reactive oxygen species (ROS) as a byproduct, which is in excess harmful to the cell and therefore needs to be removed. Indeed, multiple enzymes for eliminating ROS were additionally upregulated (IIIFigure 11B, right panel and 0-Figure 1C, right panel) and might constitute with the decrease in glutathione efflux the cell adaptations to aerobic metabolism. 74 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation III-Figure 11: Expression profiles of transporter proteins and transcription factors. (A) Temporal expression of the majority of transporter proteins and transcription factors. Median and standard deviation are shown as lines and range (n=number of proteins with this profile). Solid lines represent the TkDA3-4 experiment and dashed lines the Ff-I01 experiment. (B) Left panel: Heatmap of z-scored ABC transporter protein expression. Right panel: log2-fold-change of enzymes reducing ROS at IH relative to HE. Bar represents the average, error bars denote the range of two replicates, and asterisks show the significance (ANOVA, BH corrected: *FDR <0.05, **FDR <0.01, ***FDR <0.001). (C) Temporal expression of the NuRD complex. Data points depict the median of the relative log2-fold-change (normalized to iPSC) of all complex members shown in the lower left corner. Error bars represent the standard deviation. Transcription factors (TFs) are proteins that bind to specific gene regions regulating their transcription rates and thereby control cellular function and identity. TFs are of special interest for developing organs, as they orchestrate cell fade and lineage specification. To elucidate their influence on hepatocyte differentiation, the abundances of TFs at the development stages were investigated. For this, significantly expressed TFs (ANOVA, BH corrected: FDR <0.05; fold-change >2) were hierarchically clustered, as the protein classes before. The analysis revealed for the majority of TFs (76% for TkDA3-4 and 51% for Ff-I01) constantly high abundance levels in the beginning of the differentiation (III-Figure 11A). Co-expression of a broad spectrum of TFs might enable the rapid and dynamic response to different signaling cues to differentiate into different cell types. With increasing cell specification, a substantial drop of TF levels was observed (III-Figure 11A), which suggested that the cells removed most nonessential TFs during maturation, whereas some important TFs for hepatocyte differentiation were elevated. One example for this is the NuRD (nucleosome remodeling deacetylase) complex (III-Figure 11C), which is expected to finetune the expression of a variety of genes important for lineage commitment [290, 291] and maintains the heterogeneity in ESCs [292]. In addition, similar expression dynamics were observed by the multiprotein complex SIN3 (0-Figure 1D), which, like the NuRD complex, belongs to the HDAC1/2 complex family, which are known chromatin modifier that regulate transcription [293]. Since the majority of proteins altered their expression levels between HE and IH (III-Figure 3A and III-Figure 8A), substantial changes of TFs at this time can be expected. 4.6 Temporal protein profiling reveals novel stage-specific markers With the depth of these proteomic datasets, an additional expression analysis was conducted to elucidate marker proteins that are specific for a certain time point. To obtain stage-specific markers, the differentially expressed proteins (ANOVA, BH corrected: FDR <0.05) were filtered for 75 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation proteins that were at least 2-fold higher than at all other time points. This analysis revealed 78 proteins that were quantified with high reproducibility and congruent expression profiles in both iPSC lines (III-Figure 12A). Among the 17 novel iPSC markers was the acetyl-CoA carboxylase (ACACA), which is the ratelimiting enzyme in long fatty acids biosynthesis (III-Figure 12B). Another protein with similar biological function is the fatty acid synthase (FASN). Although FASN was not part of the marker list, the expression profiles were quite similar (III-Figure 12C). As PSCs proliferate rapidly, the high abundance of key enzymes for fatty acid production might be due to a high demand of fatty acids in processes like membrane synthesis. The DNA methyltransferase (DNMT) 3B was another interesting iPSC marker (III-Figure 12C) that emerged from this analysis. While DNMT3A and DNMT3B are required for de novo methylation, DNMT1 maintains the methylation state (0-Figure 2A). Although both DNMT3s are highly homologous and share multiple common functions [294], their temporal expression during hepatocyte differentiation diverge from each other. While DNMT3B levels dropped considerably (2 to 5-fold) between iPSC and DE, DNMT3A expression remained constant until the HE stage (III-Figure 12C). This trend was further supported by the transcriptomics analysis (0-Figure 2B) suggesting distinct functions of both proteins as previously described in hESCs [294]. Interestingly, DNMT1, but not DNMT3A/B, has been shown to be essential for cardiomyocyte and hematopoietic differentiation before [295], which implies that the maintenance of methylation levels might be more crucial for cardiac development than the de novo methylation [296]. The overall decreasing levels of all three DNMTs suggest a reduction of methylation during hepatocyte differentiation but further experiments are required in order to get a more detailed view on the methylation state. 76 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation III-Figure 12: Novel stage-specific protein marker. (A) Heatmap of proteins significantly upregulated (ANOVA, BH corrected: FDR <0.05) with more than 2-fold compared to any other time point. Data is z-scored and represents the average from both iPSC lines. (B) Temporal expression of selected examples for stagespecific protein markers from (A). (C) Temporal expression of proteins related to (A) or (B). Data points show the average and error bars denote the range of replicates. PSCs and hepatocytes are both known for expressing high levels of the epithelial marker Ecadherin. However, instead of maintaining their epithelial phenotype throughout the differentiation, PSCs undergo an EMT for DE generation [297]. The transcriptomics and proteomics data supported this previous finding by a drop of E-cadherin levels upon DE differentiation followed by a steady increase towards MH (0-Figure 2C). Moreover, mRNA levels of the mesenchymal marker N-cadherin increased substantially at the DE-stage and maintained constant, while protein levels showed a steady increase over time. This EMT for DE development is an interesting finding considering that GRB2-associated binding protein 2 (GAB2) was detected as a novel DE marker (III-Figure 12B). While GAB2 was previously associated with EMT in cancer cell [298], the proteomics analysis suggested an additional connection to DE development. Given that most TFs dropped upon differentiation (III-Figure 11A), the identification of the TF CCAAT/enhancer-binding protein alpha (CEBPA) as an IH marker was of particular interest. CEPBA was previously linked to several metabolic genes [299] and the knock-out in mice was lethal due to the inability to accumulate hepatic glycogen [300, 301]. Furthermore, CEBPA was shown to be highly abundant in early mice embryos and coexpressed with the IH marker alpha-fetoprotein (AFP, [302]). The high-resolution proteomics data adds more temporal information to these previous findings by showing the very high and specific upregulation of CEBPA at IH with a more 77 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation than 10-fold increase (III-Figure 12B) but a subsequent drop at MH stage, which indicates a very time-dependent role for hepatocyte development. As the CEBPA upregulation is co-occuring with the observed metabolic switch, this could explain the lethality of CEBPA knock-out in mice due to insufficient hepatic metabolism [299, 300]. Another important TF in the liver is CCAAT/enhancerbinding protein beta (CEBPB), which originates from the same basic region leucine zipper family and has demonstrated the induction of hepatocyte proliferation [303]. This TF is highly upregulated during liver regenerative response [304] and its overexpression enables the conversion of pancreatic cells into hepatocytes [136]. However, the temporal profiling of CEBPB revealed no significant expression changes along the hepatocyte differentiation (III-Figure 12C), suggesting different regulation and function of CEBPA and CEBPB during hepatocyte differentiation. While many proteins that are specific for functional hepatocytes were already elevated prematurely at the IH stage, 52 of the 78 novel marker proteins were nevertheless upregulated at MH stage. Interestingly, 19 of these MH markers were related to the immune system, which is a substantial and often underestimated biological function of the liver [305, 306]. Two prominent examples of these marker proteins are the complement proteins C5 as well as C3 (III-Figure 12A), which is the most abundant complement component in the blood being primary synthesized in hepatocytes [305]. The more than 20-fold increase in MHs compared to iPSCs suggests the thruthful recapitulation by the in vitro model. Similarly, carbamoyl-phosphate synthase (CPS1), which is the rate-limiting enzyme of the urea cycle, was upregulated more than 16-times at MH. Such changes only occurring in late hepatocyte differentiation further substantiate the physiological relevance of the proteomics data on the in vitro hepatocyte differentiation. 4.7 New insights into hepatocyte differentiation by WNT signaling The canonical WNT signaling is recognized as an essential driver for DE development [103, 307], which reasons the supplementation of WNT3A or the WNT activator CHIR99021, respectively, during the first 6 days of differentiation. The previous marker analysis (III-Figure 12A) revealed substantial alterations of multiple proteins associated with the WNT signaling pathway at specific time points during the differentiation. For example, the WNT inhibitor RRM2 was highly abundant in iPSC and subsequently decreased upon differentiation (III-Figure 12A and III-Figure 13A). Additionally, the WNT-receptor frizzled 5 (FZD5) increased more than 8-fold during the first 6 days suggesting FZD5 as an important marker for DE development (III-Figure 13A). At the same time WNT5A, which is a driver of the non-canonical WNT pathway [308-310] and antagonist of WNT3A [311], dropped considerably (III-Figure 13B). This implies that not only the canonical WNT signaling is upregulated for DE development, but also the non-canonical WNT pathways are downregulated. While FZD5 receptor levels dropped beyond DE differentiation, the WNT inhibitors estrogenrelated receptor gamma (ESRRG) [312] and WNT11 [310] were increased at the HE stage (III-Figure 13A and III-Figure 13B). This increase was followed by the upregulation of another WNT inhibitor Low-density lipoprotein receptor-related protein 4 (LRP4) [313] at the IH stage, suggesting that these inhibitors might inhibit WNT in a different way or even different WNT proteins. The inhibition of WNT beyond the DE is in accordance with previous studies demonstrating negative effects of DE development due to prolonged WNT3A exposure [101, 103]. If WNT signaling was 78 | P a g e Chapter III: High temporal resolution investigation of in vitro hepatocyte differentiation not stopped after 7 days, DE cells had started losing their progenitor capacities and failed further differentiation towards the pancreatic or liver lineage. In summary, the analysis of WNT signaling during hepatocyte differentiation identified several novel key regulators and shed further light on this not fully understood developmental process. III-Figure 13: WNT signaling during hepatocyte differentiation. (A) Temporal expression of WNT-related proteins that were identified as stage-specific markers (see III-Figure 12A). (B) Temporal expression of proteins related to the WNT pathway. (C) High-temporal dynamics of WNT11 and LRP2 from the Ff-I01 experiment. Data points show the average and error bars denote the range of replicates. Low-density lipoprotein receptor-related proteins (LRPs) are a family of co-receptors involved in WNT signaling. Upon WNT binding to the frizzled receptor, LRP5 and LRP6 are involved in the transduction of the canonical pathway [314]. So far only these two family members have been associated with the WNT pathway, while the others are mostly known for regulating lipid metabolism [315]. Interestingly, the expression profiles of LRP2 and WNT11 were highly analogous in both iPSC lines (III-Figure 13B) suggesting a novel receptor/ligand interaction. The enhanced resolution of the Ff-I01 experiment confirmed the high temporal correlation of both proteins (IIIFigure 13C). In addition, the temporal dynamics were not only congruent, but with a relative foldchange to iPSCs of more than 20-fold all the more striking. So far, WNT11, a known WNT inhibitor, and LRP2, a multiligand receptor for mediating endocytosis of several molecules [316-318], have not been associated with each other. Owing to the prior knowledge of LRP5 and LRP6, it is tempting to hypothesize that LRP2 is a novel co-receptor of WNT11 which is involved in its inhibitory function. However, further experiments are indispensable to prove the direct functional interaction. 79 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples 1 Summary ............................................................................................................................ 83 2 Introduction ....................................................................................................................... 84 3 Material and Methods ........................................................................................................ 85 4 Results and discussion ........................................................................................................ 89 4.1 In vitro models generate rather immature hepatocytes ............................................... 89 4.2 3D liver buds are superior to 2D hepatocytes ............................................................... 92 4.3 Expression of ADME/Tox-related proteins of in vitro models reflect fetal-like stage ..... 94 4.4 Charting in vitro hepatocyte differentiation and future optimization strategies............ 95 81 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples The following chapter is largely based on the publication “High temporal resolution proteome and phosphoproteome profiling of stem cell-derived hepatocyte development” [247] published in Cell Reports on March 29, 2022. Authors contributions for [247]: Conceptualization, J.K., B.T., and B.K.; methodology, J.K., K.S., A.B., M.B., R.Y., A.K., H.T., B.T., J.G.C., and B.K.; software, J.K., P.S., and M.W.; validation, J.K., P.S., M.W., M.B., A.K., and B.K.; formal analysis, J.K., P.S., M.W., and B.K.; investigation, J.K., K.S., A.B., R.Y., and M.B.; resources, J.K., K.S., A.B., M.W., B.T., J.G.C., and B.K.; data curation, J.K., P.S., and M.W.; writing – original draft, J.K. and B.K.; visualization, J.K., P.S., and M.W.; supervision, B.T., J.G.C., and B.K.; project administration, J.K. and B.K.; funding acquisition, B.T. and B.K. 82 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples 1 Summary Primary human hepatocytes thruthfully preserve the functionality of their in vivo counterparts and are hence very valuable for multiple medical research fields such as drug discovery, toxicology studies, and regenerative medicine. However, primary hepatocytes are very scarce and show high donor variability, limiting their broader application and highlight the need for alternatives. Pluripotent stem cell-derived hepatocytes present such an alternative. Apart from the classical monolayer culture, hepatocyte can also be generated as 3D liver organoids thereby most likely recapitulating cellular functionality and organization of hepatocytes closer than 2D culture systems. But so far, most studies classifying the maturity of pluripotent stem cell-derived hepatocytes have been limited to transcriptomics data or a few selected marker proteins. For an improved evaluation of the complex biology of the developmental process, a global proteomic and phosphoproteomic analysis was performed. For this, stem cell-derived hepatocytes from a 2D and 3D model were compared and benchmarked to fetal liver and primary human hepatocytes. With 8,800 proteins and 12,700 phosphorylation sites, the up to now most comprehensive proteomic and phosphoproteomic datasets comparing in vitro and in vivo hepatocytes was acquired. A PCA and the upregulation of several liver-specific proteins verified the successful differentiation into hepatocytes of both in vitro models. However, the 3D differentiation into hepatocytes was indeed superior to the monolayer approach, as the purity of the derived 3D cultures was higher and various liver-specific metabolic pathways were upregulated. Nevertheless, multiple proteins related to crucial liver-specific functions, such as the metabolism of alcohol and aldehyde, or the absorption, distribution, metabolism, and excretion of bioactive compounds, were still substantially lower expressed than in the fetal liver and primary hepatocytes. In addition, analysing kinase-substrate relationships uncovered significantly altered activity of multiple kinases between the in vitro and in vivo samples. In summary, although the results demonstrated superior differentiation efficiencies of the 3D protocol compared to 2D, both in vitro models fell short in completely resembling the in vivo counterparts. Owing to the deep proteome and phosphoproteome datasets multiple starting points for future protocol improvements were discovered, which exemplifies the value of the resources collected in this thesis. 83 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples 2 Introduction Initial protocols for generating tissue-specific cell types from PSCs were utilizing 2D monolayer formats. This has changed over the last years as more and more 2D protocols have been supplemented or replaced by 3D models [42]. These 3D models enable the integration of multiple different cell types and are therefore capable of resembling the complex structure of an organ in a more physiological manner. For example, iPSCs can be differentiated into complex liver buds (LBs), comprising hepatic endodermal, mesenchymal, and endothelial cells [123, 253]. By coculturing these three cell types, they self-aggregate to form vascularized 3D organoids and their transplantation partly rescued liver failure in mice [122]. The recapitulation of essential liver functions by the LBs demonstrates the great potential of this technology for regenerative medicine. Moreover, PSC-derived hepatocytes are extensively studied as a potential model system for assessing absorption, distribution, metabolism, excretion, and toxicity (ADME/Tox) in the context of drug discovery. So far the ‘gold standard’ for such experiments have been PHH [128, 319], which are less demanding concerning culturing and can be employed for high-throughput drug screening [320]. In recent years, this technology has also evolved from 2D monolayer to 3D models, which better reflects the conditions in vivo [321]. However, PHH come with some disadvantages, as they are scarce, show high donor-to-donor variability, and are prone to changing their characteristics upon in vitro cell culturing [322]. As stem cell differentiation overcomes the shortage of cells by providing a virtually unlimited source as well as the potential for standardized and high-throughput protocols, they are increasingly explored as a promising alternative [323, 324]. Although PSC-derived hepatocytes possess a great potential for various applications, a full characterization on protein levels is still missing, as previous studies were mostly based on transcriptomics data [100, 255] or covered only a limited number of proteins [123, 253, 254]. In order to evaluate how closely they recapitulate their in vivo counterpart, the global protein expression as well as the phosphorylation pattern of 2D and 3D derived hepatocytes were benchmarked against fetal liver and PHH. 84 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples 3 Material and Methods 2D Hepatocyte differentiation The in vitro generated 2D hepatocytes used for this experiment were kindly provided by our collaborators from the research group of Dr. Keisuke Sekine (Department of Regenrative Medicine, Yokohama City University, Japan; Laboratory of Cancer Cell System, Tokyo, Japan). Hepatocytes were generated from Ff-I01 cells as described previously [253]. Briefly, stem cells were cultured in StemFit™ Basic03 (Ajinomoto) supplemented with 80 ng/ml bFGF (Fujifilm Wako Pure Chemical). For initiating differentiation, the cells were dissociated with Accutase® (SigmaAldrich) and plated on laminin 511-E8 (iMatrix-511, Nippi)-coated dishes in the presence of the ROCK inhibitor Y-27632 (Fujifilm Wako Pure Chemical). Cells were differentiated in RPMI1640 (GIBCO™) supplemented with 20% StemFit™ For Differentiation and 100 ng/ml activin A (both Ajinomoto) for the first 6 days. Additionally, 2 µM CHIR99021 (Cayman Chemical) was supplemented during the first three days and 0.5 mM sodium butyrate (Sigma-Aldrich) was added from day 1 to day 4. This step led to the expression of the DE marker CXCR4 in almost 100% of the cells [253]. Next, cells were differentiated in StemFit™ Basic03 (Ajinomoto) medium supplemented with 1% DMSO (Sigma-Aldrich), 0.1 mM 2-mercaptoethanol, 0.5% L-glutamine, and 1% non-essential amino acids (all GIBCO™). During the first 13 days, the medium was exchanged daily and thereafter every second day. For the final 8 days, cells were cultured in DMEM medium (GIBCO™) supplemented with 5% StemFit™ For Differentiation (Ajinomoto) and 0.1 µM dexamethasone (Sigma-Aldrich). Samples were harvested at day 0 (iPSC), day 6 (DE), day 10 (HE), day 13 (IH), and day 21 (MH), and washed twice with PBS (w/o CaCl2 and MgCl2). The cell pellet was stored at -80°C until further processing for proteomics analysis. Differentiation of endothelial cells (ECs) and mesenchymal cells (MCs) The in vitro generated ECs and MCs used for this experiment were kindly provided by our collaborators from the research group of Dr. Keisuke Sekine (Department of Regenrative Medicine, Yokohama City University, Japan; Laboratory of Cancer Cell System, Tokyo, Japan). ECs and MCs were generated from iPSCs as described previously [123]. For EC differentiation, iPSCs were plated onto laminin 511-E8 (iMatrix-511, Nippi)-coated dishes in mesoderm induction medium consisting of DMEM/F-12 (1:1 mixture) medium supplemented with 1% GlutaMAX™, 1% B-27™ (all GIBCO™), 8 µM CHIR99021 (Cayman Chemical), and 25 ng/mL BMP4 (R&D Systems). For initial seeding, 10 µM ROCK inhibitor Y-27632 (Fujifilm Wako Pure Chemical) was added. From day 4 to day 10, cells were cultured in EC Induction Medium consisting of StemPro-34 SFM medium supplemented with 200 ng/ml VEGF (both GIBCO™), and 2 µM forskolin (Cayman Chemical). The medium was exchanged every second day. For MC/STM differentiation, iPSCs were seeded onto laminin 511-E8 (iMatrix-511, Nippi)-coated dishes and cultured in mesoderm induction medium. After 4 days, cells were cultured with LPM induction medium consisting of DMEM/F-12 supplemented with 1% B-27™, 10 ng/ml PDGFBB, 2ng/ml activin A, and 1% GlutaMAX™ for 2 days. Then, the medium was changed to STM induction medium consisting of DMEM/F-12, 1% B-27™, 10 ng/ml bFGF, 12 ng/ml BMP4, and 1% GlutaMAX™. 85 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples 3D liver buds generation The 3D LBs used in this thesis were kindly provided by our collaborators from the research group of Dr. Keisuke Sekine (Department of Regenrative Medicine, Yokohama City University, Japan; Laboratory of Cancer Cell System, Tokyo, Japan). As described previously [325], LBs were generated by seeding HE, EC, and MC cells at a 10:7:2 ratio onto Matrigel (BD Bioscience)-coated dishes. Cells were cultured in a 1:1 mixture of DMEM (GIBCO™) and KBM-VEC1 basal medium (Fujifilm Wako Pure Chemical) supplemented with 2.5% StemFit™ For Differentiation (Ajinomoto) and 0.1 µM dexamethasone (Sigma-Aldrich). For initial seeding, 10 µM of the ROCK inhibitor Y27632 was added and thereafter half of the medium was exchanged daily. Adult and fetal liver cells Adult liver cells were kindly provided from the Universitätsklinikum Leipzig with the donor´s informed consent. PHH were isolated by the research group of Dr. Georg Damm as described previously [326]. Briefly, liver tissues were dissociated using a two-step EGTA/collagenase P perfusion incubation and the PHH cells were separated from the non-parenchymal cells by centrifugation at 50 xg. PHH were subsequently washed twice with PBS (w/o CaCl2 and MgCl2) and the cell pellet was stored at -80°C until further processing. The human fetal liver samples were provided by the Joint MRC/Wellcome Trust HDBR with ethical approval. In this work, human fetal liver samples from gestation week 16 and 17 were used for the proteomics study. Protein extraction, digestion and labeling Cell pellets of PHH and fetal liver samples as well as from the 2D and 3D hepatocyte differentiation were resuspended in lysis buffer containing 8M urea, 40 mM Tris/HCl (pH 7.6), EDTA-free protease inhibitor (cOmplete™, Roche), and 1x phosphatase inhibitor mix (prepared in-house according to the Phosphatase Inhibitor 1, 2, and 3 from Sigma) and stored at -80°C. Cell lysates were thawed on ice and centrifuged with 20.000 xg for 20 min at 4°C. Protein concentration of the supernatants was determined using the Pierce™ Coomassie Bradford solution (Thermo Scientific) according to the manufacturer´s protocol. 70 µg of protein was used for subsequent reduction with 10 mM DTT for 45 min at 37°C and alkylation using 55 mM CAA for 30 min at RT. After diluting the urea concentration below 1.6 M with 40 mM Tris/HCl (pH 7.6), trypsin at a 1:100 enzyme:protein ratio was added for pre-digestion at 37°C and 700 rpm. After 3 h, trypsin was added again at a 1:100 ratio and proteins were digested overnight. The digestion was stopped by adding FA to a final concentration of 1% and the acidified peptides were subsequently desalted by loading onto StageTips [164] (10 C18 disks, Empore™ 3M), washing with 0.1% FA, and eluting with 0.1% FA in 50% ACN. For TMT labeling, desalted peptides were reconstituted in 20 µl 50 mM HEPES buffer (pH 8.5) and mixed with 5 µl of 11.6 mM TMT reagent. After 1 h shaking with 400 rpm at RT, the labeling reaction was stopped by adding 2 µl of 5% hydroxylamine (Sigma-Aldrich). The 11 TMT channels were pooled and the reactions vessel were rinsed with 20 µl of 10% FA in 10% ACN and combined to the pool. The labeled peptides were dried and desalted using SepPak column (Water Corp.). For this, peptides were loaded and washed with 0.07% TFA before they were eluted using 0.07% TFA in 50% ACN. 86 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples Phosphopeptide enrichment and off-line fractionation For phosphopeptide enrichment, desalted peptides were reconstituted in 0.07% TFA in 30% ACN and loaded onto a FE3+-IMAC column (ProPac™ IMAC-10 4x50 mm, Thermo Fisher Scientific) as described previously [173]. The unbound flow-through consisting of non-phosphorylated peptides was dried and stored for off-line fractionation. The bound phosphopeptides were eluted with 0.315% NH4OH and further fractionated on StageTip [236]. For this, StageTips with 5 C18 disks (Empore™ 3M) were constructed and peptides were eluted wit increasing ACN concentration (5%, 7.5%, 10%, 12.5%, 15%, 17.5%, and 50% ACN). To obtain six fractions, the 5% and 50% as well as the 17.5% and flow-through fractions were combined and subsequently dried. The non-phosphorylated IMAC flow-through was further deep fractionated via hPH reverse phase fractionation as described previously [170]. Briefly, peptides were reconstituted in 25 mM NH4HCO3 (pH 8) and loaded onto a C18 column (XBridge BEH130, 3.5 µm, 2.1x150 mm, Waters Corp.) coupled to a Dionex 3000 HPLC system (Thermo Fisher). Peptides were eluted with an increasing ACN concentration in 25 mM NH4HCO3, collected, and pooled to 48 fractions. Data-dependent LC-MS acquisition Fullproteome and phosphoproteome were measured in DDA on a nanoflow system consisting of a Dionex 3000 UHPLC+ connected to a Fusion Tribrid mass spectrometer (both Thermo Fisher Scientific). Peptides were dissolved in 0.1% FA (fullproteome) or 0.1% FA with 50 mM citrate (phosphoproteome) and loaded onto an in-house packed trap column (75 µm x 2 cm, 5 µm C18 resin; Reprosil PUR AQ, Dr. Maisch). An in-house packed analytical column (75 um x 40 cm, packed in-house with 3 um C18 resin; Reprosil PUR AQ) was used to separate peptides, which were further injected into the Fusion Lumos and subsequently measured in positive ionization mode. The fullproteome was eluted with a linear 50 min gradient from 8% to 34% LC buffer B (0.1% FA and 5% DMSO in ACN) in LC buffer A (0.1% FA and 5% DMSO in water). MS1 spectra were acquired in the orbitrap with 60,000 resolution, an AGC target of 4e5 charges, a maxIT of 20 ms, and a scan range of 360-1,300 m/z. For the following MS2 scan, charge states between 2 and 6 were allowed with decreasing priority and fragmented via CID before they were recorded in the ion trap with an AGC target of 2e4 charges and a maxIT of 60 ms in rapid mode. The MS3 spectra were acquired from simultaneous precursor selection (SPS-MS3) of 10 precursors that were further fragmented with HCD and measured in the orbitrap with 50,000 resolution. The AGC target was set to 1.2e5 charges and the maxIT to 120 ms. The phosphoproteome was measured with a slightly different acquisition method and an 80 min linear gradient of LC buffer B. Full scan MS1 spectra were recorded with 60,000 resolution in the orbitrap, a scan range of 360-1,300 m/z, an AGC target of 4e5 charges, and a maxIT of 50 ms. CID was used for precursor fragmentation and subsequent MS2 spectra were acquired at 15,000 resolution in the orbitrap with an AGC target of 5e4 charges and a maxIT of 22 ms. For quantification, an additional MS3 scan was performed in the orbitrap with 50,000 resolution, an AGC target of 1.2e5 charges, and a maxIT of 120 ms. Multi-notch isolation was utlilized for selecting fragment ions of 10 notches with subsequent HCD fragmentation. 87 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples Database searching Fullproteome and phosphoproteome were searched together in separate parameter groups with the Maxquant software [209, 210] (version [1.6.2.3]) and against the human UniProtKB reference list (downloaded 22.07.2013). Unless stated otherwise, default setting were applied. Trypsin was defined as protease and up to two missed cleavages were allowed. Carbamidomethylation was set as a fixed modification and oxidation of methionine as well as N-terminal protein acetylation were selected as variable modifications. For the phosphoproteome, serine, threonine, and tyrosine phosphorylation were additionally set as variable modification. TMT11-plex was defined as the quantification type and the corresponding correction factors were specified. Fullproteome data processing Quantitative fullproteome analysis was performed on the proteingroups.txt output file. Before data was normalized, reversed hits and protein entries that were only identified by site were removed. The reporter ion intensities were log2 transformed and the median of each TMT channel was normalized to the averaged median of all channels (median centering). Then, batch effects between replicates were removed with ComBat [261] from the ´sva´package (version [3.30.1]). These normalized reporter intensities were used for further downstream analyses. The PCA analysis was compiled from the ‘factoextra’ package (version [1.0.7]) and plotted with the ‘ggplot2’ package (version [3.3.2]). To determine variance, ANOVA testing with BH correction was performed and proteins with a fold-change >2 and a FDR <0.05 were classified as significantly altered. Phosphoproteome data processing The phosphosite.txt was used for quantitative phosphoproteome analysis. Reverse hits and entries with a localization probability <0.75 were removed for further processing. Reporter intensities were log2 transformed and median-centered using the correction factors calculated from the corresponding fullproteome data. As for the fullproteome, batch effects between replicates were removed by ComBat and significantly regulated P-sites were calculated (ANOVA: FDR <0.05 and fold-change >2). For kinase-substrate relationships, the networkin prediction tool was implemented [264]. 88 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples 4 Results and discussion In order to assess the hepatocyte differentiation of 2D and 3D culturing models, their proteomic and phosphoproteomic profiles were compared with profiles from in vivo fetal liver and PHH. 4.1 In vitro models generate rather immature hepatocytes In the previous chapter, 2D monolayer-derived hepatocytes were examined to extrapolate molecular changes during epithelial cell differentiation. Apart from this approach, an enhanced culturing method for hepatocyte generation are LBs. These are 3D organ-like structures that consist of multiple cell types which self-organize upon co-culturing. For the LBs generated in the following study, cells from the HE, endothelial cells (EC), and mesenchymal cells (MC) were individually differentiated from iPSCs and then combined for hepatocyte maturation. The 2D and 3D derived hepatocytes for this experiment were generated in collaboration with the research group of Keisuke Sekine from Japan. For a direct comparison, proteomes from the 2D differentiation (HE, IH, MH), 3D differentiation (MC, EC, and LBs after pooling at day 3, 10, and 11), fetal liver (from week 16 and 17 after gestation), and PHH were gathered (IV-Figure 1A). The experiment was performed as dublicate (R1 and R2) with the previously described workflow (IIIFigure 1). The deep (phospho)proteomic measurements led to 8,800 identified proteins and 12,700 P-sites (IV-Figure 1B). PCA analysis of the overlapping 8,100 proteins of both replicates revealed a clear separation between 2D, 3D, fetal liver, and PHH along PC1 and PC2, which implied that differences on protein levels are sufficient to discriminate the samples (IV-Figure 1C). Furthermore, the direction of 2D and 3D differentiation stages indicated a temporal maturation towards fetal liver and PHH, which suggested a successful development towards functional hepatocyte-like cells. IV-Figure 1: Quality control of differentiation. (A) Overview of the samples used for the in vitro versus in vivo comparison. EC, endothelial cells; MC, mesenchymal cells. (B) Venn diagrams showing the number of identified proteins and phosphorylation-sites (P-sites) of both replicates, respectively. (C) PCA of the proteomics experiment. Next, the MH, LB d11, fetal liver (average of week 16 and 17), and PHH were analysed in more detail and the differentially expressed proteins (ANOVA, BH corrected: FDR <0.05 and FC>2) were 89 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples subselected from the dataset. These 1,332 proteins were z-scored and hierarchically clustered leading to 9 distinct expression profiles of which cluster 3 and 8 comprised the majority of proteins (IV-Figure 2A). Cluster 8 contained progenitor proteins like AFP, hepatocyte nuclear factor 1-beta (HNF1B), and transcription factor GATA4, which were elevated in the 2D samples and decreased towards PHH (IV-Figure 2B, left panel). This suggested incomplete maturation of the 2D samples, as these proteins are associated with the developing liver rather than mature hepatocytes. In contrast, the expression of cluster 3 proteins was highest in PHH and comprised several liverspecific proteins, including albumin (ALB), fructose-bisphosphate aldolase B (ALDOB), and the liver carboxylesterase 1 (CES1). While the abundance levels of ALB did not diverge substantially between the samples, ALDOB and CES1 were 8-300 times higher expressed in PHH compared to the other samples (IV-Figure 2B, right panel). In addition, KEGG analysis revealed that multiple proteins related to metabolic pathways were enriched in cluster 3 (IV-Figure 2C). Among them were several pathways associated with cytochrome P450, which is a family of enzymes that is highly abundant in the liver and very important for drug metabolism. One significantly upregulated member of this family was for example CYP2A6, which is the primary enzyme involved in nicotine metabolism (IV-Figure 2D). 90 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples IV-Figure 2: Elevated levels of metabolic pathways in PHH. (A) The left panel displays the z-scored heatmap of hierarchical clustering of the 1,332 differentially expressed proteins (ANOVA, BH corrected: FDR<0.05 and FC>2). The right panel shows the expression profiles of proteins in all 9 clusters (n indicates the number of proteins in the cluster). (B) Scatter plots showing the protein expression of selected members from cluster 8 and 3 relative to the 2D differentiation. (C) KEGG enrichment analysis of the protein clusters from (A). The top five enriched pathways are shown. (D) Z-scored expression of proteins related to nicotine metabolism, which is part of the ´metabolism of xenobiotics by cytochrome P450´ from (C). In summary, this analysis revealed that several progenitor markers were higher expressed in the 2D model than in the 3D model and the in vivo samples with a similar expression pattern of 3D and fetal liver samples. On the other hand, multiple liver-specific proteins were only highly expressed in PHH. 91 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples 4.2 3D liver buds are superior to 2D hepatocytes Since the above analysis indicated incomplete hepatocyte differentiations for either of the two models, a more detailed analysis was employed. To assess differences between the samples, the number of significantly altered proteins (ANOVA, BH corrected: FDR<0.05 and FC>2) relative to PHH were determined (IV-Figure 3A). Decreasing numbers of differentially expressed proteins from 2D to 3D to fetal liver suggested a convergence towards PHH. This trend was further confirmed by comparing the intensities of the top 25-1,000 most abundant proteins of PHH across all other samples (IV-Figure 3B). These analyses indicate that the 3D model better resembles PHH than the 2D model, albeit further maturation steps are required to better recapitulate the in vivo counterpart. IV-Figure 3: Protein expression of 3D samples resemble PHH better than the 2D approach. (A) Number of significantly regulated (ANOVA, BH corrected: FDR<0.05 and FC>2) proteins relative to PHH. (B) Line plot showing the mean intensity of the top 25 to 1,000 PHH proteins in mature 2D, 3D, and fetal liver. (C) Number of significantly regulated (ANOVA, BH corrected: FDR<0.05 and FC>2) downstream substrates of CDK1/2 (left panel), ERK1/2 (middle panel), and CK1/2 alpha (right panel) relative to PHH. To add additional information to the protein expression data, phosphorylation patterns were explored by analysing kinase-substrate relationships. This analysis was performed with Networkin [264], which integrates known and predicted relationships based on kinase recognition motifs. Previously, this type of analysis revealed a trend of decreasing CDK and ERK activity during the 2D hepatocyte differentiation (III-Figure 6D and 6E, III-Figure 8E). Indeed, the comparison of all samples further suggested low CDK1/2 and ERK1/2 activity in functional hepatocytes, as many substrates were downregulated in mature PHH (IV-Figure 3C, left and middle panel). A similar trend was also observed for the casein kinase (CK) 1 and 2 alpha (Figure 3C, right panel), which belong to the casein kinase family that is primarly implicated for its role in DNA repair, DNA transcription, circadian rhythm, and cell cycle control [327-330]. To elucidate on the divergence between the 2D and 3D models in more detail, a differential expression analysis was performed (IV-Figure 4A). The solute carrier SLCO1B3, which was among the 214 proteins upregulated in the 3D LBs, is for example involved in the transport of bile acids [331] and steroid conjugates in the liver [332]. Besides that, multiple metabolic pathways were enriched in the 3D samples according to KEGG analysis (IV-Figure 4B). For example, two key enzymes of the urea cycle, arginosuccinate synthase (ASS1) and arginase-1 (ARG1), were elevated 92 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples more than 5 or 35 times, respectively. Additionally, the previously mentioned liver specific aldolase (ALDOB) as well as the fatty acid-binding protein (FABP1), which plays an important role for cholesterol uptake in hepatocytes [333], were among the most upregulated proteins in LBs (IVFigure 4A). In contrast, proteins upregulated in the 2D samples suggested that the differentiation towards hepatocytes was incomplete and a more heterogeneous cell population was obtained. For example, with transthyretin (TTR), dual oxidase 2 (DUOX2), and the two transporters SLC3A2 and SLC7A5 several proteins associated with the two thyroid hormones T3 and T4 were highly increased in the 2D differentiation [334-337]. In addition, SPINK1, a serine protease from the pancreas, was highly upregulated in the 2D samples. Since the thyroid gland as well as the pancreas share their endodermal origin with the liver, an incomplete commitment from the endoderm towards hepatocytes can be suspected. To elucidate on the differentiation specificity, upregulated proteins between d11 versus d3 of the LBs and MH versus HE of the 2D dataset were compared to a number of tissue-specific marker proteins. The marker list was retrieved from Wang et al. [153] where protein expression data of 29 human tissues were acquired in order to define their tissue specific abundance. By far the highest number of upregulated markers in the LBs were liver-specific (IV-Figure 4C). While also at the end point of the 2D differentiation mainly liver-specific markers were upregulated, the number of non-liver tissue-specific markers was higher than in LBs. While this comparison further illustrated the successful differentiation into liver-like cells, this analysis also confirmed the previous notion that the 3D approach is more specific leading to less lineage bifurcation. IV-Figure 4: Comparison of 2D and 3D-derived hepatocytes. (A) Volcano plot illustrating significantly regulated proteins (in red; ANOVA, BH corrected: FDR<0.05 and FC>2) between the 2D and 3D hepatocyte differentiations. (B) KEGG enrichment of proteins significantly up or downregulated from (A). Top five 93 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples enriched pathways are shown. (C) Bar plots depicting the number of upregulated tissue specific proteins between MH and HE for the 2D or LB d11 and LB d3 for the 3D differentiation. (D) Boxplots showing the fold-change (day 11 to day 3) of selected metabolic pathways, maturation marker, redox-reducing enzymes, and proteins related to DNA replication. For metabolic pathways and DNA replication, the fold-change of all associated proteins were taken into account. Despite differences between both in vitro models, the main concepts that have been described from the previous 2D differentiation, such as the metabolic switch between the HE and IH stage (III-Figure 4 and III-Figure 8), could be confirmed in the 3D model. Also, essential hepatic pathways, like the TCA cycle, ketogenesis, and the gluconeogenesis were substantially increasing during the 3D maturation (IV-Figure 4D). Additionally, the early differentiation markers TTR and AFP showed modest changes during the LB maturation, whereas the liver-specific proteins RBP4 and ALB increased by more than 4-fold. Moreover, the expression of important redox-reducing enzymes was elevated and proteins associated with the DNA replication decreased similar as in the 2D differentiation (III-Figure 5, III-Figure 8, and III-Figure 11). 4.3 Expression of ADME/Tox-related proteins of in vitro models reflect fetal-like stage One potential application of in vitro generated hepatocytes is the toxicity assessment of therapeutic drugs during their development. Critical for this process are proteins related to absorption, distribution, metabolism, and excretion as well as toxicity (ADME/Tox), which makes them an interesting protein class to analyse. Therefore, ADME/Tox-related proteins were retrieved from Schroder et al. [338] and considered for analysis. Differential expression analysis led to 104 ADME/Tox-related proteins that were significantly different (ANOVA, BH corrected: FDR<0.05 and FC>2) in either of the four samples (2D, 3D, fetal liver, or PHH). Further hierarchical clustering of these differentially regulated proteins revealed 8 distinct expression profiles (IVFigure 5A). Cluster 3 and 4 comprised in total 17 proteins which were almost exclusively elevated in the 2D differentiation (IV-Figure 5B), such as the transcription factor FOXA1, which is essential for DE development and early lineage specification [339]. In cluster 5 the glutathione synthetase (GSS) was included, a protein that catalyses the second synthesis step of glutathione, which is an important antioxidant that is used in multiple assays to assess drug toxicity effects. Proteins that were most abundantly expressed in PHH were mainly found in cluster 6. Of note, Cluster 6 was with 56 members the biggest cluster showing that most ADME/Tox proteins were predominantly expressed in PHH and highlighting the need for further improvements in the current in vitro differentiation models. In this cluster various ABC transporters, metabolic proteins, including the alcohol and aldehyde dehydrogenase, and multiple liver-specific proteins were present (IV-Figure 5B). In addition, several proteins from the cytochrome P450 family were contained in cluster 6. Lastly, enzymes reducing ROS, such as CAT and CBR1, were highly abundant in PHH. As outlined in chapter 4.2 on page 62, this could be due to a high aerobic energy consumption of metabolic proteins driving the generation of ROS. 94 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples IV-Figure 5: Highly expressed ADME/Tox-related proteins in PHH. (A) Heatmap showing hierarchical clustering of all 104 significantly regulated proteins associated with ADME/Tox. (B) Bar graph of the relative expression of protein clusters from (A) normalized to PHH. Bars denote mean values and the error bars represent the range. In conclusion, the majority of ADME/Tox proteins were siginificantly lower expressed in the in vitro samples and in the fetal liver than in PHH, which is in line with previous results claiming that PSC-derived hepatocytes rather mimic fetal than adult hepatocytes [258]. While the expression of FOXA1, ABCC1, and GSS were rather similar among all samples, multiple proteins from cluster 6 were up to 1,000 times more abundant in PHH. Therefore, additional experiments directly assessing the metabolization of distinct metabolites in iPSC-derived hepatocytes and PHH are required prior to a broader use of iPSC-derived hepatocytes in drug toxicity assessments. 4.4 Charting in vitro hepatocyte differentiation and future optimization strategies In the following section the molecular roadmap of in vitro hepatocyte differentiation will be delineated with a special emphasis on deducing strategies for future protocol improvements (IVFigure 6). Stem cells as well as cancer cells are rapidly dividing cells [266-268]. In agreement with a high demand for cellular membranes [340, 341], the fatty acid synthase (FASN), a key enzyme for de novo lipogenesis, was the most abundant protein in the TkDA3-4 cells and among the top20 in FfI01 cells. Upon differentiation, FASN levels dropped significantly at the DE stage. A similar expression pattern was observed for the acetyl-CoA carboxylase 1 (ACACA), the rate-limiting enzyme for de novo fatty acid biosynthesis. These results suggested an immediate change of fatty acid metabolism, similar to the upregulation of FASN during neural stem cell differentiation [342]. However, the most striking metabolic switch happened between HE and IH. Putative key regulators for this switch might be the two sirtuin enzymes SIRT 1 and 2. While high levels of SIRT1 have been positively correlated with pluripotency [281, 287], the reduced expression has 95 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples promoted differentiation of stem cells [284, 288]. In agreement with reduced cell proliferation during differentiation, low SIRT1 activity has been associated with accumulation of fatty acids and carbohydrates [283]. On the contrary, SIRT2 overexpression has been associated with spontaneous cell differentiation and diminished activity of glucolytic enzymes caused by deacetylation [281]. The detailed proteomic expression analysis in this thesis revealed a simultaneous switch of both proteins after the HE stage, in which SIRT1 levels decreased significantly and SIRT2 showed a congruent increase. These diametrically opposing expression profiles appeared to be highly regulated in a time-dependent manner in both iPSC lines. The high temporal resolution of the experiments increased the understanding of processes during hepatocyte development and generated new potential starting points for protocol improvements. For example, the temporal dissection of the SIRT1 downregulation suggests that the supplementation of an SIRT1 inhibitor at this stage might increase differentiation efficiencies. This is of special interest since the supplementation of SIRT1 inhibitors, for example, have promoted neural development and hematopoiesis before [282, 284, 343]. Moreover, unbiased expression analysis led to multiple novel and stage-specific marker proteins. Among these, various proteins have been associated with Wnt signalling, a known and essential pathway for cell development [98, 344]. One novel marker was the Wnt receptor frizzled-5 (FZD5) showing high protein levels at the DE stage with a subsequent drop during further differentiation. Although additional experiments are needed to explain the specific role of FZD5 in more detail, this striking expression profile suggested a key role for this development stage. More experiments are also required to validate the influence of the observed stage-specific upregulation of the Wnt inhibitors WNT11, ESRRG, and LRP4 and the assumption that Wnt signalling is not sustained after the DE stage. These coordinated expression changes in the Wnt signalling pathway illustrate a potential starting point for protocol optimization. Several studies have shown the benefit of using Wnt activators to guide cell development, such as the addition of Wnt3a for DE specification [97, 103]. Only very few studies have investigated phosphorylation of cell differentiation so far [62, 63, 256] and none has specifically focused on hepatocyte differentiation. Hence, in this global phosphoproteomic study multiple interesting findings complementing transcriptomics and proteomics data were retrieved. One example was the temporal unravelling of protein expression and phosphorylation levels of proteins related to the cell cycle, which suggested a preceding switch observed on phosphorylation level. Consistent with previous studies the CDK activity was elevated in PSCs [345, 346] and decreased upon differentiation. However, the activity was still higher than in the in vivo samples. Several studies have shown before that CDK1 activity is crucial to maintain pluripotency [346-349] and thus, it appears plausible that a partial downregulation or inhibition of CDK1 promotes the differentiation process. 96 | P a g e Chapter IV: Benchmarking in vitro hepatocytes and in vivo liver samples IV-Figure 6: Roadmap of hepatocyte differentiation. Overview of multiple proteins and pathways significantly changing during in vitro hepatocyte differentiation. Dashed lines indicate pathways with distinct activity compared to the in vivo liver samples. Another interesting set of targetable kinases that emerged from the analysis are ERK1/2, because they were linked to early lineage specification before [350-353] but their complex role has remained inconclusive. The protein expression analyses revealed increasing abundance levels of ERK during the differentiation, whereas the phosphorylation indicated decreased activity. In addition, the corresponding substrate phosphorylation decreased after the HE stage confirming that ERK activity was not sustained in later stages. However, compared to in vivo liver samples the activity was still elevated. A similar trend was observed for CK1/2 activity, which illustrates potential targets as ERK and CK inhibitors have previously been used to support differentiation [354-356] but not in the context of hepatocytes. Although additional experiments are needed to validate these findings, a recently published study [357] reported the beneficial application of CK1 inhibitors for differentiating pancreatic progenitors. As pancreas and liver share the DE as a common origin, the utilization of ERK and CK inhibitors might be a good starting point to improve in vitro hepatocyte differentiation. 97 | P a g e Chapter V: General discussion and outlook Omics technologies to study cell line differentiation - merits and limitations ....................... 100 Single-cell proteomics - future technology to study development? ...................................... 104 99 | P a g e Chapter V: General discussion and outlook Omics technologies to study cell line differentiation - merits and limitations The first systematic study of human embryonic development emerged in the 19th century when researchers started to collect and investigate human embryos derived from pregnancy abortions and maternal deaths. Big collections, such as the Carnegie Collection (more than 10,000 specimen) or the Kyoto Collection (more than 44,000 specimen), set the base for the understanding of early human development (reviewed in [358]). Their descriptive and morphological studies helped to gain insights into sequential developmental processes and to classify different embryonic stages. Since then, the possibilities of studying development has changed drastically. On the one hand, the ability to isolate and culture human hESCs in 1998 [5] as well as the discovery of reprogramming iPSCs in 2006 [7, 8] has faciliated the study of human development in vitro. On the other hand, the methods to follow cell lineage specification have developed from descriptive studies towards interventional studies allowing the dissection of the underlying molecular mechanisms. Proteomics as the preeminent technology among protein-centric approaches Fluorescence-activated cell sorting (FACS), western blot, and immuno staining are widely used technologies to analyze protein abundance during hepatocyte differentiation [123, 253, 254]. However, since these approaches are all based on antibodies, they have some drawbacks, such as their limitation to the low number of proteins for which specific antibodies exist or the low throughput of proteins that can be analysed in an experimental setting. Moreover, these semiquantitative approaches require prior knowledge or hypotheses of candidate proteins and thereby hamper the identification of unexpected protein hits. Mass spectrometry-based proteomics overcomes some of these limitations as it provides an unbiased way to identify and quantify thousands of proteins in one experiment. Hence, the analysis is not limited to a few protein markers but provides a global picture of protein expression, which is especially interesting for the comparison of different cell types. The comparison of in vitro versus in vivo liver samples in this thesis (chapter IV) showed how important such a thorough analysis can be. Although multiple hepatocyte-specific proteins were upregulated in the in vitro samples, the global protein expression allowed the discrimination in a fetal-like and an adult liver state suggesting that the current in vitro differentiations do not completely recapitulate the in vivo development. A second example where high measuring depth proved useful was the study of in vitro hepatocyte differentiation with high temporal resolution (chapter III). Analysis of multiple time points during this process did not only allow validation of differentiation based on the expression of known protein markers, but also generation of a list of novel stage-specific proteins. These novel marker proteins can be used as validation sets for future differentiations or to enrich cells from a specific development stage using FACS. Moreover, the close investigation of such marker proteins will help to elucidate developmental concepts as shown in this thesis. For example, the substantial upregulation of FZD5 upon activation of the WNT pathway indicated that this is a crucial WNT receptor for DE specification. Several examples like this are presented in this work showing how proteomics can be utilized to retrieve new concepts and hypotheses. Although the demonstrated findings are based on statistically significant expression changes, further follow-up experiments 100 | P a g e Chapter V: General discussion and outlook are desirable to test causality and functional relevance. Thus, the protein changes during hepatocyte differentiation were confirmed with a second iPSC line to exclude cell line-specific effects. Interventional follow-up experiments including an ablation of FZD5 in iPSCs would for example help to answer the question how important the single WNT receptor for proper DE specification is. In comparison to genomics or transcriptomics, one drawback of protein-based studies is their inability to in vitro amplify analytes, which limits the detection of peptides/proteins to the sensitivity of the mass spectrometer. Therefore, proteins with low abundance may not be detected, although they could be of biological importance. Transcription factors (TFs) belong to such a key group of proteins that can define cell fate decision even at low abundance, making their study by proteomics difficult [359]. This issue is further enhanced by the DDA approach of bottomup proteomics, as the selection of ions picked for fragmentation and subsequent MS2 scans is based on their MS1 intensity. Thus, high abundant ions/peptides are more likely to be measured. However, it is not always the low abundance of proteins that hamper their detection, but also the high abundance of co-eluting peptides that can mask the identification [360]. In this work, peptides were extensively fractionated prior to MS measurement to mitigate these issues. By fractionation, the sample complexicity is reduced and thus, the number of identified peptides is increased substantially [168]. However, for thorough sample fractionation, sufficient amount of starting material is required. Because large-scale cultivation of stem cells is demanding and supplements for differentiation are expensive, sample amounts of cell differentiation experiments are often scarce. Hence, in the first part of this thesis the downscaling of input material for proteomic sample preparation was optimized, which led to the identification of more than 10,000 peptides from just 2,000 cells (Chapter II). The optimized iST and SP3 beads approaches showed very promising results without requiring much additional equipment, making these workflows attractive for sample-limited applications. Transcriptomics versus proteomics for studying cell differentiation The amplification of mRNA in transcriptomics workflows allows the sensitive detection of the global transcriptome at a very high throughput and with great coverage. In the published scRNAseq data integrated in this work [100], around 17,000 gene transcripts were covered in total. While a median of 5,500-7,500 transcripts were detected in a single cell, around 8,100 transcripts were covered across all five developmental stages. Interestingly, just 81 transcripts were identified in every single cell illustrating that missing values among single cells is a relevant drawback of this method. In contrast, the proteomics approach in this thesis prevented most missing values by labelling peptides with the isobaric TMT. This multiplexing strategy enables the concurrent identification and quantification of peptides from several samples, which reduces the issue of stochastic sampling of DDA and thus, decreases missing values within a single TMT batch. Thereby, for the vast majority of measurements either a peptide is detected in all TMT channels or it is missing completely. With this approach around 9,000 proteins were covered across all five time points, which is comparable to the 8,100 transcripts detected in the scRNA-seq dataset. Despite the comparable depth of proteomics and transcriptomics achieved in this thesis, in vitro differentiation protocols have been still mainly driven and characterized at the transcript level. Future PSC-based model systems will therefore highly benefit from focusing on proteins as the 101 | P a g e Chapter V: General discussion and outlook functional unit of the cells instead of sole transcriptomic analysis. This becomes even more evident in view of a rather poor temporal correlation of mRNA and protein abundances as shown in this work (chapter III) and in previous studies [140, 153]. Interestingly, the correlation between the two most separated time points, iPSCs and MH, was acceptable indicating that mRNA levels can be used to infer overall protein abundances to some extent, but fail to delineate the exact temporal expression. Besides an inherent chronological sequence of transcription and translation, the diverging dynamics might be partly due to transcripts that are actually not translated into proteins and due to the on average 5 times higher stability of proteins compared to mRNA [361]. Especially in the dynamic environment of cell fate decisions, a high degree of fluctuations of mRNA might only partly mirror the actual presence of functional proteins in the respective developmental stage. This notion is further supported by the preceding decrease in transcript levels of proteins related to DNA replication (Chapter III). Nevertheless, the correlation between mRNA and protein levels is complex and several other factors, such as the vastly different dynamic range also contribute to the observed discrepancy. While transcripts quantified by RNA-seq span about four orders of magnitude, protein abundances recorded by MS span around eight orders of magnitude [153], further emphasizing the difficulties in direct correlation of the two omics approaches. To this notion, Schwanhausser and colleagues have shown that just 40% of the variance on protein levels can be explained from mRNA levels [361]. Thus, other processes, such as mRNA degradation or protein turnover, also influence actual protein levels in a cell. The buffering effect at protein level is yet another process that complicates this correlation. This effect describes the observation that the abundances of some proteins are not affected even with substantial changes on transcript level [140, 362]. Moreover, the cell lysate-based proteomics approach utilized in this study does not detect secreted proteins and thus biases the correlation to proteins residing in the cell. In any case, future developmental studies would benefit from the complementary study of secreted proteins as autocrine and paracrine signalling are crucial mediators of cell fate decisions. In addition to detecting protein expression levels, mass spectrometry-based proteomics is also capable of analysing PTMs, which allows to derive information about the functionality of proteins, such as their enzymatic activity. This is of particular interest because the activity of proteins cannot always be inferred from their higher expression levels. So far only a limited number of studies exist that analysed PTMs in the context of cell differentiation on a large scale, although many PTMs regulate essential signalling pathways. For example, some studies have investigated phosphorylation changes during early stem cell differentiation, which revealed insights about kinases and signalling pathways involved in maintaining pluripotency as well as general differentiation processes [62, 63, 256]. Yet, so far a global survey of PTMs in the context of hepatocyte differentiation has been lacking. As discussed previously, sample amounts from differentiation experiments can be scarce, which is even more challenging for the study of PTMs as they usually require specific enrichment strategies due to their low abundance [363]. In this study, phosphorylated peptides were enriched, yielding around 12,000 P-sites, from limited sample material. However, others have identified more than 37,000 P-sites from 10 times more peptide quantity [174], demonstrating that large amounts of starting material are essential for comprehensive phosphorylation analyses. Although technological advances in the field of mass spectrometry-based proteomics have even led to the identification of nearly 300,000 distinct P102 | P a g e Chapter V: General discussion and outlook sites [364] (PhosphoSitePlus® 12.01.2022), data mining remains one of the major bottlenecks for phosphoproteomics approaches. For more than 95% of these distinct P-sites no known kinase or biological function has been reported [365]. Furthermore, just 20% of the known kinases are responsible for phosphorylating more than 80% of the currently annotated substrates. This imbalance is presumably caused by the lack of highly selective tool compounds for most kinases as well as the bias of research studies towards a limited set of kinases [365]. However, as kinases and phosphatases recognize their substrates to some extent by specific amino acid sequences or motifs, algorithms can overlay this information to predict kinase-substrate relationships. In this thesis, the Networkin platform [264] was utilized to report known and predicted downstream substrates of CDK and ERK, as their phosphorylation pattern suggested significantly altered kinase activity. Indeed, the decreased phosphorylation of downstream substrates supported this finding. However, it is important to note that such an analysis requires the knowledge of the specific kinase motif, which again leads to a bias towards the low number of well-studied kinases. Considering acetylation events, the availability of enzyme-substrate information in public databases is very poor. One approach to shed more light on HDAC substrates was the study of acetylation changes after treating cells with 19 different HDAC inhibitors [366]. Although high inhibitor selectivity is required to infer the exact enzyme-substrate relationships, such experiments are important starting points. A similiar approach analyzes the acetylation response upon knockdown of writer (acetylation) or eraser (deacetylation) proteins. By retrieving results from such studies [281, 285], some of the acetylation events in this work could be linked to SIRT2 and the observed metabolic switch. Since the biological function for most Ac-sites remains elusive, elaborated follow-up experiments are required. Therefore, the global analysis of acetylation dynamics are important resources, which will become increasingly valuable as soon as the functional annotations of Ac-sites can be improved. Nevertheless, compared to single-cell transcriptomics, comprehensive proteomics approaches are so far limited to analyse the expression of bulk experiments. Indeed, the information from each single cell is lost by the proteomics approach because the averaged signal from all cells is recorded. Considering that in vitro differentiations often result in heterogenous cell populations, single-cell profiles would be however highly desirable. Therefore, the information obtained from the scRNAseq experiment was exploited to investigate cell heterogeneity at the single cell level. Almost all cells commited successfully to the DE lineage and around 90% expressed IH specific marker genes. Only the HE and MH stages showed slightly higher heterogeneity [100]. Hence, the high differentiation efficiency allowed the measurement of protein expression of the target cells by bulk proteomics. Indeed, the high temporal separation in the PCA analysis derived from the proteomics data indicated that the differentiation purity as well as the analysis depth facilitated the dissection of the different development stages. To conclude, both technologies have their pros and cons. While the high sensitivity of transcriptomics allows the study of single cells with good coverage, the measurement of the actual functional unit in proteomics better reflects the physiological function of the investigated cells. Taken together, both methods should be implemented in a complementary way for the investigation of lineage decisions and cellular functions. 103 | P a g e Chapter V: General discussion and outlook Single-cell proteomics - future technology to study development? Since the idea of single-cell proteomics emerged, it has mostly been devoted to antibody-based experiments. For example western blot analysis provides enough sensitivity to detect protein expression of single cells, but only for a very limited number of protein targets [367]. The same is true for fluorescence-activated cell sorting (FACS), which is a widely used technology to analyse and sort single cells. Combining the single cell resolution of flow-cytometry with MS led to the development of the CyTOF (cytometry by time-of-flight mass spectrometry). This technology has emerged in the field of single-cell proteomics in 2008 [368] and is based on the quantitative measurement of metal-conjugated antibodies (as reviewed in [369]). This technology has diminished one of the main drawbacks of FACS, the spectral overlap of fluorescent antibodies, and allows the simultaneous detection of more than 40 different proteins at once [370]. While these approaches are restricted in the numbers of protein targets that can be screened in parallel, several thousand cells can be analysed per second. However, major drawbacks of these technologies are the dependency on antibody availability and specificity as well as their targeted nature. In contrast, mass spectrometry-based approaches that do not rely on antibodies can study a much larger number of proteins, but with lower throughput due to the very time-consuming data acquisition. One attempt to overcome this limitation was SCOPE-MS (Single Cell ProtEomics) introduced by the Slavov group [371]. Here, the throughput was increased by multiplexing single cells with TMT, which enables the simultaneous analysis of up to 16 samples at the same time [372]. Moreover, a so-called carrier or booster channel, a sample with up to a few hundred or even a thousand times the protein amount of a single cell, is added to increase the signal intensity and thereby the identification rate of peptides. However, this approach is discussed controversial [373] because the dynamic range of mass spectrometers is limited and the signal from the carrier proteome overlays the single cell signals, which negatively affects the quantification accuracy (reviewed in [373, 374]). Another way to enhance sensitivity of single-cell proteomics is to improve methods for sample preparation, such as the nanoPOTS (nanodroplet processing in one pot for trace samples) technology [375], which utilizes a pipetting robot that is capable of handling nanoliter droplets. Thereby, the loss during sample preparation can be decreased, which resulted in the quantification of around 670 proteins from single cells [376]. Since its first publication in 2018, the idea of nanoPOTS was further developed and is now a commercially available picoliter dispensing platform from Cellenion (cellenONE F1.4, Cellenion, France). This system allows to perform the complete sample preparation from sorting and lysing of single cells to the digestion of proteins, and finally the labelling with TMT. Coupling this platform with a nanoPOTS autosampler [377] allows the direct injection of samples from the nanowells to the LC-MS system, thereby increasing sensitivity and throughput for single-cell analysis. Employing this workflow to 100 single cells led to the robust quantification of around 1,500 proteins from each single cell [378]. Although a carrier proteome was used in this study, the miniaturization and automatization is very promising for future single-cell proteomics applications. Apart from the sample preparation, several developments on the LC-MS side have also improved sensitivity. For example the FAIMS (Field Assymmetric Ion Mobility Spectrometry) device, which is able to remove singly charged background contaminants, gained 10 times more sensitivity compared to previous LC-MS setups [379]. With this ion mobility device the ion accumulation time can also be extended 104 | P a g e Chapter V: General discussion and outlook resulting in enhanced signal intensity and thus, increased peptide identification [380]. Furthermore, decreasing the column diameter and flow rate to 20 µm and 20 nL/min, respectively, has shown augmented sensitivity compared to the standard proteomics setup, which is performed with 75 µm columns at 300 nL/min [381, 382]. In recent years, several innovations in sample preparation and instrumentation have increased sensitivity and led to major advances in the field of single-cell proteomics. This area of research has gained more interest, and single-cell analysis has made great strides in feasibility and comprehensiveness. However, some of the general problems of mass spectrometry-based proteomics discussed previously are even more pronounced at the single-cell level. For example, the dynamic range of proteins in a cell cannot be reduced by fractionation because of the limited sample amount. Since proteomics tends to identify the highly abundant proteins, it will be even more difficult to achieve significant depth. However, depth will need to be increased considerably to compete with scRNA-seq, which routinely measures transcripts from more than 5,000 genes. So far, mass spectrometry-based single-cell proteomics is still in its infancy, and the next years will show whether it can compete with scRNA-seq or whether other single-cell proteomics approaches, such as nanopore sequencing [383], will emerge as the leading technology. The development will be particularly interesting with regard to the analysis of PTMs at the single-cell level, thereby offering the opportunity to thruthfully reconstruct a complete picture of the effectors of cellular function and dysfunction in an unprecedented manner. 105 | P a g e References [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] W. Zakrzewski, M. Dobrzynski, M. Szymonowicz, and Z. Rybak, "Stem cells: past, present, and future," Stem Cell Res Ther, vol. 10, no. 1, p. 68, Feb 26 2019, doi: 10.1186/s13287019-1165-5. F. Lu and Y. Zhang, "Cell totipotency: molecular features, induction, and maintenance," Natl Sci Rev, vol. 2, no. 2, pp. 217-225, Jun 2015, doi: 10.1093/nsr/nwv009. M. Boiani, E. Casser, G. Fuellen, and E. S. Christians, "Totipotency continuity from zygote to early blastomeres: a model under revision," (in eng), Reproduction, vol. 158, no. 2, pp. R49-r65, Aug 2019, doi: 10.1530/rep-18-0462. T. Vazin and W. J. Freed, "Human embryonic stem cells: derivation, culture, and differentiation: a review," Restor Neurol Neurosci, vol. 28, no. 4, pp. 589-603, 2010, doi: 10.3233/RNN-2010-0543. J. A. Thomson et al., "Embryonic stem cell lines derived from human blastocysts," (in eng), Science (New York, N.Y.), vol. 282, no. 5391, pp. 1145-7, Nov 6 1998, doi: 10.1126/science.282.5391.1145. B. E. Reubinoff, M. F. Pera, C. Y. Fong, A. Trounson, and A. Bongso, "Embryonic stem cell lines from human blastocysts: somatic differentiation in vitro," (in eng), Nat Biotechnol, vol. 18, no. 4, pp. 399-404, Apr 2000, doi: 10.1038/74447. K. Takahashi and S. Yamanaka, "Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors," Cell, vol. 126, no. 4, pp. 663-76, Aug 25 2006, doi: 10.1016/j.cell.2006.07.024. K. Takahashi et al., "Induction of pluripotent stem cells from adult human fibroblasts by defined factors," Cell, vol. 131, no. 5, pp. 861-72, Nov 30 2007, doi: 10.1016/j.cell.2007.11.019. Y. Kim, Y. A. Rim, H. Yi, N. Park, S. H. Park, and J. H. Ju, "The Generation of Human Induced Pluripotent Stem Cells from Blood Cells: An Efficient Protocol Using Serial Plating of Reprogrammed Cells by Centrifugation," Stem Cells Int, vol. 2016, p. 1329459, 2016, doi: 10.1155/2016/1329459. T. Aasen and J. C. Izpisua Belmonte, "Isolation and cultivation of human keratinocytes from skin or plucked hair for the generation of induced pluripotent stem cells," Nature protocols, vol. 5, no. 2, pp. 371-82, Feb 2010, doi: 10.1038/nprot.2009.241. J. Staerk et al., "Reprogramming of human peripheral blood cells to induced pluripotent stem cells," Cell stem cell, vol. 7, no. 1, pp. 20-4, Jul 2 2010, doi: 10.1016/j.stem.2010.06.002. T. Zhou et al., "Generation of human induced pluripotent stem cells from urine samples," Nature protocols, vol. 7, no. 12, pp. 2080-9, Dec 2012, doi: 10.1038/nprot.2012.115. Y. F. Jiang, M. Chen, N. N. Zhang, H. J. Yang, Q. Rui, and Y. F. Zhou, "In vitro and in vivo differentiation of induced pluripotent stem cells generated from urine-derived cells into cardiomyocytes," Biol Open, vol. 7, no. 1, Jan 8 2018, doi: 10.1242/bio.029157. L. Wang et al., "Generation of integration-free neural progenitor cells from cells in human urine," Nature methods, vol. 10, no. 1, pp. 84-9, Jan 2013, doi: 10.1038/nmeth.2283. G. Liu, B. T. David, M. Trawczynski, and R. G. Fessler, "Advances in Pluripotent Stem Cells: History, Mechanisms, Technologies, and Applications," Stem Cell Rev Rep, vol. 16, no. 1, pp. 3-32, Feb 2020, doi: 10.1007/s12015-019-09935-x. N. Malik and M. S. Rao, "A review of the methods for human iPSC derivation," Methods Mol Biol, vol. 997, pp. 23-33, 2013, doi: 10.1007/978-1-62703-348-0_3. 107 | P a g e References [17] [18] [19] [20] [21] [22] [23] [24] [25] [26] [27] [28] [29] [30] [31] M. Stadtfeld, N. Maherali, D. T. Breault, and K. Hochedlinger, "Defining molecular cornerstones during fibroblast to iPS cell reprogramming in mouse," Cell stem cell, vol. 2, no. 3, pp. 230-40, Mar 6 2008, doi: 10.1016/j.stem.2008.02.001. H. O. Li et al., "A cytoplasmic RNA vector derived from nontransmissible Sendai virus with efficient gene transfer and expression," (in eng), J Virol, vol. 74, no. 14, pp. 6564-9, Jul 2000, doi: 10.1128/jvi.74.14.6564-6569.2000. N. Fusaki, H. Ban, A. Nishiyama, K. Saeki, and M. Hasegawa, "Efficient induction of transgene-free human pluripotent stem cells using a vector based on Sendai virus, an RNA virus that does not integrate into the host genome," (in eng), Proc Jpn Acad Ser B Phys Biol Sci, vol. 85, no. 8, pp. 348-62, 2009, doi: 10.2183/pjab.85.348. K. Kaji, K. Norrby, A. Paca, M. Mileikovsky, P. Mohseni, and K. Woltjen, "Virus-free induction of pluripotency and subsequent excision of reprogramming factors," Nature, vol. 458, no. 7239, pp. 771-5, Apr 9 2009, doi: 10.1038/nature07864. P. Mali et al., "Butyrate greatly enhances derivation of human induced pluripotent stem cells by promoting epigenetic remodeling and the expression of pluripotency-associated genes," Stem cells (Dayton, Ohio), vol. 28, no. 4, pp. 713-20, Apr 2010, doi: 10.1002/stem.402. J. Deinsberger, D. Reisinger, and B. Weber, "Global trends in clinical trials involving pluripotent stem cells: a systematic multi-database analysis," NPJ Regen Med, vol. 5, p. 15, 2020, doi: 10.1038/s41536-020-00100-4. Y. Qiao, O. S. Agboola, X. Hu, Y. Wu, and L. Lei, "Tumorigenic and Immunogenic Properties of Induced Pluripotent Stem Cells: a Promising Cancer Vaccine," Stem Cell Rev Rep, vol. 16, no. 6, pp. 1049-1061, Dec 2020, doi: 10.1007/s12015-020-10042-5. M. Yoshihara, A. Oguchi, and Y. Murakawa, "Genomic Instability of iPSCs and Challenges in Their Clinical Applications," (in eng), Adv Exp Med Biol, vol. 1201, pp. 23-47, 2019, doi: 10.1007/978-3-030-31206-0_2. Y. Jiang et al., "Pluripotency of mesenchymal stem cells derived from adult marrow," (in eng), Nature, vol. 418, no. 6893, pp. 41-9, Jul 4 2002, doi: 10.1038/nature00870. D. Baksh, R. Yao, and R. S. Tuan, "Comparison of proliferative and multilineage differentiation potential of human mesenchymal stem cells derived from umbilical cord and bone marrow," Stem cells (Dayton, Ohio), vol. 25, no. 6, pp. 1384-92, Jun 2007, doi: 10.1634/stemcells.2006-0709. J. Saben et al., "Distinct adipogenic differentiation phenotypes of human umbilical cord mesenchymal cells dependent on adipogenic conditions," Exp Biol Med (Maywood), vol. 239, no. 10, pp. 1340-51, Oct 2014, doi: 10.1177/1535370214539225. J. Y. Hsieh, Y. S. Fu, S. J. Chang, Y. H. Tsuang, and H. W. Wang, "Functional module analysis reveals differential osteogenic and stemness potentials in human mesenchymal stem cells from bone marrow and Wharton's jelly of umbilical cord," (in eng), Stem Cells Dev, vol. 19, no. 12, pp. 1895-910, Dec 2010, doi: 10.1089/scd.2009.0485. Y. Yang, H. Lin, H. Shen, B. Wang, G. Lei, and R. S. Tuan, "Mesenchymal stem cell-derived extracellular matrix enhances chondrogenic phenotype of and cartilage formation by encapsulated chondrocytes in vitro and in vivo," Acta Biomater, vol. 69, pp. 71-82, Mar 15 2018, doi: 10.1016/j.actbio.2017.12.043. S. M. Phadnis et al., "Human bone marrow-derived mesenchymal cells differentiate and mature into endocrine pancreatic lineage in vivo," (in eng), Cytotherapy, vol. 13, no. 3, pp. 279-93, Mar 2011, doi: 10.3109/14653249.2010.523108. P. Stock, S. Bruckner, S. Winkler, M. M. Dollinger, and B. Christ, "Human bone marrow mesenchymal stem cell-derived hepatocytes improve the mouse liver after acute acetaminophen intoxication by preventing progress of injury," Int J Mol Sci, vol. 15, no. 4, pp. 7004-28, Apr 22 2014, doi: 10.3390/ijms15047004. 108 | P a g e References [32] [33] [34] [35] [36] [37] [38] [39] [40] [41] [42] [43] [44] [45] [46] [47] [48] [49] A. Wilkins, K. Kemp, M. Ginty, K. Hares, E. Mallam, and N. Scolding, "Human bone marrowderived mesenchymal stem cells secrete brain-derived neurotrophic factor which promotes neuronal survival in vitro," Stem Cell Res, vol. 3, no. 1, pp. 63-70, Jul 2009, doi: 10.1016/j.scr.2009.02.006. J. Dulak, K. Szade, A. Szade, W. Nowak, and A. Józkowicz, "Adult stem cells: hopes and hypes of regenerative medicine," (in eng), Acta Biochim Pol, vol. 62, no. 3, pp. 329-37, 2015, doi: 10.18388/abp.2015_1023. D. G. Phinney and D. J. Prockop, "Concise review: mesenchymal stem/multipotent stromal cells: the state of transdifferentiation and modes of tissue repair--current views," Stem cells (Dayton, Ohio), vol. 25, no. 11, pp. 2896-902, Nov 2007, doi: 10.1634/stemcells.20070637. Y. Han, X. Li, Y. Zhang, Y. Han, F. Chang, and J. Ding, "Mesenchymal Stem Cells for Regenerative Medicine," Cells, vol. 8, no. 8, Aug 13 2019, doi: 10.3390/cells8080886. K. Ko et al., "Induction of pluripotency in adult unipotent germline stem cells," Cell stem cell, vol. 5, no. 1, pp. 87-96, Jul 2 2009, doi: 10.1016/j.stem.2009.05.025. K. Ko, M. J. Arauzo-Bravo, J. Kim, M. Stehling, and H. R. Scholer, "Conversion of adult mouse unipotent germline stem cells into pluripotent stem cells," Nature protocols, vol. 5, no. 5, pp. 921-8, May 2010, doi: 10.1038/nprot.2010.44. Z. Chen, Z. Li, and Z. He, "Plasticity of male germline stem cells and their applications in reproductive and regenerative medicine," Asian J Androl, vol. 17, no. 3, pp. 367-72, MayJun 2015, doi: 10.4103/1008-682X.143739. M. Berdasco and M. Esteller, "DNA methylation in stem cell renewal and multipotency," (in eng), Stem Cell Res Ther, vol. 2, no. 5, p. 42, Oct 31 2011, doi: 10.1186/scrt83. D. E. Cohen and D. Melton, "Turning straw into gold: directing cell fate for regenerative medicine," Nat Rev Genet, vol. 12, no. 4, pp. 243-52, Apr 2011, doi: 10.1038/nrg2938. P. Cahan and G. Q. Daley, "Origins and implications of pluripotent stem cell variability and heterogeneity," Nat Rev Mol Cell Biol, vol. 14, no. 6, pp. 357-68, Jun 2013, doi: 10.1038/nrm3584. C. Jensen and Y. Teng, "Is It Time to Start Transitioning From 2D to 3D Cell Culture?," Front Mol Biosci, vol. 7, p. 33, 2020, doi: 10.3389/fmolb.2020.00033. D. Antoni, H. Burckel, E. Josset, and G. Noel, "Three-dimensional cell culture: a breakthrough in vivo," Int J Mol Sci, vol. 16, no. 3, pp. 5517-27, Mar 11 2015, doi: 10.3390/ijms16035517. M. A. Lancaster and J. A. Knoblich, "Organogenesis in a dish: modeling development and disease using organoid technologies," Science (New York, N.Y.), vol. 345, no. 6194, p. 1247125, Jul 18 2014, doi: 10.1126/science.1247125. M. A. Lancaster et al., "Cerebral organoids model human brain development and microcephaly," Nature, vol. 501, no. 7467, pp. 373-9, Sep 19 2013, doi: 10.1038/nature12517. F. Antonica et al., "Generation of functional thyroid from embryonic stem cells," Nature, vol. 491, no. 7422, pp. 66-71, Nov 1 2012, doi: 10.1038/nature11525. F. Sampaziotis et al., "Cholangiocytes derived from human induced pluripotent stem cells for disease modeling and drug validation," Nat Biotechnol, vol. 33, no. 8, pp. 845-852, Aug 2015, doi: 10.1038/nbt.3275. J. F. Dekkers et al., "Characterizing responses to CFTR-modulating drugs using rectal organoids derived from subjects with cystic fibrosis," (in eng), Sci Transl Med, vol. 8, no. 344, p. 344ra84, Jun 22 2016, doi: 10.1126/scitranslmed.aad8278. Y. Du et al., "Development of a miniaturized 3D organoid culture platform for ultra-highthroughput screening," J Mol Cell Biol, vol. 12, no. 8, pp. 630-643, Aug 1 2020, doi: 10.1093/jmcb/mjaa036. 109 | P a g e References [50] [51] [52] [53] [54] [55] [56] [57] [58] [59] [60] [61] [62] [63] [64] [65] C. Mollinari, J. Zhao, L. Lupacchini, E. Garaci, D. Merlo, and G. Pei, "Transdifferentiation: a new promise for neurodegenerative diseases," Cell Death Dis, vol. 9, no. 8, p. 830, Aug 6 2018, doi: 10.1038/s41419-018-0891-4. R. L. Davis, H. Weintraub, and A. B. Lassar, "Expression of a single transfected cDNA converts fibroblasts to myoblasts," (in eng), Cell, vol. 51, no. 6, pp. 987-1000, Dec 24 1987, doi: 10.1016/0092-8674(87)90585-x. Q. Zhou, J. Brown, A. Kanarek, J. Rajagopal, and D. A. Melton, "In vivo reprogramming of adult pancreatic exocrine cells to beta-cells," Nature, vol. 455, no. 7213, pp. 627-32, Oct 2 2008, doi: 10.1038/nature07314. R. Zhang et al., "In vivo cardiac reprogramming contributes to zebrafish heart regeneration," Nature, vol. 498, no. 7455, pp. 497-501, Jun 27 2013, doi: 10.1038/nature12322. Y.-C. Wang, S. E. Peterson, and J. F. Loring, "Protein post-translational modifications and regulation of pluripotency in human stem cells," Cell Research, vol. 24, no. 2, pp. 143-160, 2013, doi: 10.1038/cr.2013.151. S. M. Buckley et al., "Regulation of pluripotency and cellular reprogramming by the ubiquitin-proteasome system," (in eng), Cell stem cell, vol. 11, no. 6, pp. 783-98, Dec 7 2012, doi: 10.1016/j.stem.2012.09.011. S. Tahmasebi et al., "Sumoylation of Krüppel-like factor 4 inhibits pluripotency induction but promotes adipocyte differentiation," (in eng), J Biol Chem, vol. 288, no. 18, pp. 12791804, May 3 2013, doi: 10.1074/jbc.M113.465443. Q. Yan et al., "O-fucose modulates Notch-controlled blood lineage commitment," (in eng), Am J Pathol, vol. 176, no. 6, pp. 2921-34, Jun 2010, doi: 10.2353/ajpath.2010.090702. M. E. Torres-Padilla, D. E. Parfitt, T. Kouzarides, and M. Zernicka-Goetz, "Histone arginine methylation regulates pluripotency in the early mouse embryo," (in eng), Nature, vol. 445, no. 7124, pp. 214-8, Jan 11 2007, doi: 10.1038/nature05458. Y. L. Deribe, T. Pawson, and I. Dikic, "Post-translational modifications in signal integration," (in eng), Nat Struct Mol Biol, vol. 17, no. 6, pp. 666-72, Jun 2010, doi: 10.1038/nsmb.1842. A. Bononi et al., "Protein kinases and phosphatases in the control of cell fate," (in eng), Enzyme Res, vol. 2011, p. 329098, 2011, doi: 10.4061/2011/329098. D. L. Swaney, C. D. Wenger, J. A. Thomson, and J. J. Coon, "Human embryonic stem cell phosphoproteome revealed by electron transfer dissociation tandem mass spectrometry," (in eng), Proceedings of the National Academy of Sciences of the United States of America, vol. 106, no. 4, pp. 995-1000, Jan 27 2009, doi: 10.1073/pnas.0811964106. D. Van Hoof et al., "Phosphorylation dynamics during early differentiation of human embryonic stem cells," Cell stem cell, vol. 5, no. 2, pp. 214-26, Aug 7 2009, doi: 10.1016/j.stem.2009.05.021. P. Yang et al., "Multi-omic Profiling Reveals Dynamics of the Phased Progression of Pluripotency," Cell Syst, vol. 8, no. 5, pp. 427-445 e10, May 22 2019, doi: 10.1016/j.cels.2019.03.012. T. Narita, B. T. Weinert, and C. Choudhary, "Functions and mechanisms of non-histone protein acetylation," Nat Rev Mol Cell Biol, vol. 20, no. 3, pp. 156-174, Mar 2019, doi: 10.1038/s41580-018-0081-3. V. G. Allfrey, R. Faulkner, and A. E. Mirsky, "ACETYLATION AND METHYLATION OF HISTONES AND THEIR POSSIBLE ROLE IN THE REGULATION OF RNA SYNTHESIS," (in eng), Proceedings of the National Academy of Sciences of the United States of America, vol. 51, no. 5, pp. 786-94, May 1964, doi: 10.1073/pnas.51.5.786. 110 | P a g e References [66] [67] [68] [69] [70] [71] [72] [73] [74] [75] [76] [77] [78] [79] [80] [81] [82] S. M. Görisch, M. Wachsmuth, K. F. Tóth, P. Lichter, and K. Rippe, "Histone acetylation increases chromatin accessibility," (in eng), J Cell Sci, vol. 118, no. Pt 24, pp. 5825-34, Dec 15 2005, doi: 10.1242/jcs.02689. S. Efroni et al., "Global transcription in pluripotent embryonic stem cells," (in eng), Cell stem cell, vol. 2, no. 5, pp. 437-47, May 8 2008, doi: 10.1016/j.stem.2008.03.021. D. Huangfu et al., "Induction of pluripotent stem cells by defined factors is greatly improved by small-molecule compounds," (in eng), Nat Biotechnol, vol. 26, no. 7, pp. 7957, Jul 2008, doi: 10.1038/nbt1418. S. J. Kang, Y. I. Park, B. So, and H. G. Kang, "Sodium butyrate efficiently converts fully reprogrammed induced pluripotent stem cells from mouse partially reprogrammed cells," (in eng), Cell Reprogram, vol. 16, no. 5, pp. 345-54, Oct 2014, doi: 10.1089/cell.2013.0087. M. Borowiak et al., "Small Molecules Efficiently Direct Endodermal Differentiation of Mouse and Human Embryonic Stem Cells," Cell stem cell, vol. 4, no. 4, pp. 348-358, 2009, doi: 10.1016/j.stem.2009.01.014. J. de Boer, R. Licht, M. Bongers, T. van der Klundert, R. Arends, and C. van Blitterswijk, "Inhibition of histone acetylation as a tool in bone tissue engineering," (in eng), Tissue Eng, vol. 12, no. 10, pp. 2927-37, Oct 2006, doi: 10.1089/ten.2006.12.2927. X. Yao, J. R. Zhang, H. R. Huang, L. C. Dai, Q. J. Liu, and M. Zhang, "Histone deacetylase inhibitor promotes differentiation of embryonic stem cells into neural cells in adherent monoculture," (in eng), Chin Med J (Engl), vol. 123, no. 6, pp. 734-8, Mar 20 2010. J. Hsieh, K. Nakashima, T. Kuwabara, E. Mejia, and F. H. Gage, "Histone deacetylase inhibition-mediated neuronal differentiation of multipotent adult neural progenitor cells," (in eng), Proceedings of the National Academy of Sciences of the United States of America, vol. 101, no. 47, pp. 16659-64, Nov 23 2004, doi: 10.1073/pnas.0407643101. C. Elizalde et al., "Histone deacetylase 3 modulates the expansion of human hematopoietic stem cells," (in eng), Stem Cells Dev, vol. 21, no. 14, pp. 2581-91, Sep 20 2012, doi: 10.1089/scd.2011.0698. C. Haumaitre, O. Lenoir, and R. Scharfmann, "Histone deacetylase inhibitors modify pancreatic cell fate determination and amplify endocrine progenitors," (in eng), Mol Cell Biol, vol. 28, no. 20, pp. 6373-83, Oct 2008, doi: 10.1128/mcb.00413-08. L. Huang et al., "Commitment and oncogene-induced plasticity of human stem cellderived pancreatic acinar and ductal organoids," (in eng), Cell stem cell, vol. 28, no. 6, pp. 1090-1104.e6, Jun 3 2021, doi: 10.1016/j.stem.2021.03.022. H. S. Han, G. Kang, J. S. Kim, B. H. Choi, and S. H. Koo, "Regulation of glucose metabolism from a liver-centric perspective," Exp Mol Med, vol. 48, p. e218, Mar 11 2016, doi: 10.1038/emm.2015.122. R. C. Nordlie, J. D. Foster, and A. J. Lange, "Regulation of glucose production by the liver," (in eng), Annu Rev Nutr, vol. 19, pp. 379-406, 1999, doi: 10.1146/annurev.nutr.19.1.379. G. Schreiber, "The synthesis and secretion of plasma proteins in the liver," Pathology, vol. 10, no. 4, p. 394, 1978/01/01/ 1978, doi: https://doi.org/10.1016/S0031-3025(16)398178. P. B. Watkins, "Drug safety sciences and the bottleneck in drug development," (in eng), Clin Pharmacol Ther, vol. 89, no. 6, pp. 788-90, Jun 2011, doi: 10.1038/clpt.2011.63. I. J. Onakpoya, C. J. Heneghan, and J. K. Aronson, "Post-marketing withdrawal of 462 medicinal products because of adverse drug reactions: a systematic review of the world literature," (in eng), BMC Med, vol. 14, p. 10, Feb 4 2016, doi: 10.1186/s12916-016-05532. W. Seo and W. I. Jeong, "Hepatic non-parenchymal cells: Master regulators of alcoholic liver disease?," World J Gastroenterol, vol. 22, no. 4, pp. 1348-56, Jan 28 2016, doi: 10.3748/wjg.v22.i4.1348. 111 | P a g e References [83] S. Lemoinne, A. Cadoret, H. El Mourabit, D. Thabut, and C. Housset, "Origins and functions of liver myofibroblasts," Biochim Biophys Acta, vol. 1832, no. 7, pp. 948-54, Jul 2013, doi: 10.1016/j.bbadis.2013.02.019. [84] T. Higashi, S. L. Friedman, and Y. Hoshida, "Hepatic stellate cells as key target in liver fibrosis," Adv Drug Deliv Rev, vol. 121, pp. 27-42, Nov 1 2017, doi: 10.1016/j.addr.2017.05.007. [85] L. D. Deleve, X. Wang, and Y. Guo, "Sinusoidal endothelial cells prevent rat stellate cell activation and promote reversion to quiescence," (in eng), Hepatology (Baltimore, Md.), vol. 48, no. 3, pp. 920-30, Sep 2008, doi: 10.1002/hep.22351. [86] L. D. DeLeve and A. C. Maretti-Mira, "Liver Sinusoidal Endothelial Cell: An Update," Semin Liver Dis, vol. 37, no. 4, pp. 377-387, Nov 2017, doi: 10.1055/s-0037-1617455. [87] R. Malik, C. Selden, and H. Hodgson, "The role of non-parenchymal cells in liver growth," (in eng), Semin Cell Dev Biol, vol. 13, no. 6, pp. 425-31, Dec 2002, doi: 10.1016/s1084952102001301. [88] C. E. Murry and G. Keller, "Differentiation of embryonic stem cells to clinically relevant populations: lessons from embryonic development," (in eng), Cell, vol. 132, no. 4, pp. 66180, Feb 22 2008, doi: 10.1016/j.cell.2008.02.008. [89] A. M. Zorn, "Liver development," in StemBook. Cambridge (MA): Harvard Stem Cell Institute Copyright: © 2008 Aaron M. Zorn., 2008. [90] S. Yagi, M. Hirata, Y. Miyachi, and S. Uemoto, "Liver Regeneration after Hepatectomy and Partial Liver Transplantation," (in eng), Int J Mol Sci, vol. 21, no. 21, Nov 9 2020, doi: 10.3390/ijms21218414. [91] K. Abe et al., "Endoderm-specific gene expression in embryonic stem cells differentiated to embryoid bodies," (in eng), Exp Cell Res, vol. 229, no. 1, pp. 27-34, Nov 25 1996, doi: 10.1006/excr.1996.0340. [92] A. A. Palakkan, J. Nanda, and J. A. Ross, "Pluripotent stem cells to hepatocytes, the journey so far," Biomed Rep, vol. 6, no. 4, pp. 367-373, Apr 2017, doi: 10.3892/br.2017.867. [93] K. D. Tremblay, P. A. Hoodless, E. K. Bikoff, and E. J. Robertson, "Formation of the definitive endoderm in mouse is a Smad2-dependent process," (in eng), Development (Cambridge, England), vol. 127, no. 14, pp. 3079-90, Jul 2000. [94] S. D. Vincent, N. R. Dunn, S. Hayashi, D. P. Norris, and E. J. Robertson, "Cell fate decisions within the mouse organizer are governed by graded Nodal signals," Genes & development, vol. 17, no. 13, pp. 1646-62, Jul 1 2003, doi: 10.1101/gad.1100503. [95] K. A. D'Amour, A. D. Agulnick, S. Eliazer, O. G. Kelly, E. Kroon, and E. E. Baetge, "Efficient differentiation of human embryonic stem cells to definitive endoderm," Nat Biotechnol, vol. 23, no. 12, pp. 1534-41, Dec 2005, doi: 10.1038/nbt1163. [96] M. Nakanishi et al., "Directed induction of anterior and posterior primitive streak by Wnt from embryonic stem cells cultured in a chemically defined serum-free medium," FASEB J, vol. 23, no. 1, pp. 114-22, Jan 2009, doi: 10.1096/fj.08-111203. [97] S. Engert, I. Burtscher, W. P. Liao, S. Dulev, G. Schotta, and H. Lickert, "Wnt/beta-catenin signalling regulates Sox17 expression and is essential for organizer and endoderm formation in the mouse," Development (Cambridge, England), vol. 140, no. 15, pp. 312838, Aug 2013, doi: 10.1242/dev.088765. [98] D. C. Hay et al., "Highly efficient differentiation of hESCs to functional hepatic endoderm requires ActivinA and Wnt3a signaling," (in eng), Proceedings of the National Academy of Sciences of the United States of America, vol. 105, no. 34, pp. 12301-6, Aug 26 2008, doi: 10.1073/pnas.0806522105. [99] P. Roelandt et al., "Human embryonic and rat adult stem cells with primitive endodermlike phenotype can be fated to definitive endoderm, and finally hepatocyte-like cells," PLoS One, vol. 5, no. 8, p. e12101, Aug 11 2010, doi: 10.1371/journal.pone.0012101. 112 | P a g e References [100] [101] [102] [103] [104] [105] [106] [107] [108] [109] [110] [111] [112] [113] [114] [115] J. G. Camp et al., "Multilineage communication regulates human liver bud development from pluripotency," (in eng), Nature, vol. 546, no. 7659, pp. 533-538, Jun 22 2017, doi: 10.1038/nature22796. A. G. Lade and S. P. Monga, "Beta-catenin signaling in hepatic development and progenitors: which way does the WNT blow?," Dev Dyn, vol. 240, no. 3, pp. 486-500, Mar 2011, doi: 10.1002/dvdy.22522. A. M. Zorn and J. M. Wells, "Vertebrate endoderm development and organ formation," Annual review of cell and developmental biology, vol. 25, pp. 221-51, 2009, doi: 10.1146/annurev.cellbio.042308.113344. S. Toivonen et al., "Activin A and Wnt-dependent specification of human definitive endoderm cells," Exp Cell Res, vol. 319, no. 17, pp. 2535-44, Oct 15 2013, doi: 10.1016/j.yexcr.2013.07.007. R. Siller, S. Greenhough, E. Naumovska, and G. J. Sullivan, "Small-molecule-driven hepatocyte differentiation of human pluripotent stem cells," Stem cell reports, vol. 4, no. 5, pp. 939-52, May 12 2015, doi: 10.1016/j.stemcr.2015.04.001. J. Jung, M. Zheng, M. Goldfarb, and K. S. Zaret, "Initiation of mammalian liver development from endoderm by fibroblast growth factors," (in eng), Science (New York, N.Y.), vol. 284, no. 5422, pp. 1998-2003, Jun 18 1999, doi: 10.1126/science.284.5422.1998. J. M. Rossi, N. R. Dunn, B. L. Hogan, and K. S. Zaret, "Distinct mesodermal signals, including BMPs from the septum transversum mesenchyme, are required in combination for hepatogenesis from the endoderm," Genes & development, vol. 15, no. 15, pp. 1998-2009, Aug 1 2001, doi: 10.1101/gad.904601. K. Si-Tayeb et al., "Highly efficient generation of human hepatocyte-like cells from induced pluripotent stem cells," (in eng), Hepatology (Baltimore, Md.), vol. 51, no. 1, pp. 297-305, Jan 2010, doi: 10.1002/hep.23354. L. K. Kanninen et al., "Laminin-511 and laminin-521-based matrices for efficient hepatic specification of human pluripotent stem cells," Biomaterials, vol. 103, pp. 86-100, Oct 2016, doi: 10.1016/j.biomaterials.2016.06.054. C. Du et al., "Highly efficient and expedited hepatic differentiation from human pluripotent stem cells by pure small-molecule cocktails," Stem Cell Res Ther, vol. 9, no. 1, p. 58, Mar 9 2018, doi: 10.1186/s13287-018-0794-4. F. Sampaziotis, M. C. de Brito, I. Geti, A. Bertero, N. R. Hannan, and L. Vallier, "Directed differentiation of human induced pluripotent stem cells into functional cholangiocyte-like cells," Nature protocols, vol. 12, no. 4, pp. 814-827, Apr 2017, doi: 10.1038/nprot.2017.011. S. Tamura, Y. Morikawa, A. Miyajima, and E. Senba, "Expression of oncostatin M in hematopoietic organs," (in eng), Dev Dyn, vol. 225, no. 3, pp. 327-31, Nov 2002, doi: 10.1002/dvdy.10156. A. Kamiya, T. Kinoshita, and A. Miyajima, "Oncostatin M and hepatocyte growth factor induce hepatic maturation via distinct signaling pathways," (in eng), FEBS Lett, vol. 492, no. 1-2, pp. 90-4, Mar 9 2001, doi: 10.1016/s0014-5793(01)02140-8. I. S. Behbahan et al., "New approaches in the differentiation of human embryonic stem cells and induced pluripotent stem cells toward hepatocytes," Stem Cell Rev Rep, vol. 7, no. 3, pp. 748-59, Sep 2011, doi: 10.1007/s12015-010-9216-4. T. Kinoshita et al., "Hepatic differentiation induced by oncostatin M attenuates fetal liver hematopoiesis," (in eng), Proceedings of the National Academy of Sciences of the United States of America, vol. 96, no. 13, pp. 7265-70, Jun 22 1999, doi: 10.1073/pnas.96.13.7265. C. C. Benoist et al., "The procognitive and synaptogenic effects of angiotensin IV-derived peptides are dependent on activation of the hepatocyte growth factor/c-met system," J 113 | P a g e References [116] [117] [118] [119] [120] [121] [122] [123] [124] [125] [126] [127] [128] [129] Pharmacol Exp Ther, vol. 351, no. 2, pp. 390-402, Nov 2014, doi: 10.1124/jpet.114.218735. G. K. Michalopoulos, W. C. Bowen, K. Mulè, and J. Luo, "HGF-, EGF-, and dexamethasoneinduced gene expression patterns during formation of tissue in hepatic organoid cultures," (in eng), Gene Expr, vol. 11, no. 2, pp. 55-75, 2003, doi: 10.3727/000000003108748964. S. S. Chen, W. Fitzgerald, J. Zimmerberg, H. K. Kleinman, and L. Margolis, "Cell-cell and cell-extracellular matrix interactions regulate embryonic stem cell differentiation," Stem cells (Dayton, Ohio), vol. 25, no. 3, pp. 553-61, Mar 2007, doi: 10.1634/stemcells.20060419. H. Wang, X. Luo, and J. Leighton, "Extracellular Matrix and Integrins in Embryonic Stem Cell Differentiation," Biochem Insights, vol. 8, no. Suppl 2, pp. 15-21, 2015, doi: 10.4137/BCI.S30377. E. Luce, A. Messina, J. C. Duclos-Vallee, and A. Dubart-Kupperschmitt, "Advanced Techniques and Awaited Clinical Applications for Human Pluripotent Stem Cell Differentiation into Hepatocytes," Hepatology (Baltimore, Md.), vol. 74, no. 2, pp. 11011116, Aug 2021, doi: 10.1002/hep.31705. J. C. Wong, S. Y. Gao, J. G. Lees, M. B. Best, R. Wang, and B. E. Tuch, "Definitive endoderm derived from human embryonic stem cells highly express the integrin receptors alphaV and beta5," Cell Adh Migr, vol. 4, no. 1, pp. 39-45, Jan-Mar 2010, doi: 10.4161/cam.4.1.10627. B. R. Lawton, J. A. Sosa, S. Roman, and D. S. Krause, "Effect of a matrigel sandwich on endodermal differentiation of human embryonic stem cells," (in eng), Stem Cell Rev Rep, vol. 9, no. 5, pp. 578-85, Oct 2013, doi: 10.1007/s12015-013-9447-2. T. Takebe et al., "Vascularized and functional human liver from an iPSC-derived organ bud transplant," (in eng), Nature, vol. 499, no. 7459, pp. 481-4, Jul 25 2013, doi: 10.1038/nature12271. T. Takebe et al., "Massive and Reproducible Production of Liver Buds Entirely from Human Pluripotent Stem Cells," Cell reports, vol. 21, no. 10, pp. 2661-2670, Dec 5 2017, doi: 10.1016/j.celrep.2017.11.005. W. Zhou, K. Graham, B. Lucendo-Villarin, O. Flint, D. C. Hay, and P. Bagnaninchi, "Combining stem cell-derived hepatocytes with impedance sensing to better predict human drug toxicity," (in eng), Expert Opin Drug Metab Toxicol, vol. 15, no. 1, pp. 77-83, Jan 2019, doi: 10.1080/17425255.2019.1558208. G. Holmgren et al., "Long-term chronic toxicity testing using human pluripotent stem cellderived hepatocytes," (in eng), Drug Metab Dispos, vol. 42, no. 9, pp. 1401-6, Sep 2014, doi: 10.1124/dmd.114.059154. J. H. Kim et al., "Enhanced Metabolizing Activity of Human ES Cell-Derived Hepatocytes Using a 3D Culture System with Repeated Exposures to Xenobiotics," Toxicol Sci, vol. 147, no. 1, pp. 190-206, Sep 2015, doi: 10.1093/toxsci/kfv121. R. L. Gieseck, 3rd et al., "Maturation of induced pluripotent stem cell derived hepatocytes by 3D-culture," PLoS One, vol. 9, no. 1, p. e86372, 2014, doi: 10.1371/journal.pone.0086372. J. L. Corbett and S. A. Duncan, "iPSC-Derived Hepatocytes as a Platform for Disease Modeling and Drug Discovery," Front Med (Lausanne), vol. 6, p. 265, 2019, doi: 10.3389/fmed.2019.00265. R. Jing et al., "A Screen Using iPSC-Derived Hepatocytes Reveals NAD(+) as a Potential Treatment for mtDNA Depletion Syndrome," Cell reports, vol. 25, no. 6, pp. 1469-1484 e5, Nov 6 2018, doi: 10.1016/j.celrep.2018.10.036. 114 | P a g e References [130] [131] [132] [133] [134] [135] [136] [137] [138] [139] [140] [141] [142] [143] [144] [145] [146] [147] S. M. Choi et al., "Efficient drug screening and gene correction for treating liver disease using patient-specific stem cells," Hepatology (Baltimore, Md.), vol. 57, no. 6, pp. 245868, Jun 2013, doi: 10.1002/hep.26237. A. Miyajima, M. Tanaka, and T. Itoh, "Stem/progenitor cells in liver development, homeostasis, regeneration, and reprogramming," Cell stem cell, vol. 14, no. 5, pp. 561-74, May 1 2014, doi: 10.1016/j.stem.2014.04.010. H. Abdellatif, "Oval Cells: Potential Role in Liver Regeneration," Biomedical Journal of Scientific & Technical Research, vol. 2, no. 1, 2017, doi: 10.26717/bjstr.2018.02.000665. S. J. Awan, M. T. Baig, F. Yaqub, A. Tayyeb, and G. Ali, "In vitro differentiated hepatic ovallike cells enhance hepatic regeneration in CCl(4) -induced hepatic injury," (in eng), Cell Biol Int, vol. 41, no. 1, pp. 51-61, Jan 2017, doi: 10.1002/cbin.10699. D. Campard, P. A. Lysy, M. Najimi, and E. M. Sokal, "Native umbilical cord matrix stem cells express hepatic markers and differentiate into hepatocyte-like cells," (in eng), Gastroenterology, vol. 134, no. 3, pp. 833-48, Mar 2008, doi: 10.1053/j.gastro.2007.12.024. S. Z. Lin et al., "Transplantation of human Wharton's Jelly-derived stem cells alleviates chemically induced liver fibrosis in rats," Cell Transplant, vol. 19, no. 11, pp. 1451-63, 2010, doi: 10.3727/096368910X514198. C. N. Shen, J. M. Slack, and D. Tosh, "Molecular basis of transdifferentiation of pancreas to liver," (in eng), Nature cell biology, vol. 2, no. 12, pp. 879-87, Dec 2000, doi: 10.1038/35046522. G. Song et al., "Direct Reprogramming of Hepatic Myofibroblasts into Hepatocytes In Vivo Attenuates Liver Fibrosis," Cell stem cell, vol. 18, no. 6, pp. 797-808, Jun 2 2016, doi: 10.1016/j.stem.2016.01.010. P. Huang et al., "Induction of functional hepatocyte-like cells from mouse fibroblasts by defined factors," Nature, vol. 475, no. 7356, pp. 386-9, May 11 2011, doi: 10.1038/nature10116. S. Sekiya and A. Suzuki, "Direct conversion of mouse fibroblasts to hepatocyte-like cells by defined factors," Nature, vol. 475, no. 7356, pp. 390-3, Jun 29 2011, doi: 10.1038/nature10263. Y. Liu, A. Beyer, and R. Aebersold, "On the Dependency of Cellular Protein Levels on mRNA Abundance," Cell, vol. 165, no. 3, pp. 535-50, Apr 21 2016, doi: 10.1016/j.cell.2016.03.014. A. D. Catherman, O. S. Skinner, and N. L. Kelleher, "Top Down proteomics: facts and perspectives," Biochem Biophys Res Commun, vol. 445, no. 4, pp. 683-93, Mar 21 2014, doi: 10.1016/j.bbrc.2014.02.041. M. S. Kim et al., "A draft map of the human proteome," Nature, vol. 509, no. 7502, pp. 575-81, May 29 2014, doi: 10.1038/nature13302. M. Wilhelm et al., "Mass-spectrometry-based draft of the human proteome," (in eng), Nature, vol. 509, no. 7502, pp. 582-7, May 29 2014, doi: 10.1038/nature13319. V. Mayya and D. K. Han, "Phosphoproteomics by mass spectrometry: insights, implications, applications and limitations," Expert Rev Proteomics, vol. 6, no. 6, pp. 60518, Dec 2009, doi: 10.1586/epr.09.84. M. Shehadul Islam, A. Aryasomayajula, and P. Selvaganapathy, "A Review on Macroscale and Microscale Cell Lysis Methods," Micromachines, vol. 8, no. 3, 2017, doi: 10.3390/mi8030083. A. K. Bhuyan, "On the mechanism of SDS-induced protein denaturation," Biopolymers, vol. 93, no. 2, pp. 186-99, Feb 2010, doi: 10.1002/bip.21318. K. L. Rundlett and D. W. Armstrong, "Mechanism of signal suppression by anionic surfactants in capillary electrophoresis-electrospray ionization mass spectrometry," (in eng), Analytical chemistry, vol. 68, no. 19, pp. 3493-7, Oct 1 1996, doi: 10.1021/ac960472p. 115 | P a g e References [148] [149] [150] [151] [152] [153] [154] [155] [156] [157] [158] [159] [160] [161] [162] [163] R. R. Loo, N. Dales, and P. C. Andrews, "Surfactant effects on protein structure examined by electrospray ionization mass spectrometry," (in eng), Protein Sci, vol. 3, no. 11, pp. 1975-83, Nov 1994, doi: 10.1002/pro.5560031109. A. Das and C. Mukhopadhyay, "Urea-mediated protein denaturation: a consensus view," (in eng), J Phys Chem B, vol. 113, no. 38, pp. 12816-24, Sep 24 2009, doi: 10.1021/jp906350s. L. Switzar, M. Giera, and W. M. Niessen, "Protein digestion: an overview of the available techniques and recent developments," Journal of proteome research, vol. 12, no. 3, pp. 1067-77, Mar 1 2013, doi: 10.1021/pr301201x. J. M. Burkhart, C. Schumbrutzki, S. Wortelkamp, A. Sickmann, and R. P. Zahedi, "Systematic and quantitative comparison of digest efficiency and specificity reveals the impact of trypsin quality on MS-based proteomics," (in eng), J Proteomics, vol. 75, no. 4, pp. 1454-62, Feb 2 2012, doi: 10.1016/j.jprot.2011.11.016. P. Giansanti, T. T. Aye, H. van den Toorn, M. Peng, B. van Breukelen, and A. J. Heck, "An Augmented Multiple-Protease-Based Human Phosphopeptide Atlas," Cell reports, vol. 11, no. 11, pp. 1834-43, Jun 23 2015, doi: 10.1016/j.celrep.2015.05.029. D. Wang et al., "A deep proteome and transcriptome abundance atlas of 29 healthy human tissues," Molecular systems biology, vol. 15, no. 2, p. e8503, Feb 18 2019, doi: 10.15252/msb.20188503. A. Shevchenko, H. Tomas, J. Havlis, J. V. Olsen, and M. Mann, "In-gel digestion for mass spectrometric characterization of proteins and proteomes," Nature protocols, vol. 1, no. 6, pp. 2856-60, 2006, doi: 10.1038/nprot.2006.468. J. Havlis and A. Shevchenko, "Absolute quantification of proteins in solutions and in polyacrylamide gels by mass spectrometry," (in eng), Analytical chemistry, vol. 76, no. 11, pp. 3029-36, Jun 1 2004, doi: 10.1021/ac035286f. M. HaileMariam et al., "S-Trap, an Ultrafast Sample-Preparation Approach for Shotgun Proteomics," Journal of proteome research, vol. 17, no. 9, pp. 2917-2924, Sep 7 2018, doi: 10.1021/acs.jproteome.8b00505. J. R. Wisniewski, A. Zougman, N. Nagaraj, and M. Mann, "Universal sample preparation method for proteome analysis," Nature methods, vol. 6, no. 5, pp. 359-62, May 2009, doi: 10.1038/nmeth.1322. N. A. Kulak, G. Pichler, I. Paron, N. Nagaraj, and M. Mann, "Minimal, encapsulated proteomic-sample processing applied to copy-number estimation in eukaryotic cells," (in eng), Nature methods, vol. 11, no. 3, pp. 319-24, Mar 2014, doi: 10.1038/nmeth.2834. M. Sielaff et al., "Evaluation of FASP, SP3, and iST Protocols for Proteomic Sample Preparation in the Low Microgram Range," (in eng), Journal of proteome research, Oct 11 2017, doi: 10.1021/acs.jproteome.7b00433. C. S. Hughes, S. Foehr, D. A. Garfield, E. E. Furlong, L. M. Steinmetz, and J. Krijgsveld, "Ultrasensitive proteome analysis using paramagnetic bead technology," (in eng), Molecular systems biology, vol. 10, p. 757, Oct 30 2014, doi: 10.15252/msb.20145625. T. S. Batth et al., "Protein Aggregation Capture on Microparticles Enables Multipurpose Proteomics Sample Preparation," Molecular & cellular proteomics : MCP, vol. 18, no. 5, pp. 1027-1035, May 2019, doi: 10.1074/mcp.TIR118.001270. T. Muller, M. Kalxdorf, R. Longuespee, D. N. Kazdal, A. Stenzinger, and J. Krijgsveld, "Automated sample preparation with SP3 for low-input clinical proteomics," Molecular systems biology, vol. 16, no. 1, p. e9111, Jan 2020, doi: 10.15252/msb.20199111. B. Ruprecht, D. Wang, R. Z. Chiozzi, L. H. Li, H. Hahne, and B. Kuster, "Hydrophilic Strong Anion Exchange (hSAX) Chromatography Enables Deep Fractionation of Tissue Proteomes," (in eng), Methods Mol Biol, vol. 1550, pp. 69-82, 2017, doi: 10.1007/978-14939-6747-6_7. 116 | P a g e References [164] [165] [166] [167] [168] [169] [170] [171] [172] [173] [174] [175] [176] [177] [178] J. Rappsilber, M. Mann, and Y. Ishihama, "Protocol for micro-purification, enrichment, pre-fractionation and storage of peptides for proteomics using StageTips," (in eng), Nature protocols, vol. 2, no. 8, pp. 1896-906, 2007, doi: 10.1038/nprot.2007.261. B. Manadas, V. M. Mendes, J. English, and M. J. Dunn, "Peptide fractionation in proteomics approaches," Expert Rev Proteomics, vol. 7, no. 5, pp. 655-63, Oct 2010, doi: 10.1586/epr.10.46. M. Gilar, P. Olivova, A. E. Daly, and J. C. Gebler, "Two-dimensional separation of peptides using RP-RP-HPLC system with different pH in first and second separation dimensions," (in eng), J Sep Sci, vol. 28, no. 14, pp. 1694-703, Sep 2005, doi: 10.1002/jssc.200500116. M. S. Ritorto, K. Cook, K. Tyagi, P. G. Pedrioli, and M. Trost, "Hydrophilic strong anion exchange (hSAX) chromatography for highly orthogonal peptide separation of complex proteomes," Journal of proteome research, vol. 12, no. 6, pp. 2449-57, Jun 7 2013, doi: 10.1021/pr301011r. S. Di Palma, P. J. Boersema, A. J. Heck, and S. Mohammed, "Zwitterionic hydrophilic interaction liquid chromatography (ZIC-HILIC and ZIC-cHILIC) provide high resolution separation and increase sensitivity in proteome analysis," Analytical chemistry, vol. 83, no. 9, pp. 3440-7, May 1 2011, doi: 10.1021/ac103312e. Y. Ishihama, J. Rappsilber, and M. Mann, "Modular stop and go extraction tips with stacked disks for parallel and multidimensional Peptide fractionation in proteomics," (in eng), Journal of proteome research, vol. 5, no. 4, pp. 988-94, Apr 2006, doi: 10.1021/pr050385q. P. Yu et al., "Trimodal Mixed Mode Chromatography That Enables Efficient Offline TwoDimensional Peptide Fractionation for Proteome Analysis," (in eng), Analytical chemistry, vol. 89, no. 17, pp. 8884-8891, Sep 5 2017, doi: 10.1021/acs.analchem.7b01356. B. Macek, M. Mann, and J. V. Olsen, "Global and site-specific quantitative phosphoproteomics: principles and applications," Annu Rev Pharmacol Toxicol, vol. 49, pp. 199-221, 2009, doi: 10.1146/annurev.pharmtox.011008.145606. L. Andersson and J. Porath, "Isolation of phosphoproteins by immobilized metal (Fe3+) affinity chromatography," (in eng), Anal Biochem, vol. 154, no. 1, pp. 250-4, Apr 1986, doi: 10.1016/0003-2697(86)90523-3. B. Ruprecht, H. Koch, P. Domasinska, M. Frejno, B. Kuster, and S. Lemeer, "Optimized Enrichment of Phosphoproteomes by Fe-IMAC Column Chromatography," (in eng), Methods Mol Biol, vol. 1550, pp. 47-60, 2017, doi: 10.1007/978-1-4939-6747-6_5. P. Mertins et al., "Reproducible workflow for multiplexed deep-scale proteome and phosphoproteome analysis of tumor tissues by liquid chromatography-mass spectrometry," Nature protocols, vol. 13, no. 7, pp. 1632-1661, Jul 2018, doi: 10.1038/s41596-018-0006-9. B. Ruprecht, H. Koch, G. Medard, M. Mundt, B. Kuster, and S. Lemeer, "Comprehensive and reproducible phosphopeptide enrichment using iron immobilized metal ion affinity chromatography (Fe-IMAC) columns," Molecular & cellular proteomics : MCP, vol. 14, no. 1, pp. 205-15, Jan 2015, doi: 10.1074/mcp.M114.043109. H. Post et al., "Robust, Sensitive, and Automated Phosphopeptide Enrichment Optimized for Low Sample Amounts Applied to Primary Hippocampal Neurons," Journal of proteome research, vol. 16, no. 2, pp. 728-737, Feb 3 2017, doi: 10.1021/acs.jproteome.6b00753. J. R. Murillo, M. Kuras, M. Rezeli, T. Miliotis, L. Betancourt, and G. Marko-Varga, "Automated phosphopeptide enrichment from minute quantities of frozen malignant melanoma tissue," PLoS One, vol. 13, no. 12, p. e0208562, 2018, doi: 10.1371/journal.pone.0208562. I. Arribas Diez, I. Govender, P. Naicker, S. Stoychev, J. Jordaan, and O. N. Jensen, "Zirconium(IV)-IMAC Revisited: Improved Performance and Phosphoproteome Coverage 117 | P a g e References [179] [180] [181] [182] [183] [184] [185] [186] [187] [188] [189] [190] [191] [192] [193] [194] by Magnetic Microparticles for Phosphopeptide Affinity Enrichment," Journal of proteome research, vol. 20, no. 1, pp. 453-462, Jan 1 2021, doi: 10.1021/acs.jproteome.0c00508. L. Liu et al., "p53 sites acetylated in vitro by PCAF and p300 are acetylated in vivo in response to DNA damage," (in eng), Mol Cell Biol, vol. 19, no. 2, pp. 1202-9, Feb 1999, doi: 10.1128/mcb.19.2.1202. J. Boyes, P. Byfield, Y. Nakatani, and V. Ogryzko, "Regulation of activity of the transcription factor GATA-1 by acetylation," (in eng), Nature, vol. 396, no. 6711, pp. 594-8, Dec 10 1998, doi: 10.1038/25166. M. Esmaeili, S. A. Blythe, J. W. Tobias, K. Zhang, J. Yang, and P. S. Klein, "Chromatin accessibility and histone acetylation in the regulation of competence in early development," (in eng), Dev Biol, vol. 462, no. 1, pp. 20-35, Jun 1 2020, doi: 10.1016/j.ydbio.2020.02.013. N. Z. Saraiva, C. S. Oliveira, and J. M. Garcia, "Histone acetylation and its role in embryonic stem cell differentiation," World J Stem Cells, vol. 2, no. 6, pp. 121-6, Dec 26 2010, doi: 10.4252/wjsc.v2.i6.121. S. Legartová et al., "Cell differentiation along multiple pathways accompanied by changes in histone acetylation status," (in eng), Biochem Cell Biol, vol. 92, no. 2, pp. 85-93, Apr 2014, doi: 10.1139/bcb-2013-0082. T. Kawamura et al., "Acetylation of GATA-4 is involved in the differentiation of embryonic stem cells into cardiac myocytes," J Biol Chem, vol. 280, no. 20, pp. 19682-8, May 20 2005, doi: 10.1074/jbc.M412428200. C. Choudhary et al., "Lysine acetylation targets protein complexes and co-regulates major cellular functions," (in eng), Science (New York, N.Y.), vol. 325, no. 5942, pp. 834-40, Aug 14 2009, doi: 10.1126/science.1175371. S. C. Kim et al., "Substrate and functional diversity of lysine acetylation revealed by a proteomics survey," Mol Cell, vol. 23, no. 4, pp. 607-18, Aug 2006, doi: 10.1016/j.molcel.2006.06.026. T. Svinkina et al., "Deep, Quantitative Coverage of the Lysine Acetylome Using Novel Antiacetyl-lysine Antibodies and an Optimized Proteomic Workflow," Molecular & cellular proteomics : MCP, vol. 14, no. 9, pp. 2429-40, Sep 2015, doi: 10.1074/mcp.O114.047555. H. Steen and M. Mann, "The ABC's (and XYZ's) of peptide sequencing," (in eng), Nat Rev Mol Cell Biol, vol. 5, no. 9, pp. 699-711, Sep 2004, doi: 10.1038/nrm1468. A. Ducret, I. Van Oostveen, J. K. Eng, J. R. Yates, 3rd, and R. Aebersold, "High throughput protein characterization by automated reverse-phase chromatography/electrospray tandem mass spectrometry," (in eng), Protein Sci, vol. 7, no. 3, pp. 706-19, Mar 1998, doi: 10.1002/pro.5560070320. A. Makarov and M. Scigelova, "Coupling liquid chromatography to Orbitrap mass spectrometry," J Chromatogr A, vol. 1217, no. 25, pp. 3938-45, Jun 18 2010, doi: 10.1016/j.chroma.2010.02.022. A. B. Chakraborty and S. J. Berger, "Optimization of reversed-phase peptide liquid chromatography ultraviolet mass spectrometry analyses using an automated blending methodology," (in eng), J Biomol Tech, vol. 16, no. 4, pp. 327-35, Dec 2005. A. J. Link et al., "Direct analysis of protein complexes using mass spectrometry," (in eng), Nat Biotechnol, vol. 17, no. 7, pp. 676-82, Jul 1999, doi: 10.1038/10890. Y. Bian et al., "Robust Microflow LC-MS/MS for Proteome Analysis: 38000 Runs and Counting," Analytical chemistry, vol. 93, no. 8, pp. 3686-3690, Mar 2 2021, doi: 10.1021/acs.analchem.1c00257. Y. Bian et al., "Robust, reproducible and quantitative analysis of thousands of proteomes by micro-flow LC-MS/MS," Nature communications, vol. 11, no. 1, p. 157, Jan 9 2020, doi: 10.1038/s41467-019-13973-x. 118 | P a g e References [195] [196] [197] [198] [199] [200] [201] [202] [203] [204] [205] [206] [207] [208] [209] [210] [211] [212] M. Wilm, "Principles of electrospray ionization," Molecular & cellular proteomics : MCP, vol. 10, no. 7, p. M111 009407, Jul 2011, doi: 10.1074/mcp.M111.009407. A. Gomez and K. Tang, "Charge and fission of droplets in electrostatic sprays," Physics of Fluids, vol. 6, no. 1, pp. 404-414, 1994, doi: 10.1063/1.868037. T. G., "Disintegration of Water Drops in an Electric Field," Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, vol. 280, no. 1382, pp. 383-397, 1964, doi: 10.1098/rspa.1964.0151. B. A. Thomson and J. V. Iribarne, "Field induced ion evaporation from liquid surfaces at atmospheric pressure," The Journal of Chemical Physics, vol. 71, no. 11, pp. 4451-4463, 1979, doi: 10.1063/1.438198. J. V. Iribarne, "On the evaporation of small ions from charged droplets," The Journal of Chemical Physics, vol. 64, no. 6, 1976, doi: 10.1063/1.432536. M. Dole, L. L. Mack, R. L. Hines, R. C. Mobley, L. D. Ferguson, and M. B. Alice, "Molecular Beams of Macroions," The Journal of Chemical Physics, vol. 49, no. 5, pp. 2240-2249, 1968, doi: 10.1063/1.1670391. J. P. Savaryn, T. K. Toby, and N. L. Kelleher, "A researcher's guide to mass spectrometrybased proteomics," Proteomics, vol. 16, no. 18, pp. 2435-43, Sep 2016, doi: 10.1002/pmic.201600113. D. J. Douglas, A. J. Frank, and D. Mao, "Linear ion traps in mass spectrometry," (in eng), Mass spectrometry reviews, vol. 24, no. 1, pp. 1-29, Jan-Feb 2005, doi: 10.1002/mas.20004. M. Scigelova and A. Makarov, "Orbitrap mass analyzer--overview and applications in proteomics," (in eng), Proteomics, vol. 6 Suppl 2, pp. 16-21, Sep 2006, doi: 10.1002/pmic.200600528. Q. Hu, R. J. Noll, H. Li, A. Makarov, M. Hardman, and R. Graham Cooks, "The Orbitrap: a new mass spectrometer," (in eng), J Mass Spectrom, vol. 40, no. 4, pp. 430-43, Apr 2005, doi: 10.1002/jms.856. M. W. Senko et al., "Novel parallelized quadrupole/linear ion trap/Orbitrap tribrid mass spectrometer improving proteome coverage and peptide identification rates," Analytical chemistry, vol. 85, no. 24, pp. 11710-4, Dec 17 2013, doi: 10.1021/ac403115c. J. Seidler, N. Zinn, M. E. Boehm, and W. D. Lehmann, "De novo sequencing of peptides by MS/MS," Proteomics, vol. 10, no. 4, pp. 634-49, Feb 2010, doi: 10.1002/pmic.200900459. A. Devabhaktuni and J. E. Elias, "Application of de Novo Sequencing to Large-Scale Complex Proteomics Data Sets," Journal of proteome research, vol. 15, no. 3, pp. 732-42, Mar 4 2016, doi: 10.1021/acs.jproteome.5b00861. J. Allmer, "Algorithms for the de novo sequencing of peptides from tandem mass spectra," (in eng), Expert Rev Proteomics, vol. 8, no. 5, pp. 645-57, Oct 2011, doi: 10.1586/epr.11.54. J. Cox and M. Mann, "MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification," Nat Biotechnol, vol. 26, no. 12, pp. 1367-72, Dec 2008, doi: 10.1038/nbt.1511. J. Cox, N. Neuhauser, A. Michalski, R. A. Scheltema, J. V. Olsen, and M. Mann, "Andromeda: a peptide search engine integrated into the MaxQuant environment," Journal of proteome research, vol. 10, no. 4, pp. 1794-805, Apr 1 2011, doi: 10.1021/pr101065j. J. E. Elias and S. P. Gygi, "Target-decoy search strategy for mass spectrometry-based proteomics," Methods Mol Biol, vol. 604, pp. 55-71, 2010, doi: 10.1007/978-1-60761-4449_5. J. V. Olsen et al., "Global, in vivo, and site-specific phosphorylation dynamics in signaling networks," Cell, vol. 127, no. 3, pp. 635-48, Nov 3 2006, doi: 10.1016/j.cell.2006.09.026. 119 | P a g e References [213] [214] [215] [216] [217] [218] [219] [220] [221] [222] [223] [224] [225] [226] [227] [228] [229] M. M. Savitski et al., "Confident phosphorylation site localization using the Mascot Delta Score," Molecular & cellular proteomics : MCP, vol. 10, no. 2, p. M110 003830, Feb 2011, doi: 10.1074/mcp.M110.003830. J. Grossmann et al., "Implementation and evaluation of relative and absolute quantification in shotgun proteomics with label-free methods," (in eng), J Proteomics, vol. 73, no. 9, pp. 1740-6, Aug 5 2010, doi: 10.1016/j.jprot.2010.05.011. J. Cox, M. Y. Hein, C. A. Luber, I. Paron, N. Nagaraj, and M. Mann, "Accurate proteomewide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ," (in eng), Molecular & cellular proteomics : MCP, vol. 13, no. 9, pp. 2513-26, Sep 2014, doi: 10.1074/mcp.M113.031591. A. B. Arul and R. A. S. Robinson, "Sample Multiplexing Strategies in Quantitative Proteomics," Analytical chemistry, vol. 91, no. 1, pp. 178-189, Jan 2 2019, doi: 10.1021/acs.analchem.8b05626. J. Li et al., "TMTpro-18plex: The Expanded and Complete Set of TMTpro Reagents for Sample Multiplexing," Journal of proteome research, vol. 20, no. 5, pp. 2964-2972, May 7 2021, doi: 10.1021/acs.jproteome.1c00168. N. A. Karp, W. Huber, P. G. Sadowski, P. D. Charles, S. V. Hester, and K. S. Lilley, "Addressing accuracy and precision issues in iTRAQ quantitation," Molecular & cellular proteomics : MCP, vol. 9, no. 9, pp. 1885-97, Sep 2010, doi: 10.1074/mcp.M900628-MCP200. M. M. Savitski et al., "Measuring and managing ratio compression for accurate iTRAQ/TMT quantification," Journal of proteome research, vol. 12, no. 8, pp. 3586-98, Aug 2 2013, doi: 10.1021/pr400098r. M. M. Savitski et al., "Delayed fragmentation and optimized isolation width settings for improvement of protein identification and accuracy of isobaric mass tag quantification on Orbitrap-type mass spectrometers," Analytical chemistry, vol. 83, no. 23, pp. 8959-67, Dec 1 2011, doi: 10.1021/ac201760x. C. D. Wenger et al., "Gas-phase purification enables accurate, multiplexed proteome quantification with isobaric tagging," Nature methods, vol. 8, no. 11, pp. 933-5, Oct 2 2011, doi: 10.1038/nmeth.1716. M. Wuhr et al., "Accurate multiplexed proteomics at the MS2 level using the complement reporter ion cluster," Analytical chemistry, vol. 84, no. 21, pp. 9214-21, Nov 6 2012, doi: 10.1021/ac301962s. L. Ting, R. Rad, S. P. Gygi, and W. Haas, "MS3 eliminates ratio distortion in isobaric multiplexed quantitative proteomics," Nature methods, vol. 8, no. 11, pp. 937-40, Oct 2 2011, doi: 10.1038/nmeth.1714. G. C. McAlister et al., "MultiNotch MS3 enables accurate, sensitive, and multiplexed detection of differential expression across cancer cell line proteomes," Analytical chemistry, vol. 86, no. 14, pp. 7150-8, Jul 15 2014, doi: 10.1021/ac502040v. I. Virant-Klun, S. Leicht, C. Hughes, and J. Krijgsveld, "Identification of Maturation-Specific Proteins by Single-Cell Proteomics of Human Oocytes," (in eng), Molecular & cellular proteomics : MCP, vol. 15, no. 8, pp. 2616-27, Aug 2016, doi: 10.1074/mcp.M115.056887. P. Cohen, "The origins of protein phosphorylation," (in eng), Nature cell biology, vol. 4, no. 5, pp. E127-30, May 2002, doi: 10.1038/ncb0502-e127. H. Zhou et al., "Zirconium phosphonate-modified porous silicon for highly specific capture of phosphopeptides and MALDI-TOF MS analysis," (in eng), Journal of proteome research, vol. 5, no. 9, pp. 2431-7, Sep 2006, doi: 10.1021/pr060162f. H. Zhou et al., "Specific phosphopeptide enrichment with immobilized titanium ion affinity chromatography adsorbent for phosphoproteome analysis," (in eng), Journal of proteome research, vol. 7, no. 9, pp. 3957-67, Sep 2008, doi: 10.1021/pr800223m. T. Y. Low, M. A. Mohtar, P. Y. Lee, N. Omar, H. Zhou, and M. Ye, "WIDENING THE BOTTLENECK OF PHOSPHOPROTEOMICS: EVOLVING STRATEGIES FOR PHOSPHOPEPTIDE 120 | P a g e References [230] [231] [232] [233] [234] [235] [236] [237] [238] [239] [240] [241] [242] [243] ENRICHMENT," (in eng), Mass spectrometry reviews, vol. 40, no. 4, pp. 309-333, Jul 2021, doi: 10.1002/mas.21636. S. J. Humphrey, O. Karayel, D. E. James, and M. Mann, "High-throughput and highsensitivity phosphoproteomics with the EasyPhos platform," Nature protocols, vol. 13, no. 9, pp. 1897-1916, 2018, doi: 10.1038/s41596-018-0014-9. S. J. Humphrey, S. B. Azimifar, and M. Mann, "High-throughput phosphoproteomics reveals in vivo insulin signaling dynamics," Nature Biotechnology, vol. 33, no. 9, pp. 990995, 2015, doi: 10.1038/nbt.3327. W. Chen, L. Chen, and R. Tian, "An integrated strategy for highly sensitive phosphoproteome analysis from low micrograms of protein samples," Analyst, vol. 143, no. 15, pp. 3693-3701, Jul 23 2018, doi: 10.1039/c8an00792f. W. Chen et al., "Simple and Integrated Spintip-Based Technology Applied for Deep Proteome Profiling," (in eng), Analytical chemistry, vol. 88, no. 9, pp. 4864-71, May 3 2016, doi: 10.1021/acs.analchem.6b00631. J. Zecha et al., "TMT Labeling for the Masses: A Robust and Cost-efficient, In-solution Labeling Approach," (in eng), Molecular & cellular proteomics : MCP, vol. 18, no. 7, pp. 1468-1478, Jul 2019, doi: 10.1074/mcp.TIR119.001385. C. S. Hughes, S. Moggridge, T. Muller, P. H. Sorensen, G. B. Morin, and J. Krijgsveld, "Singlepot, solid-phase-enhanced sample preparation for proteomics experiments," Nature protocols, vol. 14, no. 1, pp. 68-85, Jan 2019, doi: 10.1038/s41596-018-0082-x. B. Ruprecht, J. Zecha, D. P. Zolg, and B. Kuster, "High pH Reversed-Phase Micro-Columns for Simple, Sensitive, and Efficient Fractionation of Proteome and (TMT labeled) Phosphoproteome Digests," (in eng), Methods Mol Biol, vol. 1550, pp. 83-98, 2017, doi: 10.1007/978-1-4939-6747-6_8. S. Tyanova, T. Temu, and J. Cox, "The MaxQuant computational platform for mass spectrometry-based shotgun proteomics," (in eng), Nature protocols, vol. 11, no. 12, pp. 2301-2319, Dec 2016, doi: 10.1038/nprot.2016.136. S. Tyanova et al., "The Perseus computational platform for comprehensive analysis of (prote)omics data," Nature methods, vol. 13, no. 9, pp. 731-40, Sep 2016, doi: 10.1038/nmeth.3901. J. Cox and M. Mann, "1D and 2D annotation enrichment: a statistical method integrating quantitative proteomics with complementary high-throughput data," (in eng), BMC Bioinformatics, vol. 13 Suppl 16, no. Suppl 16, p. S12, 2012, doi: 10.1186/1471-2105-13s16-s12. S. Moggridge, P. H. Sorensen, G. B. Morin, and C. S. Hughes, "Extending the Compatibility of the SP3 Paramagnetic Bead Processing Approach for Proteomics," (in eng), Journal of proteome research, vol. 17, no. 4, pp. 1730-1740, Apr 6 2018, doi: 10.1021/acs.jproteome.7b00913. H. Steen, J. A. Jebanathirajah, J. Rush, N. Morrice, and M. W. Kirschner, "Phosphorylation analysis by mass spectrometry: myths, facts, and the consequences for qualitative and quantitative measurements," Molecular & cellular proteomics : MCP, vol. 5, no. 1, pp. 17281, Jan 2006, doi: 10.1074/mcp.M500135-MCP200. M. Marcantonio, M. Trost, M. Courcelles, M. Desjardins, and P. Thibault, "Combined enzymatic and data mining approaches for comprehensive phosphoproteome analyses: application to cell signaling events of interferon-gamma-stimulated macrophages," Molecular & cellular proteomics : MCP, vol. 7, no. 4, pp. 645-60, Apr 2008, doi: 10.1074/mcp.M700383-MCP200. Q. R. Li, Z. B. Ning, J. S. Tang, S. Nie, and R. Zeng, "Effect of peptide-to-TiO2 beads ratio on phosphopeptide enrichment selectivity," (in eng), Journal of proteome research, vol. 8, no. 11, pp. 5375-81, Nov 2009, doi: 10.1021/pr900659n. 121 | P a g e References [244] [245] [246] [247] [248] [249] [250] [251] [252] [253] [254] [255] [256] [257] [258] [259] [260] C. F. Tsai et al., "Immobilized metal affinity chromatography revisited: pH/acid control toward high selectivity in phosphoproteomics," (in eng), Journal of proteome research, vol. 7, no. 9, pp. 4058-69, Sep 2008, doi: 10.1021/pr800364d. M. Leutert, R. A. Rodriguez-Mias, N. K. Fukuda, and J. Villen, "R2-P2 rapid-robotic phosphoproteomics enables multidimensional cell signaling studies," Molecular systems biology, vol. 15, no. 12, p. e9021, Dec 2019, doi: 10.15252/msb.20199021. C. J. Tape et al., "Reproducible automated phosphopeptide enrichment using magnetic TiO2 and Ti-IMAC," Analytical chemistry, vol. 86, no. 20, pp. 10296-302, Oct 21 2014, doi: 10.1021/ac5025842. J. Krumm et al., "High temporal resolution proteome and phosphoproteome profiling of stem cell-derived hepatocyte development," (in eng), Cell reports, vol. 38, no. 13, p. 110604, Mar 29 2022, doi: 10.1016/j.celrep.2022.110604. P. W. Burridge and E. T. Zambidis, "Highly efficient directed differentiation of human induced pluripotent stem cells into cardiomyocytes," (in eng), Methods Mol Biol, vol. 997, pp. 149-61, 2013, doi: 10.1007/978-1-62703-348-0_12. R. S. Kadzik and E. E. Morrisey, "Directing lung endoderm differentiation in pluripotent stem cells," (in eng), Cell stem cell, vol. 10, no. 4, pp. 355-61, Apr 6 2012, doi: 10.1016/j.stem.2012.03.013. J. Tchieu et al., "A Modular Platform for Differentiation of Human PSCs into All Major Ectodermal Lineages," (in eng), Cell stem cell, vol. 21, no. 3, pp. 399-410.e7, Sep 7 2017, doi: 10.1016/j.stem.2017.08.015. J. R. Spence et al., "Directed differentiation of human pluripotent stem cells into intestinal tissue in vitro," (in eng), Nature, vol. 470, no. 7332, pp. 105-9, Feb 3 2011, doi: 10.1038/nature09691. Y. Kunisada, N. Tsubooka-Yamazoe, M. Shoji, and M. Hosoya, "Small molecules induce efficient differentiation into insulin-producing cells from human induced pluripotent stem cells," (in eng), Stem Cell Res, vol. 8, no. 2, pp. 274-84, Mar 2012, doi: 10.1016/j.scr.2011.10.002. K. Sekine et al., "Generation of human induced pluripotent stem cell-derived liver buds with chemically defined and animal origin-free media," Scientific reports, vol. 10, no. 1, p. 17937, Oct 21 2020, doi: 10.1038/s41598-020-73908-1. D. Vyas et al., "Self-assembled liver organoids recapitulate hepatobiliary organogenesis in vitro," Hepatology (Baltimore, Md.), vol. 67, no. 2, pp. 750-761, Feb 2018, doi: 10.1002/hep.29483. L. T. Ang et al., "A Roadmap for Human Liver Differentiation from Pluripotent Stem Cells," (in eng), Cell reports, vol. 22, no. 8, pp. 2190-2205, Feb 20 2018, doi: 10.1016/j.celrep.2018.01.087. K. T. Rigbolt et al., "System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation," (in eng), Science signaling, vol. 4, no. 164, p. rs3, Mar 15 2011, doi: 10.1126/scisignal.2001570. D. van Hoof, J. Krijgsveld, and C. Mummery, "Proteomic analysis of stem cell differentiation and early development," (in eng), Cold Spring Harbor perspectives in biology, vol. 4, no. 3, Mar 1 2012, doi: 10.1101/cshperspect.a008177. M. Baxter et al., "Phenotypic and functional analyses show stem cell-derived hepatocytelike cells better mimic fetal rather than adult hepatocytes," (in eng), Journal of hepatology, vol. 62, no. 3, pp. 581-9, Mar 2015, doi: 10.1016/j.jhep.2014.10.016. C. Rowe et al., "Proteome-wide analyses of human hepatocytes during differentiation and dedifferentiation," (in eng), Hepatology (Baltimore, Md.), vol. 58, no. 2, pp. 799-809, Aug 2013, doi: 10.1002/hep.26414. N. Takayama et al., "Transient activation of c-MYC expression is critical for efficient platelet generation from human induced pluripotent stem cells," (in eng), The Journal of 122 | P a g e References [261] [262] [263] [264] [265] [266] [267] [268] [269] [270] [271] [272] [273] [274] [275] [276] [277] [278] experimental medicine, vol. 207, no. 13, pp. 2817-30, Dec 20 2010, doi: 10.1084/jem.20100844. W. E. Johnson, C. Li, and A. Rabinovic, "Adjusting batch effects in microarray expression data using empirical Bayes methods," Biostatistics, vol. 8, no. 1, pp. 118-27, Jan 2007, doi: 10.1093/biostatistics/kxj037. G. Yu and Q. Y. He, "ReactomePA: an R/Bioconductor package for reactome pathway analysis and visualization," (in eng), Mol Biosyst, vol. 12, no. 2, pp. 477-9, Feb 2016, doi: 10.1039/c5mb00663e. H. Han et al., "TRRUST v2: an expanded reference database of human and mouse transcriptional regulatory interactions," Nucleic Acids Res, vol. 46, no. D1, pp. D380-D386, Jan 4 2018, doi: 10.1093/nar/gkx1013. H. Horn et al., "KinomeXplorer: an integrated platform for kinome biology studies," Nature methods, vol. 11, no. 6, pp. 603-4, Jun 2014, doi: 10.1038/nmeth.2968. R. Linding et al., "Systematic discovery of in vivo phosphorylation networks," (in eng), Cell, vol. 129, no. 7, pp. 1415-26, Jun 29 2007, doi: 10.1016/j.cell.2007.05.052. P. N. Ghule et al., "Reprogramming the pluripotent cell cycle: restoration of an abbreviated G1 phase in human induced pluripotent stem (iPS) cells," J Cell Physiol, vol. 226, no. 5, pp. 1149-56, May 2011, doi: 10.1002/jcp.22440. K. A. Becker, J. L. Stein, J. B. Lian, A. J. van Wijnen, and G. S. Stein, "Human embryonic stem cells are pre-mitotically committed to self-renewal and acquire a lengthened G1 phase upon lineage programming," J Cell Physiol, vol. 222, no. 1, pp. 103-10, Jan 2010, doi: 10.1002/jcp.21925. C. Hindley and A. Philpott, "The cell cycle and pluripotency," Biochem J, vol. 451, no. 2, pp. 135-43, Apr 15 2013, doi: 10.1042/BJ20121627. R. de Sousa Abreu, L. O. Penalva, E. M. Marcotte, and C. Vogel, "Global signatures of protein and mRNA expression levels," Mol Biosyst, vol. 5, no. 12, pp. 1512-26, Dec 2009, doi: 10.1039/b908315d. S. Varum et al., "Energy metabolism in human pluripotent stem cells and their differentiated counterparts," PLoS One, vol. 6, no. 6, p. e20914, 2011, doi: 10.1371/journal.pone.0020914. O. Warburg, F. Wind, and E. Negelein, "THE METABOLISM OF TUMORS IN THE BODY," (in eng), J Gen Physiol, vol. 8, no. 6, pp. 519-30, Mar 7 1927, doi: 10.1085/jgp.8.6.519. D. Bausch-Fluck et al., "A mass spectrometric-derived cell surface protein atlas," PLoS One, vol. 10, no. 3, p. e0121314, 2015, doi: 10.1371/journal.pone.0121314. Y. A. Medvedeva et al., "EpiFactors: a comprehensive database of human epigenetic factors and complexes," Database (Oxford), vol. 2015, p. bav067, 2015, doi: 10.1093/database/bav067. R. Liang et al., "Restraining Lysosomal Activity Preserves Hematopoietic Stem Cell Quiescence and Potency," Cell stem cell, vol. 26, no. 3, pp. 359-376 e7, Mar 5 2020, doi: 10.1016/j.stem.2020.01.013. S. Mahanty et al., "Keratinocyte differentiation promotes ER stress-dependent lysosome biogenesis," Cell Death & Disease, vol. 10, no. 4, 2019, doi: 10.1038/s41419-019-1478-4. R. E. Lawrence and R. Zoncu, "The lysosome as a cellular centre for signalling, metabolism and quality control," Nature cell biology, vol. 21, no. 2, pp. 133-142, Feb 2019, doi: 10.1038/s41556-018-0244-7. S. M. Ferguson, "Beyond indigestion: emerging roles for lysosome-based signaling in human disease," Curr Opin Cell Biol, vol. 35, pp. 59-68, Aug 2015, doi: 10.1016/j.ceb.2015.04.014. T. Berg et al., "Fibroblast growth factor 10 is critical for liver growth during embryogenesis and controls hepatoblast survival via beta-catenin activation," Hepatology (Baltimore, Md.), vol. 46, no. 4, pp. 1187-97, Oct 2007, doi: 10.1002/hep.21814. 123 | P a g e References [279] [280] [281] [282] [283] [284] [285] [286] [287] [288] [289] [290] [291] [292] [293] [294] [295] A. Drazic, L. M. Myklebust, R. Ree, and T. Arnesen, "The world of protein acetylation," (in eng), Biochim Biophys Acta, vol. 1864, no. 10, pp. 1372-401, Oct 2016, doi: 10.1016/j.bbapap.2016.06.007. J. E. Choi and R. Mostoslavsky, "Sirtuins, metabolism, and DNA repair," Curr Opin Genet Dev, vol. 26, pp. 24-32, Jun 2014, doi: 10.1016/j.gde.2014.05.005. Y. Cha et al., "Metabolic control of primed human pluripotent stem cell fate and function by the miR-200c-SIRT2 axis," (in eng), Nature cell biology, vol. 19, no. 5, pp. 445-456, May 2017, doi: 10.1038/ncb3517. B. S. Kim, C. H. Lee, G. E. Chang, E. Cheong, and I. Shin, "A potent and selective small molecule inhibitor of sirtuin 1 promotes differentiation of pluripotent P19 cells into functional neurons," Scientific reports, vol. 6, p. 34324, Sep 29 2016, doi: 10.1038/srep34324. T. Tobita et al., "SIRT1 Disruption in Human Fetal Hepatocytes Leads to Increased Accumulation of Glucose and Lipids," PLoS One, vol. 11, no. 2, p. e0149344, 2016, doi: 10.1371/journal.pone.0149344. B. Hu et al., "Repression of SIRT1 promotes the differentiation of mouse induced pluripotent stem cells into neural stem cells," (in eng), Cell Mol Neurobiol, vol. 34, no. 6, pp. 905-12, Aug 2014, doi: 10.1007/s10571-014-0071-8. Y. Xu et al., "Oxidative stress activates SIRT2 to deacetylate and stimulate phosphoglycerate mutase," Cancer Res, vol. 74, no. 13, pp. 3630-42, Jul 1 2014, doi: 10.1158/0008-5472.CAN-13-3615. X. Si et al., "Activation of GSK3beta by Sirt2 is required for early lineage commitment of mouse embryonic stem cell," PLoS One, vol. 8, no. 10, p. e76699, 2013, doi: 10.1371/journal.pone.0076699. Y. L. Lee et al., "Sirtuin 1 facilitates generation of induced pluripotent stem cells from mouse embryonic fibroblasts through the miR-34a and p53 pathways," PLoS One, vol. 7, no. 9, p. e45633, 2012, doi: 10.1371/journal.pone.0045633. V. Calvanese et al., "Sirtuin 1 regulation of developmental genes during differentiation of stem cells," Proceedings of the National Academy of Sciences of the United States of America, vol. 107, no. 31, pp. 13736-41, Aug 3 2010, doi: 10.1073/pnas.1001399107. M. J. Gómez-Lechón, L. Tolosa, I. Conde, and M. T. Donato, "Competency of different cell models to predict human hepatotoxic drugs," (in eng), Expert Opin Drug Metab Toxicol, vol. 10, no. 11, pp. 1553-68, Nov 2014, doi: 10.1517/17425255.2014.967680. J. Basta and M. Rauchman, "The nucleosome remodeling and deacetylase complex in development and disease," Transl Res, vol. 165, no. 1, pp. 36-47, Jan 2015, doi: 10.1016/j.trsl.2014.05.003. S. L. Kloet et al., "NuRD-interacting protein ZFP296 regulates genome-wide NuRD localization and differentiation of mouse embryonic stem cells," Nature communications, vol. 9, no. 1, p. 4588, Nov 2 2018, doi: 10.1038/s41467-018-07063-7. N. Reynolds et al., "NuRD suppresses pluripotency gene expression to promote transcriptional heterogeneity and lineage commitment," Cell stem cell, vol. 10, no. 5, pp. 583-94, May 4 2012, doi: 10.1016/j.stem.2012.02.020. R. D. Kelly and S. M. Cowley, "The physiological roles of histone deacetylase (HDAC) 1 and 2: complex co-stars with multiple leading parts," (in eng), Biochem Soc Trans, vol. 41, no. 3, pp. 741-9, Jun 2013, doi: 10.1042/bst20130010. G. A. Challen et al., "Dnmt3a and Dnmt3b have overlapping and distinct functions in hematopoietic stem cells," Cell stem cell, vol. 15, no. 3, pp. 350-364, Sep 4 2014, doi: 10.1016/j.stem.2014.06.018. M. Jackson et al., "Severe global DNA hypomethylation blocks differentiation and induces histone hyperacetylation in embryonic stem cells," Mol Cell Biol, vol. 24, no. 20, pp. 886271, Oct 2004, doi: 10.1128/MCB.24.20.8862-8871.2004. 124 | P a g e References [296] [297] [298] [299] [300] [301] [302] [303] [304] [305] [306] [307] [308] [309] [310] [311] Z. D. Smith and A. Meissner, "DNA methylation: roles in mammalian development," Nat Rev Genet, vol. 14, no. 3, pp. 204-20, Mar 2013, doi: 10.1038/nrg3354. Q. Li et al., "A sequential EMT-MET mechanism drives the differentiation of human embryonic stem cells towards hepatocytes," Nature communications, vol. 8, p. 15166, May 3 2017, doi: 10.1038/ncomms15166. C. Ding et al., "Gab2 facilitates epithelial-to-mesenchymal transition via the MEK/ERK/MMP signaling in colorectal cancer," J Exp Clin Cancer Res, vol. 35, p. 5, Jan 12 2016, doi: 10.1186/s13046-015-0280-0. K. Matsusue et al., "Hepatic CCAAT/enhancer binding protein alpha mediates induction of lipogenesis and regulation of glucose homeostasis in leptin-deficient mice," (in eng), Mol Endocrinol, vol. 18, no. 11, pp. 2751-64, Nov 2004, doi: 10.1210/me.2004-0213. N. D. Wang et al., "Impaired energy homeostasis in C/EBP alpha knockout mice," (in eng), Science (New York, N.Y.), vol. 269, no. 5227, pp. 1108-12, Aug 25 1995, doi: 10.1126/science.7652557. L. E. Olofsson et al., "CCAAT/enhancer binding protein alpha (C/EBPalpha) in adipose tissue regulates genes in lipid and glucose metabolism and a genetic variation in C/EBPalpha is associated with serum levels of triglycerides," (in eng), J Clin Endocrinol Metab, vol. 93, no. 12, pp. 4880-6, Dec 2008, doi: 10.1210/jc.2008-0574. A. Westmacott, Z. D. Burke, G. Oliver, J. M. Slack, and D. Tosh, "C/EBPalpha and C/EBPbeta are markers of early liver development," Int J Dev Biol, vol. 50, no. 7, pp. 653-7, 2006, doi: 10.1387/ijdb.062146aw. B. Wang, C. Gao, and K. P. Ponder, "C/EBPbeta contributes to hepatocyte growth factorinduced replication of rodent hepatocytes," (in eng), Journal of hepatology, vol. 43, no. 2, pp. 294-302, Aug 2005, doi: 10.1016/j.jhep.2005.02.029. L. E. Greenbaum, D. E. Cressman, B. A. Haber, and R. Taub, "Coexistence of C/EBP alpha, beta, growth-induced proteins and DNA synthesis in hepatocytes during liver regeneration. Implications for maintenance of the differentiated state during liver growth," (in eng), J Clin Invest, vol. 96, no. 3, pp. 1351-65, Sep 1995, doi: 10.1172/jci118170. C. A. Alper, A. M. Johnson, A. G. Birtch, and F. D. Moore, "Human C'3: evidence for the liver as the primary site of synthesis," (in eng), Science (New York, N.Y.), vol. 163, no. 3864, pp. 286-8, Jan 17 1969, doi: 10.1126/science.163.3864.286. Z. Zhou, M. J. Xu, and B. Gao, "Hepatocytes: a key cell type for innate immunity," Cell Mol Immunol, vol. 13, no. 3, pp. 301-15, May 2016, doi: 10.1038/cmi.2015.97. A. B. McLean et al., "Activin a efficiently specifies definitive endoderm from human embryonic stem cells only when phosphatidylinositol 3-kinase signaling is suppressed," (in eng), Stem cells (Dayton, Ohio), vol. 25, no. 1, pp. 29-38, Jan 2007, doi: 10.1634/stemcells.2006-0219. E. H. Lee et al., "Disruption of the non-canonical WNT pathway in lung squamous cell carcinoma," (in eng), Clin Med Oncol, vol. 2008, no. 2, pp. 169-179, Apr 1 2008. A. Sato, H. Yamamoto, H. Sakane, H. Koyama, and A. Kikuchi, "Wnt5a regulates distinct signalling pathways by binding to Frizzled2," EMBO J, vol. 29, no. 1, pp. 41-54, Jan 6 2010, doi: 10.1038/emboj.2009.322. J. A. Bisson, B. Mills, J. C. Paul Helt, T. P. Zwaka, and E. D. Cohen, "Wnt5a and Wnt11 inhibit the canonical Wnt pathway and promote cardiac progenitor development via the Caspase-dependent degradation of AKT," Dev Biol, vol. 398, no. 1, pp. 80-96, Feb 1 2015, doi: 10.1016/j.ydbio.2014.11.015. M. J. Nemeth, L. Topol, S. M. Anderson, Y. Yang, and D. M. Bodine, "Wnt5a inhibits canonical Wnt signaling in hematopoietic stem cells and enhances repopulation," (in eng), Proceedings of the National Academy of Sciences of the United States of America, vol. 104, no. 39, pp. 15436-41, Sep 25 2007, doi: 10.1073/pnas.0704747104. 125 | P a g e References [312] [313] [314] [315] [316] [317] [318] [319] [320] [321] [322] [323] [324] [325] [326] [327] [328] M. H. Kang et al., "Estrogen-related receptor gamma functions as a tumor suppressor in gastric cancer," Nature communications, vol. 9, no. 1, p. 1920, May 15 2018, doi: 10.1038/s41467-018-04244-2. Y. Ahn et al., "Multiple modes of Lrp4 function in modulation of Wnt/beta-catenin signaling during tooth development," Development (Cambridge, England), vol. 144, no. 15, pp. 2824-2836, Aug 1 2017, doi: 10.1242/dev.150680. K. Tamai et al., "LDL-receptor-related proteins in Wnt signal transduction," (in eng), Nature, vol. 407, no. 6803, pp. 530-5, Sep 28 2000, doi: 10.1038/35035117. P. May, E. Woldt, R. L. Matz, and P. Boucher, "The LDL receptor-related protein (LRP) family: an old family of proteins with new physiological functions," Ann Med, vol. 39, no. 3, pp. 219-28, 2007, doi: 10.1080/07853890701214881. E. I. Christensen et al., "Evidence for an essential role of megalin in transepithelial transport of retinol," (in eng), J Am Soc Nephrol, vol. 10, no. 4, pp. 685-95, Apr 1999, doi: 10.1681/asn.V104685. S. Stefansson, D. A. Chappell, K. M. Argraves, D. K. Strickland, and W. S. Argraves, "Glycoprotein 330/low density lipoprotein receptor-related protein-2 mediates endocytosis of low density lipoproteins via interaction with apolipoprotein B100," J Biol Chem, vol. 270, no. 33, pp. 19417-21, Aug 18 1995, doi: 10.1074/jbc.270.33.19417. A. Niemeier et al., "Identification of megalin/gp330 as a receptor for lipoprotein(a) in vitro," (in eng), Arterioscler Thromb Vasc Biol, vol. 19, no. 3, pp. 552-61, Mar 1999, doi: 10.1161/01.atv.19.3.552. J. Sahi, S. Grepper, and C. Smith, "Hepatocytes as a tool in drug metabolism, transport and safety evaluations in drug discovery," (in eng), Curr Drug Discov Technol, vol. 7, no. 3, pp. 188-98, Sep 2010, doi: 10.2174/157016310793180576. J. Shan et al., "Identification of small molecules for human hepatocyte expansion and iPS differentiation," (in eng), Nature chemical biology, vol. 9, no. 8, pp. 514-20, Aug 2013, doi: 10.1038/nchembio.1270. C. C. Bell et al., "Characterization of primary human hepatocyte spheroids as a model system for drug-induced liver injury, liver function and disease," (in eng), Scientific reports, vol. 6, p. 25187, May 4 2016, doi: 10.1038/srep25187. J. V. Castell, R. Jover, C. P. Martínez-Jiménez, and M. J. Gómez-Lechón, "Hepatocyte cell lines: their use, scope and limitations in drug metabolism studies," (in eng), Expert Opin Drug Metab Toxicol, vol. 2, no. 2, pp. 183-212, Apr 2006, doi: 10.1517/17425255.2.2.183. S. Bartfeld and H. Clevers, "Stem cell-derived organoids and their application for medical research and patient treatment," (in eng), Journal of molecular medicine (Berlin, Germany), vol. 95, no. 7, pp. 729-738, Jul 2017, doi: 10.1007/s00109-017-1531-7. M. D. Davidson, B. R. Ware, and S. R. Khetani, "Stem cell-derived liver cells for drug testing and disease modeling," (in eng), Discov Med, vol. 19, no. 106, pp. 349-58, May 2015. T. Takebe et al., "Generation of a vascularized and functional human liver from an iPSCderived organ bud transplant," Nature protocols, vol. 9, no. 2, pp. 396-409, Feb 2014, doi: 10.1038/nprot.2014.020. V. Kegel, D. Deharde, E. Pfeiffer, K. Zeilinger, D. Seehofer, and G. Damm, "Protocol for Isolation of Primary Human Hepatocytes and Corresponding Major Populations of Nonparenchymal Liver Cells," J Vis Exp, no. 109, p. e53069, Mar 30 2016, doi: 10.3791/53069. D. M. Keller et al., "A DNA damage-induced p53 serine 392 kinase complex contains CK2, hSpt16, and SSRP1," (in eng), Mol Cell, vol. 7, no. 2, pp. 283-92, Feb 2001, doi: 10.1016/s1097-2765(01)00176-9. M. Sayed, S. Pelech, C. Wong, A. Marotta, and B. Salh, "Protein kinase CK2 is involved in G2 arrest and apoptosis following spindle damage in epithelial cells," (in eng), Oncogene, vol. 20, no. 48, pp. 6994-7005, Oct 25 2001, doi: 10.1038/sj.onc.1204894. 126 | P a g e References [329] [330] [331] [332] [333] [334] [335] [336] [337] [338] [339] [340] [341] [342] [343] [344] J. L. Brockman, S. D. Gross, M. R. Sussman, and R. A. Anderson, "Cell cycle-dependent localization of casein kinase I to mitotic spindles," (in eng), Proceedings of the National Academy of Sciences of the United States of America, vol. 89, no. 20, pp. 9454-8, Oct 15 1992, doi: 10.1073/pnas.89.20.9454. S. Jiang, M. Zhang, J. Sun, and X. Yang, "Casein kinase 1α: biological mechanisms and theranostic potential," (in eng), Cell Commun Signal, vol. 16, no. 1, p. 23, May 24 2018, doi: 10.1186/s12964-018-0236-z. T. Suga, H. Yamaguchi, T. Sato, M. Maekawa, J. Goto, and N. Mano, "Preference of Conjugated Bile Acids over Unconjugated Bile Acids as Substrates for OATP1B1 and OATP1B3," PLoS One, vol. 12, no. 1, p. e0169719, 2017, doi: 10.1371/journal.pone.0169719. K. Maeda, M. Kambara, Y. Tian, A. F. Hofmann, and Y. Sugiyama, "Uptake of ursodeoxycholate and its conjugates by human hepatocytes: role of Na(+)-taurocholate cotransporting polypeptide (NTCP), organic anion transporting polypeptide (OATP) 1B1 (OATP-C), and oatp1B3 (OATP8)," (in eng), Mol Pharm, vol. 3, no. 1, pp. 70-7, Jan-Feb 2006, doi: 10.1021/mp050063u. H. Huang et al., "Human FABP1 T94A variant enhances cholesterol uptake," Biochim Biophys Acta, vol. 1851, no. 7, pp. 946-55, Jul 2015, doi: 10.1016/j.bbalip.2015.02.015. J. Herbert et al., "Transthyretin: a choroid plexus-specific transport protein in human brain. The 1986 S. Weir Mitchell award," (in eng), Neurology, vol. 36, no. 7, pp. 900-11, Jul 1986, doi: 10.1212/wnl.36.7.900. D. K. Kim et al., "Characterization of the system L amino acid transporter in T24 human bladder carcinoma cells," (in eng), Biochim Biophys Acta, vol. 1565, no. 1, pp. 112-21, Sep 20 2002, doi: 10.1016/s0005-2736(02)00516-3. E. C. Friesema et al., "Thyroid hormone transport by the heterodimeric human system L amino acid transporter," (in eng), Endocrinology, vol. 142, no. 10, pp. 4339-48, Oct 2001, doi: 10.1210/endo.142.10.8418. M. Geiszt, J. Witta, J. Baffi, K. Lekstrom, and T. L. Leto, "Dual oxidases represent novel hydrogen peroxide sources supporting mucosal surface host defense," (in eng), Faseb j, vol. 17, no. 11, pp. 1502-4, Aug 2003, doi: 10.1096/fj.02-1104fje. A. Schroder et al., "Genomics of ADME gene expression: mapping expression quantitative trait loci relevant for absorption, distribution, metabolism and excretion of drugs in human liver," Pharmacogenomics J, vol. 13, no. 1, pp. 12-20, Feb 2013, doi: 10.1038/tpj.2011.44. H. H. Lau, N. H. J. Ng, L. S. W. Loo, J. B. Jasmen, and A. K. K. Teo, "The molecular functions of hepatocyte nuclear factors - In and beyond the liver," Journal of hepatology, vol. 68, no. 5, pp. 1033-1048, May 2018, doi: 10.1016/j.jhep.2017.11.026. E. Currie, A. Schulze, R. Zechner, T. C. Walther, and R. V. Farese, Jr., "Cellular fatty acid metabolism and cancer," Cell metabolism, vol. 18, no. 2, pp. 153-61, Aug 6 2013, doi: 10.1016/j.cmet.2013.05.017. F. Rohrig and A. Schulze, "The multifaceted roles of fatty acid synthesis in cancer," Nat Rev Cancer, vol. 16, no. 11, pp. 732-749, Nov 2016, doi: 10.1038/nrc.2016.89. M. Knobloch et al., "Metabolic control of adult neural stem cell activity by Fasn-dependent lipogenesis," Nature, vol. 493, no. 7431, pp. 226-30, Jan 10 2013, doi: 10.1038/nature11689. J. A. Park, S. Park, W. Y. Park, M. K. Han, and Y. Lee, "Splitomicin, a SIRT1 Inhibitor, Enhances Hematopoietic Differentiation of Mouse Embryonic Stem Cells," Int J Stem Cells, vol. 12, no. 1, pp. 21-30, Mar 30 2019, doi: 10.15283/ijsc18040. S. L. Wild, A. Elghajiji, C. Grimaldos Rodriguez, S. D. Weston, Z. D. Burke, and D. Tosh, "The Canonical Wnt Pathway as a Key Regulator in Liver Development, Differentiation and 127 | P a g e References [345] [346] [347] [348] [349] [350] [351] [352] [353] [354] [355] [356] [357] [358] [359] [360] Homeostatic Renewal," Genes (Basel), vol. 11, no. 10, Sep 30 2020, doi: 10.3390/genes11101163. B. Boward, T. Wu, and S. Dalton, "Concise Review: Control of Cell Fate Through Cell Cycle and Pluripotency Networks," Stem cells (Dayton, Ohio), vol. 34, no. 6, pp. 1427-36, Jun 2016, doi: 10.1002/stem.2345. I. Neganova et al., "CDK1 plays an important role in the maintenance of pluripotency and genomic stability in human pluripotent stem cells," Cell Death Dis, vol. 5, p. e1508, Nov 6 2014, doi: 10.1038/cddis.2014.464. J. Heo et al., "Phosphorylation of TFCP2L1 by CDK1 is required for stem cell pluripotency and bladder carcinogenesis," EMBO Mol Med, vol. 12, no. 1, p. e10880, Jan 9 2020, doi: 10.15252/emmm.201910880. W. Michowski et al., "Cdk1 Controls Global Epigenetic Landscape in Embryonic Stem Cells," (in eng), Mol Cell, vol. 78, no. 3, pp. 459-476.e13, May 7 2020, doi: 10.1016/j.molcel.2020.03.010. L. Li et al., "Cdk1 interplays with Oct4 to repress differentiation of embryonic stem cells into trophectoderm," (in eng), FEBS Lett, vol. 586, no. 23, pp. 4100-7, Nov 30 2012, doi: 10.1016/j.febslet.2012.10.030. T. Kunath, M. K. Saba-El-Leil, M. Almousailleakh, J. Wray, S. Meloche, and A. Smith, "FGF stimulation of the Erk1/2 signalling cascade triggers transition of pluripotent embryonic stem cells from self-renewal to lineage commitment," Development (Cambridge, England), vol. 134, no. 16, pp. 2895-902, Aug 2007, doi: 10.1242/dev.02880. A. Piliszek, Z. E. Madeja, and B. Plusa, "Suppression of ERK signalling abolishes primitive endoderm formation but does not promote pluripotency in rabbit embryo," Development (Cambridge, England), vol. 144, no. 20, pp. 3719-3730, Oct 15 2017, doi: 10.1242/dev.156406. M. Roode et al., "Human hypoblast formation is not dependent on FGF signalling," Dev Biol, vol. 361, no. 2, pp. 358-63, Jan 15 2012, doi: 10.1016/j.ydbio.2011.10.030. Q. L. Ying et al., "The ground state of embryonic stem cell self-renewal," Nature, vol. 453, no. 7194, pp. 519-23, May 22 2008, doi: 10.1038/nature06968. R. K. Jaiswal, N. Jaiswal, S. P. Bruder, G. Mbalaviele, D. R. Marshak, and M. F. Pittenger, "Adult human mesenchymal stem cell differentiation to the osteogenic or adipogenic lineage is regulated by mitogen-activated protein kinase," J Biol Chem, vol. 275, no. 13, pp. 9645-52, Mar 31 2000, doi: 10.1074/jbc.275.13.9645. Y. Meng et al., "Nicotinamide Promotes Cell Survival and Differentiation as Kinase Inhibitor in Human Pluripotent Stem Cells," Stem cell reports, vol. 11, no. 6, pp. 1347-1356, Dec 11 2018, doi: 10.1016/j.stemcr.2018.10.023. Y. Yu et al., "ERK inhibition promotes neuroectodermal precursor commitment by blocking self-renewal and primitive streak formation of the epiblast," Stem Cell Res Ther, vol. 9, no. 1, p. 2, Jan 5 2018, doi: 10.1186/s13287-017-0750-8. Y. Zhang et al., "Nicotinamide promotes pancreatic differentiation through the dual inhibition of CK1 and ROCK kinases in human embryonic stem cells," Stem Cell Res Ther, vol. 12, no. 1, p. 362, Jun 25 2021, doi: 10.1186/s13287-021-02426-2. S. Yamada, M. Hill, and T. Takakuwa, "Human Embryology," in New Discoveries in Embryology, 2015, ch. Chapter 5. D. Jiang, H. W. Jarrett, and W. E. Haskins, "Methods for proteomic analysis of transcription factors," (in eng), J Chromatogr A, vol. 1216, no. 41, pp. 6881-9, Oct 9 2009, doi: 10.1016/j.chroma.2009.08.044. D. Jiang, Y. Zhou, R. A. Moxley, and H. W. Jarrett, "Purification and identification of positive regulators binding to a novel element in the c-Jun promoter," (in eng), Biochemistry, vol. 47, no. 35, pp. 9318-34, Sep 2 2008, doi: 10.1021/bi800285q. 128 | P a g e References [361] [362] [363] [364] [365] [366] [367] [368] [369] [370] [371] [372] [373] [374] [375] [376] [377] B. Schwanhausser et al., "Global quantification of mammalian gene expression control," (in eng), Nature, vol. 473, no. 7347, pp. 337-42, May 19 2011, doi: 10.1038/nature10098. G. Kustatscher, P. Grabowski, and J. Rappsilber, "Pervasive coexpression of spatially proximal genes is buffered at the protein level," Molecular systems biology, vol. 13, no. 8, p. 937, Aug 23 2017, doi: 10.15252/msb.20177548. S. Doll and A. L. Burlingame, "Mass spectrometry-based detection and assignment of protein posttranslational modifications," (in eng), ACS Chem Biol, vol. 10, no. 1, pp. 63-71, Jan 16 2015, doi: 10.1021/cb500904b. P. V. Hornbeck, B. Zhang, B. Murray, J. M. Kornhauser, V. Latham, and E. Skrzypek, "PhosphoSitePlus, 2014: mutations, PTMs and recalibrations," (in eng), Nucleic Acids Res, vol. 43, no. Database issue, pp. D512-20, Jan 2015, doi: 10.1093/nar/gku1267. E. J. Needham, B. L. Parker, T. Burykin, D. E. James, and S. J. Humphrey, "Illuminating the dark phosphoproteome," (in eng), Science signaling, vol. 12, no. 565, Jan 22 2019, doi: 10.1126/scisignal.aau8645. C. Schölz et al., "Acetylation site specificities of lysine deacetylase inhibitors in human cells," (in eng), Nat Biotechnol, vol. 33, no. 4, pp. 415-23, Apr 2015, doi: 10.1038/nbt.3130. A. J. Hughes, D. P. Spelke, Z. Xu, C. C. Kang, D. V. Schaffer, and A. E. Herr, "Single-cell western blotting," Nature methods, vol. 11, no. 7, pp. 749-55, Jul 2014, doi: 10.1038/nmeth.2992. O. I. Ornatsky et al., "Study of cell antigens and intracellular DNA by identification of element-containing labels and metallointercalators using inductively coupled plasma mass spectrometry," (in eng), Analytical chemistry, vol. 80, no. 7, pp. 2539-47, Apr 1 2008, doi: 10.1021/ac702128m. L. J. Tracey, Y. An, and M. J. Justice, "CyTOF: An Emerging Technology for Single-Cell Proteomics in the Mouse," Curr Protoc, vol. 1, no. 4, p. e118, Apr 2021, doi: 10.1002/cpz1.118. R. Gadalla et al., "Validation of CyTOF Against Flow Cytometry for Immunological Studies and Monitoring of Human Cancer Clinical Trials," (in eng), Front Oncol, vol. 9, p. 415, 2019, doi: 10.3389/fonc.2019.00415. B. Budnik, E. Levy, G. Harmange, and N. Slavov, "SCoPE-MS: mass spectrometry of single mammalian cells quantifies proteome heterogeneity during cell differentiation," (in eng), Genome biology, vol. 19, no. 1, p. 161, Oct 22 2018, doi: 10.1186/s13059-018-1547-5. A. A. Petelski et al., "Multiplexed single-cell proteomics using SCoPE2," (in eng), Nature protocols, vol. 16, no. 12, pp. 5398-5425, Dec 2021, doi: 10.1038/s41596-021-00616-z. T. K. Cheung, C. Y. Lee, F. P. Bayer, A. McCoy, B. Kuster, and C. M. Rose, "Defining the carrier proteome limit for single-cell proteomics," (in eng), Nature methods, vol. 18, no. 1, pp. 76-83, Jan 2021, doi: 10.1038/s41592-020-01002-5. C. Ctortecka, K. Stejskal, G. Krššáková, S. Mendjan, and K. Mechtler, "Quantitative Accuracy and Precision in Multiplexed Single-Cell Proteomics," (in eng), Analytical chemistry, Dec 30 2021, doi: 10.1021/acs.analchem.1c04174. Y. Zhu et al., "Nanodroplet processing platform for deep and quantitative proteome profiling of 10-100 mammalian cells," (in eng), Nature communications, vol. 9, no. 1, p. 882, Feb 28 2018, doi: 10.1038/s41467-018-03367-w. Y. Zhu et al., "Proteomic analysis of single mammalian cells enabled by microfluidic nanodroplet sample preparation and ultrasensitive nanoLC-MS," (in eng), Angewandte Chemie (International ed. in English), May 24 2018, doi: 10.1002/anie.201802843. S. M. Williams et al., "Automated Coupling of Nanodroplet Sample Preparation with Liquid Chromatography-Mass Spectrometry for High-Throughput Single-Cell Proteomics," (in eng), Analytical chemistry, vol. 92, no. 15, pp. 10588-10596, Aug 4 2020, doi: 10.1021/acs.analchem.0c01551. 129 | P a g e References [378] [379] [380] [381] [382] [383] J. Woo et al., "High-throughput and high-efficiency sample preparation for single-cell proteomics using a nested nanowell chip," Nature communications, vol. 12, no. 1, p. 6246, Oct 29 2021, doi: 10.1038/s41467-021-26514-2. K. Stejskal, J. Op de Beeck, G. Dürnberger, P. Jacobs, and K. Mechtler, "Ultrasensitive NanoLC-MS of Subnanogram Protein Samples Using Second Generation Micropillar Array LC Technology with Orbitrap Exploris 480 and FAIMS PRO," (in eng), Analytical chemistry, vol. 93, no. 25, pp. 8704-8710, Jun 29 2021, doi: 10.1021/acs.analchem.1c00990. J. Woo et al., "Robust, sensitive, and quantitative single-cell proteomics based on ion mobility filtering," bioRxiv, p. 2021.01.30.428333, 2021, doi: 10.1101/2021.01.30.428333. Y. Zhu et al., "Subnanogram proteomics: impact of LC column selection, MS instrumentation and data analysis strategy on proteome coverage for trace samples," (in eng), Int J Mass Spectrom, vol. 427, pp. 4-10, Apr 2018, doi: 10.1016/j.ijms.2017.08.016. Y. Cong et al., "Improved Single-Cell Proteome Coverage Using Narrow-Bore Packed NanoLC Columns and Ultrasensitive Mass Spectrometry," (in eng), Analytical chemistry, vol. 92, no. 3, pp. 2665-2671, Feb 4 2020, doi: 10.1021/acs.analchem.9b04631. H. Ouldali et al., "Electrical recognition of the twenty proteinogenic amino acids using an aerolysin nanopore," Nat Biotechnol, vol. 38, no. 2, pp. 176-181, Feb 2020, doi: 10.1038/s41587-019-0345-2. 130 | P a g e List of Abbreviations 2D 3D ACN Ac-site AGC ANOVA BH CID CV DDA DE EMT ESC FA FDR HCD HDAC HE HPLC ICM IH IMAC iPSC iST KDAC KEGG LB LC m/z maxIT MH MS PBS PCA PHH PSC P-site PSM PTM Two-dimensional Three-dimensional Acetonitrile Acetylation site Automatic gain control Analysis of variance Benjamini-Hochberg Collision-induced dissociation Coefficient of variation Data-dependent acquisition Definitive endoderm Epithelial-mesenchymal transition Embryonic stem cell Formic acid false discovery rate Higher-energy collisional dissociation Histone deacetylase Hepatic endoderm High-performance liquid chromatography Inner cell mass Immature hepatocyte Immobilized metal ion affinity chromatography induced pluripotent stem cell in-StageTip Lysine deacetylase Kyoto Encyclopedia of Gene and Genomes Liver bud Liquid chromatography Mass-to-charge ratio Maximum injection time Mature hepatocyte Mass spectrometry Phosphate-buffered saline Principal component analysis Primary human hepatocyte Pluripotent stem cell Phosphorylation site Peptide spectrum match Post-translational modification 131 | P a g e List of abbreviations R Pearson´s correlation coefficient 2 R ROS rpm RT SDS SP3 STAGETip TFA TMT 132 | P a g e Coefficient of determination Reactive oxygen species Revolutions per minute Room temperature Sodium dodecyl sulphate Single-pot, solid-phase-enhanced sample preparation Stop-and-go-extraction tip Trifluoroacetic acid Tandem mass tag List of Figures I-Figure 1: Schematic of stem cell classification and their differentiation potency. ................... 5 I-Figure 2: Hepatocyte cell lineage specification. ...................................................................... 10 I-Figure 3: Classical bottom-up proteomics workflow. ............................................................. 18 I-Figure 4: Schematic of the Orbitrap Fusion™ Lumos™ Tribrid™ mass spectrometer .............. 21 II-Figure 1: Optimizing the iST workflow. ................................................................................. 39 II-Figure 2: Comparison of iST and SP3 workflows. ................................................................... 40 II-Figure 3: TMT labelling with the SP3 workflow. ................................................................... 41 II-Figure 4: Evaluating different magnetic beads as alternatives for SP3 beads. ....................... 43 II-Figure 5: Comparison of SP3 beads and HILIC beads. ............................................................ 44 II-Figure 6: Benchmarking different magnetic IMAC-beads for phospho enrichment. ............. 45 II-Figure 7: Optimizing conditions for phospho enrichment with magnetic IMAC beads. ......... 47 II-Figure 8: Comparing phosphopeptide enrichment of the OnePot approach with the Agilent Bravo platform. ........................................................................................................................ 49 III-Figure 1: Workflow for the proteomics experiment. ........................................................... 61 III-Figure 2: Quality control and differentiation check. ............................................................. 62 III-Figure 3: Global proteomics analysis. ................................................................................... 63 III-Figure 4: Metabolic switch between HE and IH. .................................................................. 64 III-Figure 5: Correlation of protein and mRNA expression levels. ............................................ 66 III-Figure 6: Cell cycle-related protein and phosphorylation changes. ..................................... 68 III-Figure 7: Experimental setup and quality control of Ff-I01 iPSC cell line. ............................ 70 III-Figure 8: Proteome and phosphoproteome analysis of Ff-I01 cells during hepatocyte differentiation. ......................................................................................................................... 71 III-Figure 9: Metabolism vs. maturity........................................................................................ 72 III-Figure 10: Expression profiles of cell surface proteins and epigenetic modifiers. ................. 74 III-Figure 11: Expression profiles of transporter proteins and transcription factors. ................ 75 III-Figure 12: Novel stage-specific protein marker. .................................................................. 77 III-Figure 13: WNT signaling during hepatocyte differentiation. .............................................. 79 IV-Figure 1: Quality control of differentiation. ......................................................................... 89 IV-Figure 2: Elevated levels of metabolic pathways in PHH. .................................................... 91 IV-Figure 3: Protein expression of 3D samples resemble PHH better than the 2D approach. ... 92 IV-Figure 4: Comparison of 2D and 3D-derived hepatocytes. ................................................... 93 IV-Figure 5: Highly expressed ADME/Tox-related proteins in PHH. ......................................... 95 IV-Figure 6: Roadmap of hepatocyte differentiation. ............................................................... 97 0-Figure 1: Temporal characterization of selected protein classes. ........................................... II 0-Figure 2: Expression profiles related to novel stage-specific marker proteins. ........................ II 0-Table 1: Primer list. ................................................................................................................ III 0-Table 2: Cell surface marker proteins. .................................................................................... III 133 | P a g e Appendix 1 Supplementary Figures ......................................................................................................... II 2 Supplementary Tables ......................................................................................................... III I|P age Appendix 1 Supplementary Figures 0-Figure 1: Temporal characterization of selected protein classes. (A) Z-scored expression of lysosomal proteins from the Ff-I01 differentiation. (B) High-temporal resolution expression of SIRT1 and SIRT2 as log2fold-change relative to iPSC. X-axis showing days or specific differentiation stage (iPSC=d0, DE=d6, HE=d10, IH=d13, MH=d21). Data represent the average and error bars show range. (C) Left panel shows the z-scored expression of ABC transporter proteins. Right panel depicts average expression of ROS reducing enzymes. Data is from the Ff-I01 differentiation, error bars depict the average, and asterisk denote significance (ANOVA, BH corrected: *FDR<0.05; **FDR<0.01; ***FDR<0.001). (D) Dynamic expression of members from the SIN3 complex. Datapoints denoting the median of identified SIN3 complex members relative to iPSC. Error bars represent the standard deviation. 0-Figure 2: Expression profiles related to novel stage-specific marker proteins. (A) Dynamic protein expression relative to iPSC. Data represents the average and error bars the range. (B) Relative mRNA expression of all three DNA methyltransferases as derived from Camp et al.[100]. (C) Protein and mRNA expression of the epithelial marker E-cadherin and the mesenchymal marker N-cadherin, as quantified via MS or scRNA-seq (from Camp et al.[100]), respectively. II | P a g e Appendix 2 Supplementary Tables 0-Table 1: Primer list. List of forward and reverse primers with their respective Roche Universal ProbeLibrary number. Gene name OCT4 GATA4 SOX17 HNF1B HNF4A FOXA2 RBP4 AFP ALB KRT7 Forward primer CTTCGCAAGCCCTCATTTC GAAAACGGAAGCCCAAGAAC ACGCCGAGTTGAGCAAGA CCTCTCACCTGATGGTAAAATGA TCAGACCCTGAGCCACCT TGTCTGAGGAGTCGGAGAGC CCAGAAGCGCAGAAGATTG TCCTTGTAAGTGGCTTCTTGAAC AATGTTGCCAAGCTGCTGA CAGGCTGAGATCGACAACATC Reverse primer GAGAAGGCGAAATCCGAAG CATCTCCTCGCTGCTGCT TCTGCCTCCTCCACGAAG GGATATTCGTCAAGGTGCTGA AGCAACGGACAGATGTGTGA ACCGCTCCCAGCATACTTT TTTCTTTCTGATCTGCCATCG TGTACTGCAGAGATAAGTTTAGCTGAC CTTCCCTTCATCCCGAAGTT CTTGGCACGAGCATCCTT UPL # 60 11 61 63 27 7 17 61 27 24 0-Table 2: Cell surface marker proteins. List showing cell surface proteins that are differentially expressed (ANOVA, BH corrected FDR<0.05) and with a fold-change >2 at one or more stages. The ‘CD’ column shows the assigned cluster of differentiation name and the ‘Stage’ column depicts the respective developmental stage in which the marker protein is highly expressed. Gene name ABCC4 SIL1 TMEM63A AGRN ASPH ATP1B1 ATP6V0A1 B4GALT5 BST1 CLU ERBB2 FGFR1 FGFR4 FLRT2 GGT7 GOLM1 HLA-DRB1 IL6ST LAMA1 LAMB1 LAMC1 CD no no no no no no no no CD157 no CD340 CD331 CD334 no no no no CD130 no no no Stage DE DE DE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE HE III | P a g e Appendix LRP2 PLD3 PTPRA SCARB2 SLC22A5 SLC44A2 ST3GAL1 SULF2 TMEM132A TMEM2 TSPAN9 TTYH3 VCAN WNT11 A2M ACAA1 ANPEP APOB ASPH BST1 C3 CDH1 CPD EGFR ENPP1 FGA FGB FLRT2 FN1 HPX HS2ST1 IGDCC4 IGSF1 IL1RAP ITGA3 LDLR LMF2 LRP1 LRP4 MERTK PLOD1 PON2 PTPRF SCARB1 IV | P a g e no no no no no no no no no no no no no no no no CD13 no no CD157 no CD324 no no no no no no no no no no no no CD49c no no CD91 no no no no no no HE HE HE HE HE HE HE HE HE HE HE HE HE HE IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH IH Appendix SEMA4G SERPINA1 SERPING1 SERPINH1 SIDT2 SLC12A7 SLC22A5 SLC46A1 SLIT3 TFPI VTN A2M ANPEP C3 CASP7 CTSA CTSD DPP7 ERO1LB GGT1 HLA-C ICAM1 ITGA2 LAMP2 LMNA NAGLU P2RX4 SMPDL3B SQRDL ST14 TFRC TMEM2 no no no no no no no no no no no no CD13 no no no no no no CD224 no CD54 CD49b CD107b no no no no no no CD71 no IH IH IH IH IH IH IH IH IH IH IH MH MH MH MH MH MH MH MH MH MH MH MH MH MH MH MH MH MH MH MH MH V|P a ge Danksagungen Geschafft! Nach etwas über 4 Jahren bin ich nun am Ende meiner Doktorarbeit und möchte mich bei einigen Menschen bedanken ohne die ich nicht so weit gekommen wäre. Vielen Dank, Bernhard! Danke für deine Betreuung in den letzten Jahren. Für die vielen Meetings und die zahlreichen Tips, die du mir als Doktorvater mit auf den Weg gegeben hast! Danke, dass nicht nur deine Tür immer offen ist, sondern auch dein Ohr, und das trotz deines vollen Terminkalenders. Ein weiterer Dank geht an Barbara. Durch dich ist das ganze Projekt erst entstanden und ich konnte einige interessante Einblicke in euer Labor werfen. Bei den Aufenthalten in Leipzig habe ich viel gelernt und wurde herzlich in die Arbeitsgruppe aufgenommen. Ein besonderer Dank geht an Agnieska für die Einblicke, die du mir in die Zelldifferenzierung gegeben hat. Desweiteren möchte ich mich bei Markus Breunig bedanken. Mit dir habe ich nicht nur während der letzten Jahre erfolgreich wissenschaftlich zusammengearbeitet, sondern du bist auch einer meiner engsten Freunde und stehst mir immer mit Rat und Tat zur Seite. Danke auch für das Korrektur lesen meiner Arbeit. Als nächstes möchte ich mich bei meinem Studenten Karl Kristian für die gute Zusammenarbeit während des Forschungspraktikums und der Masterarbeit bedanken. Du warst sehr fleißig und zuverlässig und es hat Spaß gemacht, mir dir zusammenzuarbeiten. Ein dickes „Dankeschön“ geht an den kompletten Lehrstuhl für Proteomik und Bioanalytik aka TTT. Ich hätte mir in den vergangenen Jahren keinen schöneren Arbeitsplatz vorstellen können. Die tolle Arbeitsathmosphäre und das hohe Knowhow sucht wahrscheinlich seinesgleichen und ich werde noch oft an diese Zeit mit euch zurückdenken. Besonders möchte ich mich bei Andrea, Micha, Andi, Martina, Silvia und Gabi bedanken, ihr haltet den Lehrstuhl am Laufen und seid immer hilfsbereit zur Stelle. Ich möchte mich auch explizit bei Severin bedanken, mit dir habe ich nicht nur den ein oder anderen Kaffee getrunken, sondern währendessen viel über Wissenschaft aber auch Privates reden können. Danke auch für deinen Input für diese Doktorarbeit! Ich möchte mich auch bei meinem Prüfungskomitee für die Bereitschaft bedanken, meine Doktorarbeit zu begutachten! Außerdem möchte ich mich auch bei meinen Kollaborationspartnern in den letzten Jahren für die Zusammenarbeit in vielen interessanten Projekten bedanken. Mama, Papa, ich bin froh, dass ich euch habe! Vielen Dank für die Unterstützung und Freiheit, die ihr mir nicht nur während des Studiums oder der Doktorarbeit gegeben habt, sondern die ich schon mein ganzes Leben genießen darf. Eure positive Einstellung und euer Zuspruch, dass ich das schon alles schaffe und ich meinen Weg finde, haben die Weichen für diese Arbeit gestellt. Liebe Kiki, was wäre ich nur ohne dich? Wie hätte ich das alles nur ohne dich schaffen können? Ich kann meinen Dank dir gegenüber nicht in Worte fassen und bin einfach nur glücklich, dass ich dich habe! Ohne deine alltägliche Unterstützung würde ich ganz schön alt aussehen! VII | P a g e List of publications Published articles:  Krumm J., Sekine K., Samaras P., Brazovskaja A., Breunig M., Yasui R., Kleger A., Taniguchi H., Wilhelm M., Treutlein B., Camp J. G., Kuster B. “High temporal resolution proteome and phosphoproteome profiling of stem cell-derived hepatocyte development”. Cell Reports 2022; 38(13):110604; doi: 10.1016/j.celrep.2022.110604  Giansanti P, Samaras P, Bian Y, Meng C, Coluccio A, Frejno M, Jakubowsky H, Dobiasch S, Hazarika RR, Rechenberger J, Calzada-Wack J, Krumm J, Mueller S, Lee CY, Wimberger N, Lautenbacher L, Hassan Z, Chang YC, Falcomatà C, Bayer FP, Bärthel S, Schmidt T, Rad R, Combs SE, The M, Johannes F, Saur D, de Angelis MH, Wilhelm M, Schneider G, Kuster B. “Mass spectrometry-based draft of the mouse proteome.”. Nature Methods 2022; 19(7):803-811; doi: 10.1038/s41592-022-01526-y  Breunig M., Merkle J., Wagner M., Melzer M. K., Barth T. F. E., Engleitner T., Krumm J., Wiedenmann S., Cohrs C. M., Perkhofer L., Jain G., Krüger J., Hermann P. C., Schmid M., Madácsy T., Varga Á., Griger J., Azoitei N., Müller M., Wessely O., Robey P. G., Heller S., Dantes Z., Reichert M., Günes C., Bolenz C., Kuhn F., Maléth J., Speier S., Liebau S., Sipos B., Kuster B., Seufferlein T., Rad R., Meier M., Hohwieler M., and Kleger A. “Modelling Plasticity and Dysplasia of Pancreatic Ductal Organoids Derived from Human Pluripotent Stem Cells”. Cell Stem Cell 2021; 28(6):1105-1124; doi: 10.1016/j.stem.2021.03.005  Philippi A., Heller S., Costa I., Senée V., Breunig M., Kwon G., Zhijian L., Illing A., Lin Q., Hohwieler M., Degavre A., Kassai B., Liebau S., Schuster M., Krumm J., Zhang N., Geusz R., Russell R., Besse C., Kuster B., Hebrok M., Seufferlein T., Boehm B. O., Oswald F., Sander M., Nicolino M., Julier C., and Kleger A. “Mutations and variants of ONECUT1 in diabetes”. Nature Medicine 2021; 27:1928-1940; doi: 10.1038/s41591-021-01502-7 IX | P a g e