bioinformatics for multiomeโspatialโsingle-cell; developmentโdisease; Assistant professor at UCSF; Tweets๐own;
BlueSky: PengHe.bsky.social Mastodon:penghe
Our Perspective paper "Toward informed batch correction for single-cell transcriptome integration" is out in @NatComputSci ๐rdcu.be/e4cSp
We review a decade of batch-correction methods and propose to move from "blind" integration to "informed" modeling.๐งต๐
๐จ๐๐ ๐๐๐ job opportunity @UCSF ๐จ:
I'm hiring TWO postdocs - an experimentalist and a computational biologist. Join us to use cutting-edge single-cell and spatial technologies to unravel the mysteries of human tissues.
opportunities.ucsf.edu/conteโฆ
๐My first postdoc project at @teichlab is now published online! Huge thanks to @ZHANGLAB, Bao Zhang, John Lawrence, @Shuaiyu_Wang, and our collaborators for making this exciting research possible. nature.com/articles/s41586-0โฆ Here are 8 of my favorite highlights from the paper:
Happy to share our preprint ๐ single-cell + spatial atlas of the developing human limb ๐ฆต๐คณ produced by @teichlab .Thanks to all of our co-authors and friends! biorxiv.org/content/10.1101/โฆ
Iโve opened two more positions in my lab for both junior and senior RAsโno PhD required. ๐ Spread the word!
๐ฌ Dry Lab: aprecruit.ucsf.edu/JPF05334
๐ง Wet Lab: aprecruit.ucsf.edu/JPF05332
Thanks for sharing!
I'm maintaining an active list of biology-related job openings at various levels (PIs, staff scientists, postdocs, PhDs, etc.). Please feel free to subscribe or contribute! github.com/brianpenghe/Bio-Jโฆ
You wonโt want to miss Dr Peng He (Wellcome Sanger Institute, Cambridge) presenting โSpatial and multi-omic studies of human organogenesisโ as part of our external seminar series today! @PengHeCam
25th Oct
12-1pm
Open to all Oxford University members.
talks.ox.ac.uk/talks/id/c0f5โฆ
The developing human lung has been mapped in unprecedented detail, identifying 144 cell states in the early stages of life, and uncovering new links between developmental cells and lung cancer.
ebi.ac.uk/about/news/researcโฆ
The Teichmann lab at @sangerinstitute is recruiting Principal/Senior bioinformaticians. As a Programme, we are committed to personal and professional development and would work with you to support your career aspirations. jobs.sanger.ac.uk/intranet/wโฆ
Itโs the 13th day after I acquired COVID virus (probably omicron) at my friendโs birthday party. Iโm negative now and eager to share some personal experiences.
1) Lateral flow tests are useful. See my Day 6, Day 11 and Day 12
Very happy to have contributed to showcasing the ๐ features of this incredible platform ๐, dedicated to making basic single-cell/nucleus and๐บ๏ธspatial transcriptomics analysis accessible to everyone, regardless of programming skills ๐ป!!
We were able to quickly reproduce key findings from @PengHeAtlas of the @teichlabโs new work on human limb development using CZ #CellxGeneโs new spatial features.
Give it a try and, as always, please let us know what you think bit.ly/3WUpgvn
had a great time at #ISSCR2022 .Thanks for the support from @ISSCR for my travel and talk here! really learned a lot and connected/reconnected to so many friends ๐ค
Everyone welcome ๐ Public seminar next week UK time Wed 2 March at 5-6pm by @RonZhu2015 : "Multicellular circuit design in mammalian cells".
News ๐ caltech.edu/about/news/modelโฆ
Message me for Zoom link if interested
Abstract see thread below:
We are so excited to say that it's been announced that Dr Sarah Teichmann @teichlab will be joining us as Professor in Stem Cell Medicine! Sarah does amazing work, and we're thrilled to be part of it soon.
More here: stemcells.cam.ac.uk/news/sarโฆ
I should've read this earlier, and so should whoever thinks they are studying GRNs. It's a very beautiful theory proposed more than half a century ago which was mostly correct (except that it is mainly the TFs but not lncRNAs that are the "activators") science.sciencemag.org/conteโฆ
Great opportunity to share the stories and struggles of my earlier career. Science isnt just about linear glory ! And amazing and wholesome stories by Kerstin! Looking forward to the next session!
CamLS hosted our first career and networking event last Friday with with a focus on academia. We were delighted to have Dr Kerstin Meyer and Dr @PengHeCam who shared captivating stories about their career journeys! Huge thanks our sponsor @TheBgiGroup and everyone who joined us!
Fun dinners with Development Biology professors @MekaylaStorer@LabRawlins and future Immunology professor @AmandaOPhD . English winter can be quite enjoyable! โ๏ธ๐ฝ๐ท
Our adult story of lungcellatlas.org/ led by @EMadissoon and
@AmandaOPhD is also online now! Yet our journey hasn't stopped as more cell types/states (such as angiogenic tip cells) and their regulatory principles are to be described in follow-up papers!
Happy to see our work led by @t_koromila of the Stathopoulos lab officially online now! Drosophila embryoโs spatial-temporal gene regulation is way more interesting than I would have imagined! doi.org/10.7554/eLife.59610@eLife
When we type a password, we should not be shown a ********* but should be given this โถโทโธโนโบโปโผโฝโพ so that we know how many digits we've typed when trying to backspace - just a suggestion
As great an environment as @emblebi has been for my #PhD Iโve been very disappointed in @embl upper management for not inflation adjusting fellowsโ salaries. No increase for EBI fellows in Jan 2022 and (likely) 0.6% in 2023, but โthereโs nothing we can doโ
scRNA-seq offers a snapshot of transcriptomes but not RNA dynamics. scNT-seq from the Wu lab @hao_wu_7 integrates metabolic labeling and droplet sequencing to perform parallel analysis of newly-transcribed and pre-existing mRNAs. nature.com/articles/s41592-0โฆ
Copy-pasting a list of genes from Excel into python codes? This website saves you a lot of time by adding ' and commas automatically. seabreezecomputers.com/excelโฆ
Our paper describing gene regulation in the fly brain is out now in @Nature! So proud of this one and of the amazing team work with @SaraAibar@ibrahimihsan and @joynismail in the @steinaerts lab! Check it out here go.nature.com/3ETX2EF with below our main findings:
1โฃHaving accumulated lots of CD45+ cells from developing human lungs in our freezer, we thawed them for BCR, ฮฑฮฒTCR, ฮณฮดTCR, transcriptome, and surface proteome analysis (CITE-seq) at the single-cell level. science.org/doi/10.1126/sciiโฆ
On the computational side, I developed a simple "DeepTree" algorithm (tutorial here github.com/brianpenghe/pythoโฆ) to select deeply clustered genes from highly variable gene list for single-cell analysis. This can leave behind a lot of batch-effect genes or sporadic noises. (18)
In a new paper with @JanLause & @CellTypist we argue that the best approach for normalization of UMI counts is *analytic Pearson residuals*, using NB model with an offset term for seq depth. + We analyze related 2019 papers by @satijalab and @rafalab. /1
biorxiv.org/content/10.1101/โฆ
Just in! New medicine laureate Ardem Patapoutian and his son Luca, watching the #NobelPrize press conference shortly after finding out the happy news.
Stay tuned for our interview with Patapoutian coming up soon!
Photographer: Nancy Hong
Excited to share #DUBStepR, our correlation-based feature selection method for clustering scRNA-seq data! , Link to the preprint: biorxiv.org/content/10.1101/โฆ.
Huge thank you to everyone in @shyam_lab for guiding me through this journey! As always, #tweetorial below :)
Our new preprint describes how two non-coding RNA pathways cooperate to recognize and repress selfish genetic elements.
Trans-generational cytoplasmic inheritance plays an essential role in this process.
Years of hard work, led by @luo_yicheng.
๐งต /1
biorxiv.org/cgi/content/shorโฆ
3๏ธโฃWe also figured out a way to computationally stitch Visium data from multiple pseudo-overlapping sections to profile a large piece of tissue (using adjacent slices). Our codes can be found here: github.com/Teichlab/limbcellโฆ
ALT Stitching 3 "pseudo-overlapping" adjacent slices
Can't emphasize enough that this is a great platform. Really appreciate the effort of everyone (CZI Cellxgene team and @martinprete for developing the features, as well as my student Weimin Lin and HCA Lattice for formatting the data)
Using the ๐ spatial transcriptomics feature in CZ #CellxGene, we reproduced the key finding of a study from @PengHeAtlas in the Teichmann lab showing transcriptionally + spatially distinct mesenchyme populations in the autopod.
Explore the dataset โก๏ธ bit.ly/4cCLxSW
Earlier this year we had identified a mysterious Foxp1+ perichondrial population in the mouse forelimb nitter.app/PengHeCambridge/statusโฆ Now I'm glad to see it again in human, with more discussions!
The third phase of the ENCODE project presents an online registry of over 1,200,000 human and mouse candidate functional elements that regulate genes, and the findings are presented in a collection of papers. See the collection here. go.nature.com/2DlgiRe
Real-time cases of confirmed #coronarvirus infection in China can be visualized using the codes developed by @lispczz . I modified their heatmap color scheme (my fork: github.com/brianpenghe/pneumโฆ) to suit my eyes better. Hope the spreads stop soon.
A big thanks to #CoBHumanDev organisers, @briscoejames, @priscaliberali, @morris_lab and @xielablife, as well as all the speakers and delegates for making โFrom Stem Cells to Human Developmentโ such a wonderful meeting. We hope to see you all back at Wotton House in 2024!
I've restarted my Zoom-seminar Google calendar which includes seminar time and Zoom links. If anyone wants to get access to my collection of Genomics seminars please let me know your calendar email address. ๐
3. Genomic features are not i.i.d. The genome is SUPER structured. You wouldn't put a promoter in the training set and its gene in the test set, would you? Likewise, the presence of a CTCF motif will tell you about cohesin binding, as well as what ISN'T binding nearby.
Zic protein family has always been a wonderland for treasure hunt. Opa is one of the most interesting ones, which initiates chromatin landscape changes as a pioneer factor. Nice work @t_koromila and everyone!
Opa (odd-paired) is a zinc-finger transcription factor that controls transition of genes from pair-rule to segmental patterns along the AP axis in D. mel. @t_koromila et al. show it's a late timing factor regulating expression along both the DV & AP axes. biorxiv.org/content/10.1101/โฆ
So much to read | Integration of spatial and single-cell transcriptomic data elucidates mouse organogenesis | Nature Biotechnology nature.com/articles/s41587-0โฆ
Indeed! In our #ENCODE3MouseRNA paper I used log(0.1 + x) for bulk RNA-seq analysis and explained to the reviewers that the ratios between 'pseudocount' and 'normalization constant' (1 million in TPM; 10000 in Seurat default for UMI) should be carefully tuned for better results!
I was looking for the correct way to extract forward reads (-F 16 is wrong) from bam file, and got an answer by myself half a year ago biostars.org/p/14378/#441049 ๐
New year question: 10mM dNTP means how much dATP/dCTP/dGTP/dTTP each in the mix? (just trying to see if I'm the only one who made this serious mistake in grad school)
1โฃ4โฃ๏ธ
We also learned a couple of lessons. One is that novel clusters may be artifacts, such as contaminations from other organs mapped to other atlases (@EMadissoon 's Esophagus, @cartalop @Mona_Litvinuk โค๏ธ, @roserventotormo placenta and @Muzz_Haniffa 's liver)
I've posted the notes/slides for my computational biology class at github.com/pachterlab/Bi-BE-โฆ
Topics were chosen based on appearing in >=3 bio areas, although for focus examples are all drawn from #scRNAseq.
Homeworks include both theory and exploration of data (via @GoogleColab).
๐Moreover, IL-1ฮฒ-treated lung organoids show a significant increase in ionocytes while IL-13 enhances tuft cells, for reasons we ran out of time to find. Explore our data portal here: fetal-lung-immune.cellgeni.sโฆ
1โฃ6โฃ๏ธ
Batch correction may hide important confounded features. I did a โsoft correctionโ by merging highly correlated and highly variable genes per batch. All the codes can be found here. github.com/brianpenghe/pythoโฆ
Challenge accepted @teichlab and completed with proper social distancing, despite of distraction by the yellow flowers. Fight on Addenbrookes and NHS! Nominating my dearest bros @tomsgoms@laidlawsean to finish a 5K or/and a 5ยฃ donation
And my exciting June started with an energetic virtual meeting. Thanks to @YuanhuaHuang@XiChenUoM@joshuawkho@linzx06 for organizing this and inviting me to talk about my work. (end of this ๐งต)
So much enjoyed the lively discussion on science in this workshop (locked in HK since moving here). Many thanks to all 42 wonderful speakers & great support from our students and admin colleagues. Such a great experience to organise it with @joshuawkho, @XiChenUoM & @linzx06.
5โฃBut lung T cell trajectory is missing the early part, indicating an irreplaceable role of the thymus; Transcriptomic distinctions between our naive T cells and their pediatric counterparts suggest a systemic switch during childhoodโpossibly linked to pathogen exposure.