The single-cell transcriptome and chromatin accessibility datasets of peripheral blood mononuclear cells in Chinese holstein cattle

Objectives This study was performed in the frame of a more extensive study dedicated to the integrated analysis of the single-cell transcriptome and chromatin accessibility datasets of peripheral blood mononuclear cells (PBMCs) with a large-scale GWAS of 45 complex traits in Chinese Holstein cattle. Lipopolysaccharide (LPS) is a crucial mediator of chronic inflammation to modulate immune responses. PBMCs include primary T and B cells, natural killer (NK) cells, monocytes (Mono), and dendritic cells (DC). How LPS stimulates PBMCs at the single-cell level in dairy cattle remains largely unknown. Data description We sequenced 30,756 estimated single cells and mapped 26,141 of them (96.05%) with approximately 60,075 mapped reads per cell after quality control for four whole-blood treatments (no, 2 h, 4 h, and 8 h LPS) by single-cell RNA sequencing (scRNA-seq) and single-cell sequencing assay for transposase-accessible chromatin (scATAC-seq). Finally, 7,107 (no), 9,174 (2 h), 6,741 (4 h), and 3,119 (8 h) cells were generated with ~ 15,000 total genes in the whole population. Therefore, the single-cell transcriptome and chromatin accessibility datasets in this study enable a further understanding of the cell types and functions of PBMCs and their responses to LPS stimulation in vitro.

The single-cell transcriptome and chromatin accessibility datasets of peripheral blood mononuclear cells in Chinese holstein cattle Xiao Wang 1 , Yahui Gao 2 , Cong-jun Li 2 , Lingzhao Fang 3 , George E. Liu 2 , Xiuxin Zhao 4 , Yuanpei Zhang 4 , Gaozhan Cai 4 , Guanghui Xue 4 , Yan Liu 4 , Lingling Wang 4 , Fan Zhang 1 , Kun Wang 1 , Miao Zhang 1 , Rongling Li 1 , Yundong Gao 1* and Jianbin Li 1* Objective Lipopolysaccharide (LPS) composes the outer membrane of Gram-negative bacteria and its exposure can result in elevated levels of local or systemic inflammation in cattle [1], as LPS modulates immune responses as the crucial mediator of chronic inflammation [2].LPS is a highly potent activator of the innate immune system that can activate cellular responses through the activation of proinflammatory transcription factors and binding of TLR4/ CD14/MD2 receptor complex to cause strong inflammatory responses in infected animals [3][4][5].As the first line of defense, the innate immune system confronts infectious agents through monocytes (Mono), dendritic cells (DCs), granulocytes, phagocytic macrophages, cytotoxic natural killer (NK) cells, and gamma/delta (γδ) T cells [6,7].In dairy cattle, peripheral blood mononuclear cells (PBMCs) typically include primary T and B cells, NK cells, Mono, and DCs, but higher levels of γδ T cell receptor (TCR) positive T cells were found in young calves than those of humans and mice [8][9][10].
Large-scale single-cell PBMC profiling and their responses to LPS stimulation in cattle need to be investigated to determine cell types and functions of PBMCs and further understanding of LPS-mediated bovine PBMC responses regarding the gene transcriptional, chromatin accessibility, and gene-based changes in PBMCs at the single-cell resolution [11].Therefore, these single-cell transcriptome and chromatin accessibility datasets will permit investigators to summarize the cell types and functions of PBMCs to interrogate complex cellular differentiation, regulations, and interactions when responding to LPS stimulation in vitro.

Data description
Four whole blood samples were collected from four 2-year-old Holstein female lactating cattle through the tail vein in Jinan Jiabao Dairy Co., Ltd., in Shandong province, China.Afterward, they were mixed in one pool and evenly divided into four replicates.Four wholeblood samples included one control (no LPS) treatment and three case treatments (2 h, 4 h, and 8 h LPS), where 2 µg/ml LPS (Product Number: L2880, Sigma-Aldrich, Saint Louis, MO, USA) was performed at 37 °C.On Hanks' Balanced Salt Solution (Solarbio; Beijing, China), PBMCs were finally isolated from the whole-blood treated replicates by centrifugation at 500 g for 20 min at room temperature.All experimental procedures were approved by the Animal Ethics Committee of Shandong Academy of Agricultural Sciences, China (No. 20-123).All experimental protocols were approved by the Animal Ethics Committee of Shandong Academy of Agricultural Sciences, China.All methods were carried out in accordance with relevant guidelines and regulations.All experiments were performed in accordance with the ARRIVE guidelines (Animal Research: Reporting of In Vivo Experiments).
Following the user guide of Chromium Single Cell 3' Reagent Kits v3 (CG000183), reverse transcriptase was mixed with the barcoded isolated cells into the Gel Beads-In-Emulsions (GEMs), where read 1 primer sequence (R1) was put into during its incubation.Via end repair, tailing, adaptor ligation, and PCR procedures, the sample index, P5, P7, and read 2 primer sequence (R2) were included in library construction to generate singlecell RNA sequencing (scRNA-seq) library.Following the user guide of Chromium Single Cell ATAC Reagent Kits v1.1 (CG000209), nuclei suspensions were incubated with transposase to preferential fragment DNA in open regions of the chromatin and adapter sequences were added to the ends of the DNA fragments instantaneously.The single-cell sequencing assay for transposase-accessible chromatin (scATAC-seq) library was constructed via PCR when nuclei were barcoded into the GEMs, where the sample index, P7, and R2 were added.Finally, scRNAseq and scATAC-seq libraries containing P5 and P7 primers for bridge amplification were sequenced on the Illumina Novaseq 6000 platform (Illumina, San Diego, CA, USA) with double-end 150 bp.
Cell Ranger Single-Cell Software Suite (release 3.1.0)was used to process the raw data, demultiplex raw basecall files into FASTQ files by "mkfastq", perform alignment, filtering, barcode counting, and unique molecular identifier (UMI) counting by "count", and align raw reads to the cattle reference genome (ARS-UCD1.2/bosTau9)[12] by "pipeline".On average, 392,590,940 out of 408,710,340 raw reads were mapped successfully with the mapping rate of 96.05% and 7,689 cells were finally estimated per sample.The cut-off thresholds for quality control include the number of gene expressions per cell between 200 and 3,000; UMI counts less than 200; percentage of mitochondrial-DNA derived gene expression less than 20%.In addition, genes expressed in less than three cells were removed from the datasets.In total, 26,141 single cells with approximately 60,075 reads per cell were generated by scRNA-seq with approximately 15,000 genes including 7,107 (no), 9,174 (2 h), 6,741 (4 h), and 3,119 (8 h) cells for four samples.In the meantime, 3,422 single cells were also were generated by scATACseq.All raw data sets and cell-level processed data files of four samples of scRNA-seq and scATAC-seq are listed in Table 1.
This study provided the scRNA-seq and scATAC-seq data at single-cell resolution that could serve as valuable resources for the further understanding of the cell types and functions of PBMCs and their responses to LPS stimulation in vitro.

Limitations
The pooled blood sample was mixed from four 2-year-old Holstein female lactating cattle in a relatively small sample size, so it is restricted to represent the PBMCs of the whole population of Holstein female lactating cattle.