生物信息学与计算生物学


分类

现刊
0 Q&A 112 Views Mar 5, 2026

RNA-binding protein (RBP)–RNA interactions are fundamental for gene regulation and cellular homeostasis. Ataxin-2 is an RBP that has been shown to play an instrumental role in pathophysiological processes by binding to mRNA. Methods such as RNA immunoprecipitation (RIP), cross-linking immunoprecipitation (CLIP), and their variants can be used to study the interactions between Ataxin-2 and its targets, although their high sample requirements and labor-intensive workflows can limit their widespread use. RNA editing-based approaches, such as targets of RBPs identified by editing (TRIBE), provide effective alternatives. TRIBE enables transcriptome-wide identification of RBP targets by inducing site-specific adenosine-to-inosine (A-to-I) editing, which is subsequently detected through high-throughput RNA sequencing in both in vivo and in vitro systems. Compared to in vivo models, cell lines offer a rapid and flexible experimental design. Drosophila S2 cells are a commonly used insect cell line to investigate RNA–protein dynamics and serve as a versatile platform for studying RBP function. Here, we describe a protocol used for identifying RNA targets of Ataxin-2, a versatile RBP involved in post-transcriptional and translational regulation, in S2 cells using TRIBE. This method allows rapid, efficient, and reliable identification of Ataxin-2-associated RNA targets and can be readily applied to other RBPs.

0 Q&A 186 Views Mar 5, 2026

Evaluating single-domain antibody cooperativity is essential for developing potent, escape-resistant antiviral biologics. Here, we present a protocol that reproducibly quantifies functional synergy between neutralizing nanobody pairs in standardized viral infectivity assays. Controlled automated liquid handling prepares two-dimensional concentration matrices, minimizing pipetting variance and systematic error. Neutralization data are fitted using quantitative models that independently estimate potency, cooperativity, and efficacy to distinguish additive, synergistic, and antagonistic effects between nanobody pairs. Replicated measurements enable statistically interpretable parameter estimates, supporting robust evaluation of combinatorial nanobody therapeutics with commonly available equipment and open-source analysis tools. This framework is broadly applicable to assessing cooperative effects among other classes of binding or inhibitory molecules, facilitating systematic discovery of synergistic combinations.

0 Q&A 96 Views Mar 5, 2026

The morphology of single-neuron axonal projections is critical for deciphering neural circuitry and information flow in the brain. Yet, manually reconstructing these complex, long-range projections from high-throughput whole-brain imaging data remains an exceptionally labor-intensive and time-consuming task. Here, we developed a points assignment-based method for axonal reconstruction, named PointTree. PointTree enables the precise identification of the individual axons from densely packed axonal population using a minimal information flow tree model to suppress the snowball effect of reconstruction errors. In this protocol, we have elaborated on how to configure the required environment for PointTree software, prepare suitable data for it, and run the software. This protocol can assist neuroscience researchers in more easily and rapidly obtaining the reconstruction results of neuronal axons.

0 Q&A 160 Views Mar 5, 2026

Spatial proteomics enables the mapping of protein distribution within tissues, which is crucial for understanding cellular functions in their native context. While spatial transcriptomics has seen rapid advancement, spatial proteomics faces challenges due to protein non-amplifiability and mass spectrometry sensitivity limitations. This protocol describes a sparse sampling strategy for spatial proteomics (S4P) that combines multi-angle tissue strip microdissection with deep learning–based image reconstruction. The method achieves whole-tissue slice coverage with significantly reduced sampling requirements, enabling mapping of over 9,000 proteins in mouse brain tissue at 525 μm resolution within 200 h of mass spectrometry time. Key advantages include reduced sample processing time, deep proteome coverage, and applicability to centimeter-sized tissue samples.

0 Q&A 219 Views Mar 5, 2026

Organelle abundance is a key microscopic readout of organelle formation and, in many cases, function. Quantification of organelle abundance using confocal microscopy requires estimating their area based on the fluorescence intensity of compartment-specific markers. This analysis usually depends on a user-defined intensity threshold to distinguish organelle regions from the surrounding cytoplasm, which introduces potential bias and variability. To address this issue, we present a machine learning–assisted algorithm that allows for the quantification of organelle density using the open-source Fiji platform and WEKA segmentation. Our method enables the automated quantification of organelle number, area, and density by learning from training data. This standardizes threshold selection and minimizes user intervention. We demonstrate the utility of this approach for both membrane and non-membrane organelles, such as peroxisomes, lipid droplets, and stress granules, in human cells and whole fish samples.

往期刊物
0 Q&A 245 Views Feb 20, 2026

DNA epigenetic modifications play crucial roles in regulating gene expression and cellular function across diverse organisms. Among them, 5-glyceryl-methylcytosine (5gmC), a unique DNA modification first discovered in Chlamydomonas reinhardtii, represents a novel link between redox metabolism and epigenetic regulation. Accurate genome-wide detection of 5gmC is essential for investigating its biological functions, yet no streamlined method has been available. Here, we present deaminase-assisted sequencing (DEA-seq), a simple and robust approach for base-resolution mapping of 5gmC. DEA-seq employs a single DNA deaminase that efficiently converts unmodified cytosines (C) and 5-methylcytosine (5mC) into uracils or thymines, while leaving 5gmC intact. This selective resistance generates a clear sequence signature that enables precise identification of 5gmC sites across the genome. The method operates under mild reaction conditions and is compatible with low-input DNA, minimizing sample loss and improving detection sensitivity. Overall, DEA-seq provides an accessible, efficient, and highly accurate protocol for profiling 5gmC, offering clear advantages in workflow simplicity, DNA integrity, and analytical performance.

0 Q&A 669 Views Feb 20, 2026

Serial spatial omics technologies capture genome-wide gene expression patterns in thin tissue sections but lose spatial continuity along the third dimension. Reconstructing these two-dimensional measurements into coherent three-dimensional volumes is necessary to relate molecular domains, gradients, and tissue architecture within whole organs or embryos. sc3D is an open-source Python framework that registers consecutive spatial transcriptomic sections, interpolates bead coordinates in three dimensions, and stores the result in an AnnData object compatible with Scanpy. The workflow performs slice alignment, 3D reconstruction, optional downsampling, and interactive visualization in a napari-sc3D-viewer, enabling virtual in situ hybridization and spatial differential gene expression analysis. We tested sc3D on Slide-seq and Stereo-seq datasets, including E8.5 and E16.5 mouse embryos, recovering continuous tissue morphologies, cardiac anatomical markers, and the expected anterior–posterior gradients of Hox gene expression. These results show that sc3D allows reproducible reconstruction and analysis of volumetric spatial omics data across different samples and experimental platforms.

0 Q&A 336 Views Feb 20, 2026

The deep learning revolution has accelerated discovery in cell biology by allowing researchers to outsource their microscopy analyses to a new class of tools called cell segmentation models. The performance of these models, however, is often constrained by the limited availability of annotated data for them to train on. This limitation is a consequence of the time cost associated with annotating training data by hand. To address this bottleneck, we developed Cell-APP (cellular annotation and perception pipeline), a tool that automates the annotation of high-quality training data for transmitted-light (TL) cell segmentation. Cell-APP uses two inputs—paired TL and fluorescence images—and operates in two main steps. First, it extracts each cell’s location from the fluorescence images. Then, it provides these locations to the promptable deep learning model μSAM, which generates cell masks in the TL images. Users may also employ Cell-APP to classify each annotated cell; in this case, Cell-APP extracts user-specified, single-cell features from the fluorescence images, which can then be used for unsupervised classification. These annotations and optional classifications comprise training data for cell segmentation model development. Here, we provide a step-by-step protocol for using Cell-APP to annotate training data and train custom cell segmentation models. This protocol has been used to train deep learning models that simultaneously segment and assign cell-cycle labels to HeLa, U2OS, HT1080, and RPE-1 cells.

0 Q&A 243 Views Feb 5, 2026

Pinpointing causal genes for complex traits from genome-wide association studies (GWAS) remains a central challenge in crop genetics, particularly in species with extensive linkage disequilibrium (LD) such as rice. Here, we present CisTrans-ECAS, a computational protocol that overcomes this limitation by integrating population genomics and transcriptomics. The method’s core principle is the decomposition of gene expression into two distinct components: a cis-expression component (cis-EC), regulated by local genetic variants, and a trans-expression component (trans-EC), influenced by distal genetic factors. By testing the association of both components with a phenotype, CisTrans-ECAS establishes a dual-evidence framework that substantially improves the reliability of causal inference. This protocol details the complete workflow, demonstrating its power not only to identify causal genes at loci with weak GWAS signals but also to systematically reconstruct gene regulatory networks. It provides a robust and powerful tool for advancing crop functional genomics and molecular breeding.

0 Q&A 563 Views Jan 20, 2026

Expansion microscopy (ExM) is an innovative and cost-effective super-resolution imaging technique that enables nanoscale visualization of biological structures using conventional fluorescence microscopes. By physically enlarging biological specimens, ExM circumvents the diffraction limit and has become an indispensable tool in cell biology. Ongoing methodological advances have further enhanced its spatial resolution, labeling versatility, and compatibility with diverse sample types. However, ExM imaging is often hindered by sample drift during image acquisition, caused by subtle movements of the expanded hydrogel. This drift can distort three-dimensional reconstruction, compromising both visualization accuracy and quantitative analysis. To overcome this limitation, we developed 3D-Aligner, an advanced and user-friendly image analysis software that computationally corrects sample drift in fluorescence microscopy datasets, including but not limited to those acquired using ExM. The algorithm accurately determines drift trajectories across image stacks by detecting and matching stable background features, enabling nanometer-scale alignment to restore structural fidelity. We demonstrate that 3D-Aligner robustly corrects drift across ExM datasets with varying expansion factors and fluorescent labels. This protocol provides a comprehensive, step-by-step workflow for implementing drift correction in ExM datasets, ensuring reliable three-dimensional imaging and quantitative assessment.

0 Q&A 509 Views Jan 20, 2026

Accurate profiling of soil and root-associated bacterial communities is essential for understanding ecosystem functions and improving sustainable agricultural practices. Here, a comprehensive, modular workflow is presented for the analysis of full-length 16S rRNA gene amplicons generated with Oxford Nanopore long-read sequencing. The protocol integrates four standardized steps: (i) quality assessment and filtering of raw reads with NanoPlot and NanoFilt, (ii) removal of plant organelle contamination using a curated Viridiplantae Kraken2 database, (iii) species-level taxonomic assignment with Emu, and (iv) downstream ecological analyses, including rarefaction, diversity metrics, and functional inference. Leveraging high-performance computing resources, the workflow enables parallel processing of large datasets, rigorous contamination control, and reproducible execution across environments. The pipeline’s efficiency is demonstrated on full-length 16S rRNA gene datasets from yellow pea rhizosphere and root samples, with high post-filter read retention and high-resolution community profiles. Automated SLURM scripts and detailed documentation are provided in a public GitHub repository (https://github.com/henrimdias/emu-microbiome-HPC; release v1.0.2, emu-pipeline-revised) and archived on Zenodo (DOI: 10.5281/zenodo.17764933).

0 Q&A 830 Views Dec 20, 2025

Hair cells are the sensory receptors of the auditory and vestibular systems in the inner ears of all vertebrates. Hair cells also serve to detect water flow in the lateral line system in amphibians and fish. The zebrafish lateral line serves as a well-established model for investigating hair cell development and function, including research on genetic mutations associated with deafness and environmental factors that cause hair cell damage. Rheotaxis, the ability to orient and swim in response to water flow, is a behavior mediated by multiple sensory modalities, including the lateral line organ. In this protocol, we describe a rheotaxis assay in which station holding behavior, which employs positive rheotaxis to maintain position in oncoming water flow, serves as a sensitive measure of lateral line function in larval zebrafish. This assay provides a valuable tool for researchers assessing the functional consequences of genetic or environmental disruptions of the lateral line system.

0 Q&A 794 Views Dec 20, 2025

Plants move chloroplasts in response to light, changing the optical properties of leaves. Low irradiance induces chloroplast accumulation, while high irradiance triggers chloroplast avoidance. Chloroplast movements may be monitored through changes in leaf transmittance and reflectance, typically in red light. We present a step-by-step procedure for the detection of chloroplast positioning using reflectance hyperspectral imaging in white light. We show how to employ machine learning methods to classify leaves according to the chloroplast positioning. The convolutional network is a method of choice for the analysis of the reflectance spectra, as it allows low levels of misclassification. As a complementary approach, we propose a vegetation index, called the Chloroplast Movement Index (CMI), which is sensitive to chloroplast positioning. Our method offers a high-throughput, contactless way of chloroplast movement detection.

0 Q&A 817 Views Dec 20, 2025

The exploration of microbial genomes through next-generation sequencing (NGS) and genome mining has transformed the discovery of natural products, revealing an immense reservoir of previously untapped chemical diversity. Bacteria remain a prolific source of specialized metabolites with potential applications in medicine and biotechnology. Here, we present a protocol to access novel biosynthetic gene clusters (BGCs) that encode natural products from soil bacteria. The protocol uses a combination of Oxford Nanopore Technology (ONT) sequencing, de novo genome assembly, antiSMASH for BGC identification, and transformation-associated recombination (TAR) for cloning the BGCs. We used this protocol to allow the detection of large BGCs at a relatively fast and low-cost DNA sequencing. The protocol can be applied to diverse bacteria, provided that sufficient high-molecular-weight DNA can be obtained for long-read sequencing. Moreover, this protocol enables subsequent cloning of uncharacterized BGCs into a genome engineering-ready vector, illustrating the capabilities of this powerful and cost-effective strategy.

0 Q&A 676 Views Dec 20, 2025

Understanding how lipids interact with lipid transfer proteins (LTPs) is essential for uncovering their molecular mechanisms. Yet, many available LTP structures, particularly those thought to function as membrane bridges, lack detailed information on where their native lipid ligands are located. Computational strategies, such as docking or AI-methods, offer a valuable alternative to overcome this gap, but their effectiveness is often restricted by the inherent flexibility of lipid molecules and the lack of large training sets with structures of proteins bound to lipids. To tackle this issue, we introduce a reproducible computational pipeline that uses unbiased coarse-grained molecular dynamics (CG-MD) simulations on a free and open-source software (GROMACS) with the Martini 3 force-field. Starting from a configuration of a lipid in bulk solvent, we run CG-MD simulations and observe spontaneous binding of the lipid to the protein. We show that this protocol reliably identifies lipid-binding pockets in LTPs and, unlike docking methods, suggests potential entry routes for lipid molecules with no a priori knowledge other than the protein’s structure. We demonstrate the utility of this approach in investigating bridge LTPs whose internal lipid-binding positions remain unresolved. Altogether, our study provides a cost-effective, efficient, and accurate framework for mapping binding sites and entry pathways in diverse LTPs.