groupTranscripts_dbscan — groupTranscripts

group the flagged transcript within each cell based on spatial clustering using dbscan

Usage

groupTranscripts_dbscan(
  chosen_transcripts = NULL,
  distance_cutoff = "auto",
  transcript_df,
  cellID_coln = "CellId",
  transID_coln = "transcript_id",
  transSpatLocs_coln = c("x", "y", "z")
)

Arguments

chosen_transcripts: the transcript_id of chosen transcript
distance_cutoff: maximum molecule-to-molecule distance within same transcript group (default = "auto")
transcript_df: the data.frame with transcript_id, target/geneName, x, y and cell_id
cellID_coln: the column name of cell_ID in transcript_df
transID_coln: the column name of transcript_ID in transcript_df
transSpatLocs_coln: the column name of 1st, 2nd, optional 3rd spatial dimension of each transcript in transcript_df

Value

data frame of connected transcripts among chosen_transcripts #'

cellID_coln, orignal cell_ID
transID_coln, connected transcripts among chosen_transcripts
transSpatLocs_coln, spatial coordinates of transcript
transcript_group, group of chosen_transcripts

Details

For query cell, group flagged transcripts only based on their molecular distance to each other. When distance cutoff = 'auto', use 20% average XY cell range as cutoff. In case of no more than 3 flagged transcripts per cell, determine the grouping based on distance cutoff directly. In case of more transcripts per cell, use dbscan to group transcripts with distance_cutoff as eps and minPts = 1.