Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Practical Bioinformatics I

Name: Practical Bioinformatics I
Rating: 3.9 (82 reviews)

Biological Data Manipulation

Created byAhmed Karam, M.Sc.

Last updated 3/2022

English

What you'll learn

Sanger sequencing method and working with trace files
Common High-throughput sequencing platforms, methods, and applications
Common sequence, feature, alignment, and variation files
Extract, convert, split, merge, and/or remove sequences, features, alignments, and/or variations
Using more than 40 (currently 43) bioinformatics programs in addition to explaining their parameters
Using more than 20 (currently 21) bioinformatics programs without explaining their parameters
Different ways of using biological data in different researches (more than 30 papers are available)

Course content

9 sections • 56 lectures • 6h 27m total length

Brief introduction to DNA, RNA, and Protein7:02
Explore how dna encodes genetic information, transcription copies it to rna, and translation produces proteins, with promoters, exons, introns, enhancers, and CpG island regulation.
Introduction to the course

The Sanger Sequencing Method2:49
Explain sanger sequencing as a method validating next generation data, sequencing genes or plasmids, and show how dideoxynucleotides terminate synthesis with chromatograms from gel electrophoresis in abi and scf formats.
Technical Troubles and Causes5:49
Visualize Trace Files using Teal6:20
Visualize Trace Files using Teal (2)6:52
Edit Trace Files using MEGA11:24
Edit Trace Files using Finch TV7:41
Install finch tv, open ab1 or scf trace files, and explore sequences with vertical and horizontal scales; export to fasta or fastq for analysis and prepare segments for blast.
Sanger sequencing and trace editor software

Illumina and Ion Torrent Sequencing Platforms5:55
Pacific Biosciences and Nanopore sequencing platforms4:45
Use of High-Throughput Sequencing in Genomics and Epigenomics5:09
Explore how high-throughput sequencing enables genome and epigenomics studies, including SNPs, indels, and structural variants. Investigate regulatory elements, chromatin accessibility, and three-dimensional genome architecture with hi-c and chia-pet.
Use of High-Throughput Sequencing in Transcriptomics, Metagenomics, and Cancer6:06
Exploring High-Throughput Sequencing Methods2:01

FASTQ8:16
FASTA and GenBank7:53
Explore FASTA and GenBank formats, including unique identifiers, and extensions like fna, faa, and frn, plus locus information, definitions, and accessions.
GFF/GTF and BED6:29
Understand gff3 and gtf annotation formats, their nine-column structure, and key fields such as seqid, source, type, start, end, score, strand, phase, and attributes; also learn bed track basics.
SAM/BAM6:23
VCF/BCF6:45
Understand vcf and bcf formats and the vital vcf header, and explore annotation fields: chromosome, position, id, ref, alt, qual, filter, plus sample fields like gt, gq, dp, and af.

Extract Regions from a Sequence7:50
Learn to extract regions from a sequence using extractseq by specifying start and end, handle fasta and genbank formats, and manage overlap removal with blastn and output options.
Extract Subsequence from FASTA with Advanced Options5:50
Extract Sequence Window from FASTA6:36
Extract Chromosome(s) from Genome6:31
Extract Gene, mRNA, and CDS from Genome11:36
Extract Genomic Features with Advanced Options6:57
Explore advanced options to extract genomic features, including combining the cdss of a single gene into one sequence, translating cds to protein, and exporting in GenBank and NCBI FASTA formats.
Extract Genomic Features, Translations, and Summary Table6:52
Align Sequences using BLAST7:57
Extract Sequences using IDs or BED5:16
Extract Regions from a Sequence Alignment5:47
Extract Features from GFF/GTF8:18
Extracts Sequences from FASTA using BED/GFF/VCF file.5:18
Extract Flanking Intervals using BED/GFF/VCF file.9:15
Extract Complementing Intervals using BED/GFF/VCF file.10:44
Quality Assessment and Mapping of FASTQ files6:45
Filtering, Sorting and Summarizing of BAM file | Extract Mapped Reads in FASTA8:04
this lecture shows filtering bam files to keep mapped reads by removing unmapped reads with -q and -f flags, sorting by coordinate, and optionally converting to fastq.
Variant Calling and filtration7:27
Normalization of Called Variant and Extract Consensus Sequence10:37
Extract Fields from VCF or BCF Files6:36
Extract fields from vcf or bcf files with bcftools query, producing a tab-delimited, user-defined output. The lecture introduces the lyve-set phylogenomics pipeline for studying foodborne pathogens at the genomics level.

Remove Poly-A Tails from Sequences3:47
Learn how to remove poly-a tails from nucleotide sequences with trimest, including minimum tail length, allowed mismatches, and optional reverse-complement conversion for poly-t tails; see output headers reflect removals.
Remove Redundant Sequences7:05
Remove Vectors Parts from Sequence(s)5:18
Learn to remove adapter sequences from nucleotide reads with vector strip, adjusting mismatch tolerance and interpreting the output statistics that report removed regions.
Remove Adapters from FASTA/FASTQ8:40
Remove Low-Quality Bases from FASTA/FASTQ11:16
Subtract Overlapping Intervals from BED/GFF/VCF7:19

Requirements

Internet connection and a web browser.

Description

Learning many tools and manipulation methods helps you to accurately understand files, their components, and their outputs, thus broadening your horizons in bioinformatics applications in general.

Practical applications will be on nucleic acids

The programs used are popular, free, and online (majority) or software installed on all operating systems

The first part discusses the DNA Sanger Sequencing because of its great importance so far despite the old method, and we will talk in this part about:

Explanation of the Sanger Sequencing method, what files it produces, problems, and their causes.
A practical application to open and manipulate the Sanger Sequencing files using three different programs.

The second part discusses High-throughput Sequencing, which is the basis of recent nucleic acid analysis research, and we will talk in this part about:

Common High-throughput Sequencing methods are Illumina, Ion Torrent, Pacific Biosciences, and Oxford Nanopore.
Various applications of High-throughput Sequencing in different fields.

The third part discusses bioinformatics files, which are the raw material for biologists where sequences, alignments, variations, and annotations are stored. The files mainly found are:

FASTQ
FASTA
Genbank
GTF/GFF3
BED
SAM/BAM
BCF/VCF

Parts from the fourth to the last are practical applications on the aforementioned files using various programs from online packages such as:

Sequence Manipulation Suite 2.
EMBOSS.
Packages on the Galaxy platform:
1. Seqtk.
2. Bedtools.
3. Samtools.
4. Bcftools.

The course contains many programs, some of which are basic in manipulation and have been explained in detail, and other programs were found to help clearly understand the examples, and these were not explained in detail but were used to perform a specific function.

The quizzes are not yet complete.

Who this course is for:

Life science students

Practical Bioinformatics I

What you'll learn

Explore related topics

Course content

Brief Introduction to Biological Data1 lecture • 7min

Sanger Sequencing6 lectures • 41min

High-throughput Sequencing5 lectures • 24min

Files5 lectures • 36min

Extract19 lectures • 2hr 24min

Convert5 lectures • 32min

Split4 lectures • 21min

Merge5 lectures • 39min

Remove6 lectures • 43min

Requirements

Description

Who this course is for: