BioPython Tutorial for Bioinformatics | Calculate GC content of sequences in a multi-fasta file

Опубликовано: 10 Май 2021
на канале: Bioinformatics Coach
1,876
23

This tutorial shows you how to read a fasta file and calculate the GC content of sequences using the python bioinformatics package, biopython.

Get more Python Tutorials on Patreon:   / bigdataanalytics  

Thank me with some coffee
https://www.buymeacoffee.com/informat...

Consultation (Video Conferencing): https://calendly.com/bioinformaticscoach
Teaching (Video Conferencing): https://calendly.com/bioinformaticscoach
Consultation(Audio Call): https://clarity.fm/vincentappiah

Support my work
https://www.buymeacoffee.com/informat...
https://www.paypal.com/paypalme/thein...
  / bigdataanalytics  

Subscribe to my channels
Bioinformatics:    / @bioinformaticscoach  
Data Science:    / @datasciencecoach  
Short Clips:    / @bioinformaticscforbeginners  

Reach out
[email protected]






Materials
________________________________________________________________________________________________

Download codes and dataset using this link
https://github.com/vappiah/Python-Bio...

How to read fasta files using python
   • Read FASTA files with Python for Bioi...  
   • Read FASTA files with Python for Bioi...  

Video on how to install python libraries
   • Install python packages using pip  
   • Installing python packages using pip ...  

Video on how to install Anaconda in Linux
   • Install , Configure and Run Anaconda ...  

Manual calculation of GC Content
   • Bioinformatics for beginners | Course...  

Video on string indexing
   • Python for Bioinformatics #4: Extract...  



Chapters
00:11 Introduction
00:35 Download Data
01:27 Import libraries
02:21 Set file path
03:05 Read fasta file
04:09 Extract and analyze sequences
08:52 Analyze sequences and save result to pandas dataframe
12:58 Create Dataframe using pandas
15:00 Save dataframe to output file

Commands used

from Bio import SeqIO
SeqIO.parse
Bio.SeqUtils
GC



#Bioinformatics #Python #DataScience


Смотрите видео BioPython Tutorial for Bioinformatics | Calculate GC content of sequences in a multi-fasta file онлайн без регистрации, длительностью часов минут секунд в хорошем качестве. Это видео добавил пользователь Bioinformatics Coach 10 Май 2021, не забудьте поделиться им ссылкой с друзьями и знакомыми, на нашем сайте его посмотрели 1,876 раз и оно понравилось 23 людям.