Reading Multiple GenBank Files and Count feature | Python for Bioinformatics

Опубликовано: 05 Август 2021
на канале: Bioinformatics Coach
1,424
14

#bioinformatics #datascience #python

This tutorial shows you how to read a genbank file and count the number of sequence features using the python bioinformatics package, biopython.


Get more Python Tutorials on Patreon:   / bigdataanalytics  

Thank me with some coffee
https://www.buymeacoffee.com/informat...

Consultation (Video Conferencing): https://calendly.com/bioinformaticscoach
Teaching (Video Conferencing): https://calendly.com/bioinformaticscoach
Consultation(Audio Call): https://clarity.fm/vincentappiah

Support my work
https://www.buymeacoffee.com/informat...
https://www.paypal.com/paypalme/thein...
  / bigdataanalytics  

Subscribe to my channels
Bioinformatics:    / @bioinformaticscoach  
Data Science:    / @datasciencecoach  
Short Clips:    / @bioinformaticscforbeginners  

Reach out
[email protected]




Course Materials
________________________________________________________________________________________________

Commands used

from Bio import SeqIO
SeqIO.parse
SeqIO.read
SeqIO.write
Biopython Tutorial


Download the genbank files using the links here
M48 https://www.ncbi.nlm.nih.gov/nuccore/...
P10 https://www.ncbi.nlm.nih.gov/nuccore/...
AR465 https://www.ncbi.nlm.nih.gov/nuccore/...
R50 https://www.ncbi.nlm.nih.gov/nuccore/...
V521 https://www.ncbi.nlm.nih.gov/nuccore/...

Notebook can be found here
https://github.com/vappiah/Python-Bio...

Install Python libraries using pip
   • Install python packages using pip  
   • Installing python packages using pip ...  

Install python libraries using anaconda
   • How I Install python packages on ANY ...  

How to use Functions to analyze multiple sequences
   • Python for Bioinformatics | Beginners...  

Count features in multiple genbank files- Part-1
   • How to count number of sequence featu...  

How to count features in a single genbank file
   • Reading GenBank Files and Count featu...  



Description of genbank files : https://www.ncbi.nlm.nih.gov/Sitemap/...

Chapters
00:00 introduction and motivation for this video
01:15 summary of the exercises to be done
01:41 PC requirement
02:10 Exercise begins
02:14 download the genbank files
04:20 import python libraries
04:57 set the path of the directory containing the genbank files
05:35 get the filenames of the downloaded genbank data
06:53 count genbank features
07:18 create functions to count features
09:47 get all features in the genbank files
11:09 create a list to store the feature counts
11:33 using 'for' loop to count the features in all the genbank files
14:07 examine the result
16:00 create a dataframe to store the counts
20:50 remove unwanted columns
23:04 making queries / extract information from a pandas dataframe
24:43 save dataframe to an output file
26:19 open the output file


BioPython Tutorial for Bioinformatics - Count features in multiple genbank files | Part 2

Support my work
https://www.buymeacoffee.com/informat...
https://paypal.me/theinformatician

Join the Patreon Community
  / bigdataanalytics  

Book a Session
https://clarity.fm/vincentappiah
Reach out
[email protected]


Смотрите видео Reading Multiple GenBank Files and Count feature | Python for Bioinformatics онлайн без регистрации, длительностью часов минут секунд в хорошем качестве. Это видео добавил пользователь Bioinformatics Coach 05 Август 2021, не забудьте поделиться им ссылкой с друзьями и знакомыми, на нашем сайте его посмотрели 1,42 раз и оно понравилось 1 людям.