Skip to main content
Pearson+ LogoPearson+ Logo
Ch. 16 - Genomics: Genetics from a Whole-Genome Perspective
Sanders - Genetic Analysis: An Integrated Approach 3rd Edition
Sanders3rd EditionGenetic Analysis: An Integrated ApproachISBN: 9780135564172Not the one you use?Change textbook
Chapter 16, Problem 14

Go to http://blast.ncbi.nlm.nih.gov/Blast.cgi and follow the links to nucleotide BLAST. Type in the sequence below; it is broken up into codons to make it easier to copy.
5' ATG TTC GTC AAT CAG CAC CTT TGT GGT TCT CAC CTC GTT GAA GCTTTG TAC CTT GTT TGC GGT GAA CGT GGT TTC TTC TAC ACT CCT AAG ACT TAA 3'
As you will note on the BLAST page, there are several options for tailoring your query to obtain the most relevant information. Some are related to which sequences to search in the database. For example, the search can be limited taxonomically (e.g., restricted to mammals) or by the type of sequences in the database (e.g., cDNA or genomic). For our search, we will use the broadest database, the 'Nucleotide collection (nr/nt).' This is the nonredundant (nr) database of all nucleotide data (nt) in GenBank and can be selected in the 'Database' dialogue box. Other parameters can also be adjusted to make the search more or less sensitive to mismatches or gaps. For our purposes, we will use the default setting, which is automatically presented. Press 'BLAST' to search. What can you say about the DNA sequence?

Verified step by step guidance
1
Step 1: Understand the purpose of the BLAST search. BLAST (Basic Local Alignment Search Tool) compares your input DNA sequence against a large database of known sequences to find regions of similarity, which can help identify the gene or organism your sequence might come from.
Step 2: Prepare the sequence for input. The sequence is given in codons (groups of three nucleotides) from 5' to 3'. When entering it into the BLAST search box, remove spaces and line breaks so the sequence is continuous, for example: 5'-ATGTTCGTCAATCAGCACCTTTGTGGTCTCACC...-3'.
Step 3: Select the appropriate database. Choose the 'Nucleotide collection (nr/nt)' database, which is a comprehensive, nonredundant collection of nucleotide sequences from GenBank. This ensures your query is compared against a broad range of sequences.
Step 4: Use default BLAST parameters. The default settings balance sensitivity and speed, allowing for mismatches and gaps to find the best alignments without being too restrictive or too permissive.
Step 5: Interpret the BLAST results. After running the search, examine the top hits to see which known sequences closely match your query. Look at the organism names, gene annotations, and alignment scores to infer the identity or function of your DNA sequence.

Verified video answer for a similar problem:

This video solution was recommended by our tutors as helpful for the problem above.
Video duration:
2m
Was this helpful?

Key Concepts

Here are the essential concepts you must grasp in order to answer the question correctly.

BLAST (Basic Local Alignment Search Tool)

BLAST is a bioinformatics tool used to compare a query DNA or protein sequence against a database to find regions of similarity. It helps identify homologous sequences, infer functional and evolutionary relationships, and annotate genes. The nucleotide BLAST specifically compares DNA sequences to nucleotide databases.
Recommended video:
Guided course
08:28
Bioinformatics

Nucleotide Sequence Databases and Search Parameters

Nucleotide databases like GenBank contain vast collections of DNA sequences from various organisms. Selecting the appropriate database and search parameters (e.g., taxonomic limits, sequence type) affects the relevance and sensitivity of BLAST results, enabling targeted or broad searches depending on research goals.
Recommended video:
Guided course
04:12
Sanger Sequencing

Interpreting BLAST Results

BLAST results provide alignments showing sequence similarity, including scores, E-values, and identities. Understanding these metrics helps determine if the query sequence matches known genes or organisms, revealing potential function or origin. High similarity and low E-values indicate significant matches.
Recommended video:
Guided course
08:28
Bioinformatics
Related Practice
Textbook Question

When comparing genes from two sequenced genomes, how does one determine whether two genes are orthologous? What pitfalls arise when one or both of the genomes are not sequenced?

525
views
Textbook Question

What is a reference genome? How can it be used to survey genetic variation within a species?

611
views
Textbook Question

The two-hybrid method facilitates the discovery of protein–protein interactions. How does this technique work? Can you think of reasons for obtaining a false-positive result, that is, where the proteins encoded by two clones interact in the two-hybrid system but do not interact in the organism in which they naturally occur? Can you think of reasons you might obtain a false-negative result, in which the two proteins interact in vivo but fail to interact in the two-hybrid system?

404
views
Textbook Question

In the course of the Drosophila melanogaster genome project, the following genomic DNA sequences were obtained. Try to assemble the sequences into a single contig.

5' TTCCAGAACCGGCGAATGAAGCTGAAGAAG 3'

5' GAGCGGCAGATCAAGATCTGGTTCCAGAAC 3'

5' TGATCTGCCGCTCCGTCAGGCATAGCGCGT 3'

5' GGAGAATCGAGATGGCGCACGCGCTATGCC 3'

5' GGAGAATCGAGATGGCGCACGCGCTATGCC 3'

5' CCATCTCGATTCTCCGTCTGCGGGTCAGAT 3'

Go to the URL provided in Problem 14, and using the sequence you have just assembled, perform a blastn search in the 'Nucleotide collection (nr/nt)' database. Does the search produce sequences similar to your assembled sequence, and if so, what are they? Can you tell if your sequence is transcribed, and if it represents protein-coding sequence? Perform a tblastx search, first choosing the 'Nucleotide collection (nr/nt)' database and then limiting the search to human sequences by typing Homo sapiens in the organism box. Are homologous sequences found in the human genome? Annotate the assembled sequence.

486
views
Textbook Question

Consider the phylogenetic trees below pertaining to three related species (A, B, and C) that share a common ancestor (last common ancestor, or LCA). The lineage leading to species A diverges before the divergence of species B and C.

For gene X, no gene duplications have occurred in any lineage, and each gene X is derived from the ancestral gene X via speciation events. Are genes AX, BX, and CX orthologous, paralogous, or homologous?

388
views
Textbook Question

Consider the phylogenetic trees below pertaining to three related species (A, B, and C) that share a common ancestor (last common ancestor, or LCA). The lineage leading to species A diverges before the divergence of species B and C.

For gene Y, a gene duplication occurred in the lineage leading to A after it diverged from that, leading to B and C. Are genes AY1 and AY2 orthologous or paralogous? Are genes AY1 and BY orthologous or paralogous? Are genes BY and CY orthologous or paralogous?

436
views