DNA Research Advance Access originally published online on January 11, 2006
DNA Research 2005 12(5):379-387; doi:10.1093/dnares/dsi014
© The Author 2006. Kazusa DNA Research Institute
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions@oxfordjournals.org
InCeP: Intracellular Pathway Based on mKIAA ProteinProtein Interactions
Masatoshi Murakami1,
Kiyo Shimada1,
Makoto Kawai1 and
Hisashi Koga1,2,*
1Chiba Industry Advancement Center 2-6 Nakase, Mihama-ku, Chiba 261-7126, Japan
2Kazusa DNA Research Institute 2-6-7 Kazusa-Kamatari, Kisarazu, Chiba 292-0818, Japan
Received 19 May 2005; revised 2 September 2005
 |
Abstract
|
|---|
Since December 2001, we have been conducting a project to isolate
and determine entire sequences of mouse KIAA cDNA clones, which
encode polypeptides corresponding to human KIAA proteins. The
ultimate goal of this project has been elucidation of the functions
of KIAA proteins. To address this issue, we have been generating
libraries of antibodies against mKIAA proteins.
We have, to date, already generated >800 antibodies. Using
our libraries of antibodies, we are now identifying
endogenous mKIAA proteinprotein interactions. In the
present study, novel interactions were identified by MS/MS analysis
following immunoprecipitation with anti-mKIAA antibodies. The
interactions with biologically known molecules should enable
us to predict the function of mKIAA/KIAA proteins, including
hypothetical proteins identified in our cDNA project. These
interactions are subsequently used for construction of an intracellular
pathway related to the mKIAA protein, and the pathway is distributed
through the InCeP (IntraCellular Pathway based on mKIAA proteinprotein
interactions) database. Users can freely access the InCeP through
the internet and download the graphical display as well as the
curated information.
Key words: InCeP; mKIAA; proteinprotein interactions; intracellular pathway
 |
1. Introduction
|
|---|
Accumulation of a large amount of biological evidence has led
to a need for systematization according to the cellular function
of each molecule. Recent progress in computational science provides
the scope for a possible solution to this issue, with database
archiving of cellular pathways already having begun. For example,
the Biomolecular Interaction Network Database (BIND) is a quickly
growing database designed to archive molecular interactions.1

3
The BIND has already recorded >170 000 interactions and the
resultant cellular pathways. In addition, the Kyoto Encyclopedia
of Genes and Genomes (KEGG) is a widely used bioinformatics
resource for understanding higher-order cell function.4

6
The KEGG includes genomic and chemical information as well as
pathway information. Furthermore, 166 pathway resources are
now available online (Pathway Resource List, a catalog of pathway
data resources,
http://www.cbio.mskcc.org/prl/index.php). Although
these databases have been based primarily on published experimental
research,
de novo molecular interactions are required not only
to expand the pathways, but also to curate the details.
With this in mind, we have attempted to develop a pathway resource based on an accumulation of information regarding de novo molecular interactions. We have especially focused on the KIAA genes that were identified in our human cDNA sequencing project and that were functionally unknown at the time they were sequenced.7
,8
Specific molecules that capture proteins such as antibodies have become strong tools in the identification of molecular interactions. We have therefore begun to generate libraries of antibodies against mouse counterparts of human KIAA proteins in order to overcome the legal and ethical restrictions on the use of human materials.9
,10
Using the libraries of antibodies, we have begun to identify the endogenous mKIAA proteinprotein interactions. Identified interactors are subsequently checked for their function and used to build the intracellular pathway involving the interaction. The inaugural version of the InCeP reveals only 18 intracellular pathways of mKIAA proteins, but further progress in our project promises to rapidly elucidate the function of many mKIAA proteins.
 |
2. Materials and Methods
|
|---|
2.1. Identification of a novel proteinprotein interaction
Approximately 0.4 g of each adult mouse tissue (ICR strain,
8 weeks) or cell lines derived from mouse tumor was homogenized
with CelLytic M (Sigma, St Louis, MO) containing 0.5% Protease
Inhibitor Cocktail (Sigma) and then subjected to immunoprecipitation
with anti-mKIAA antibody. The resulting precipitates were recovered
in 20 µl of 2
x SDS sample buffer containing of 0.2 M DTT
by boiling for 10 min. The supernatant was resolved by 8% 1D-SDSPAGE
and then subjected to imidazole-zinc reverse staining.11

,12
All excised bands were digested with 10 µg/ml trypsin
(Promega, Madison, WI) for 16 h. After dilution with 1% TFA,
the resulting peptide mixture was subjected to a high-pressure
liquid chromatography (HPLC) separation on a MAGIC 2002 (Michrom
BioResources, Auburn, CA). The peptides were first loaded onto
polymeric reverse-phase packing material (Peptide Captrap, Michrom
BioResources) for desalting and concentration, and then separated
onto a reverse-phase capillary HPLC column (C18, 200 A, 0.2
mm
x 50 mm, Michrom BioResources) with a flow rate of 5 µl/min.
As solvents, 2% v/v acetonitrile in 0.1% v/v formic acid (solvent
A) and 90% v/v acetonitrile in 0.1% v/v formic acid (solvent
B) were used with a linear gradient from 5 to 65% of solvent
B over 30 min. The chromatography system was coupled via a HTS-PAL
(CTC Analytics, Zwingen, Switzerland) to an ion trap mass spectrometer
LCQ (Thermo Finnigan, CA). The resulting MS/MS data were analyzed
using the Mascot search engine (Matrix Science, London, UK).
Proteins identified with a combined peptide score of >80
were considered significant, and lower-scoring proteins were
rejected.
2.2. Pathway analysis
Ingenuity Pathway Analysis software (Ingenuity Systems, Mountain View, CA, USA) is the world's largest curated database consisting of millions of individually modeled relationships between proteins, genes, complexes, cells, tissues, drugs and diseases. If researcher inputs a set of proteins into the Ingenuity Pathway Analysis, the software presents relevant pathways and diseases. Therefore, we first imported accession numbers of mKIAA and their interacting proteins identified as described in Section 2.1 into the Ingenuity Pathway Analysis. The software then computed a score for each pathway according to the fit of the imported set of proteins. The score is derived from a P-value and indicates the likelihood of the focus proteins in a pathway being found together owing to random chance. A score of 2 indicates that there is a 1 in 100 chance that the focus proteins are together in same pathway owing to random chance. Therefore, scores of 2 or higher have at least a 99% confidence of not being generated by random chance alone. Biological functions were then calculated and assigned to each pathway. Subsequently, accession number of each component for the most relevant cellular pathway statistically selected was exported to another pathway analysis tool, PathwayAssist (Ariadne Genomics, Inc., Rockville, MD). The information of each node and edge was manually curated and updated through computerized searches in PubMed (http://www.ncbi.nlm.nih.gov/entrez/query.fcgi). The curation of each record was performed by an MD or PhD-level scientist. Essential links to PubMed citations and our original data were performed on PathwayAssist, because the software is superior to Ingenuity Pathway Analysis in manual reconstruction of the graphical views.
2.3. Database construction
The InCeP database is written in Java. The Java Servlet and the JavaServer Pages (JSP) are implemented by the Apache Jakarta Tomcat Servlet/JSP container and communicate with the Oracle Database 10g (Relational Database Server) and the JDBC driver (the interface programs for the database). The recommended web client is Internet Explore 6.0 browser or higher.
 |
3. Results and Discussion
|
|---|
3.1. Identification of a novel mKIAA proteinprotein interaction
To identify novel mKIAA proteinprotein interactions,
we performed MS/MS analysis following immunoprecipitation with
anti-mKIAA antibodies. Expecting efficient identification, we
selected highly expressed tissues or cell lines based on the
information obtained from western blotting. Some of these data
are freely available through our InGaP database (Integrative
Gene and Protein expression database;
http://www.kazusa.or.jp/ingap2)
in which information about mKIAA gene/proteins consisting of
cDNA microarray analysis, subcellular localization of the ectopically
expressed gene and experimental data generated from use of the
anti-mKIAA antibody such as western blotting and immunohistochemical
analyses are recorded.13
Functional annotation of mKIAA-interactors provides information important in conceiving a functional hypothesis of mKIAA proteins based on the molecular network. Although the identified molecules may interact with corresponding mKIAA in an indirect manner, it is conceivable that at least these molecules function along the same cellular pathway. One example is the mKIAA1027 protein, also known as talin 1. Already distinct functional annotation had been obtained from numerous experimental studies. This protein plays a significant role in the cellcell and cellextracellular matrix adhesions through interactions with the intracellular domain of integrin ß.14
16
Although these previous efforts have clarified the functional importance of mKIAA1027/talin 1 in the transduction of integrin signal, we have identified a novel interaction of mKIAA1027/talin 1 supposing a completely different functional aspect. Promyelocytic leukemia (Gene Symbol in Fig. 1 is PML) was found in immunoprecipitant with anti-mKIAA1027 antibody (Fig. 2). Pml was originally identified as fusion protein with retinoic acid receptor
, which causes acute promyelocytic leukemia.17
Although the tumor-suppressive role of PML has been attributed to its ability to regulate the transcriptional function of nuclear tumor suppressors such as p53 and Rb,18
several cytoplasmic PML (cPML) isoforms of unknown function have also been described.19
,20
Most recently, Lin et al.21
have identified a novel function of cPML as an essential modulator of TGF-ß signaling. Likewise, our finding suggests that cPML might modulate integrin ß-mKIAA1027/talin 1 signaling.

View larger version (84K):
[in this window]
[in a new window]
|
Schematic representation of the pathway including mKIAA1027. The proteins are essentially represented by ellipses. A modification of the shape refers to a protein's functions (e.g. an eclipsed shape represents protein kinase). The proteins are also displayed with several different colors. Dark blue indicates the targeted mKIAA protein. Light blue indicates identified mKIAA-interactors. Red indicates other components of the cellular pathway selected by Ingenuity Pathway Analysis. The interactions and regulations among the proteins are illustrated by the different lines of connection. The detail explanation of these differences is recorded on Search Results of InCeP (Intracellular pathway based on mKIAA proteinprotein interactions) database (http://www.kazusa.or.jp/create/). These data will be freely available through our InCeP database. Each component of the pathway is represented by the gene symbol (e.g. Promyelocytic leukemia, PML).
|
|

View larger version (101K):
[in this window]
[in a new window]
|
Identification of endogenous mKIAA protein and its interactor. (A) Approximately 60 mg of the kidney lysate was subjected to immunoprecipitation with antibody against mKIAA1027, and the resulting precipitates were resolved in 8% SDSPAGE. The arrowheads indicate the positions of the bands that were excised for digestion by trypsin (gray and black indicate endogenous mKIAA protein and its interactor, respectively). All picked bands were identified by subsequent LC-MS/MS analysis and the resulting MS/MS dataset was analyzed using the Mascot search engine. IgG indicates the position of the rabbit immunoglobulin heavy chain. (B) The sequences of each protein are depicted in the single-letter code. The sequence stretches that are covered by peptide ion signals are shown in bold. Percent coverages of mKIAA1027 and PML are 5 and 7%, and the Mowse scores are 308 and 268, respectively.
|
|
Non-specifically binding proteins to sepharose column and antibodies
reduce the reliability of identified interactors. Considering
this point, we have also determined all non-specifically binding
proteins in each tissue or cell line by MS/MS analysis (for
instance, Ogdh protein and transitional endoplasmic reticulum
ATPase are observed in most of the tissues). If we detect these
proteins in the excised band which is assumed to be mKIAA-interactor,
these proteins are excluded from further construction of corresponding
mKIAA-pathway. The reproducibility of these interactions should
also be assessed. To strengthen the reliability, we performed
at least two independent experiments and confirmed the interactions
(mKIAA0035, mKIAA0675, mKIAA0994 and mKIAA1465) at initial stage
of our study.
3.2. Data representation and usage of the InCeP database
The inaugural version of the InCeP database provides users 18 intracellular pathways, including endogenous mKIAA protein interactions, in HTML format (Table 1). This information has been manually curated from selected publications. The InCeP database is accessible through the CREATE portal (http://www.kazusa.or.jp/create/) (Fig. 3A). Clicking on InCeP, the InCeP home page will open. Similar to this operation, users can easily access to the InGaP database. To locate pathways, a user can simply search using the mKIAA number, the accession number or the clone name. Alternatively, a user can find a pathway related to a particular disease or tissue (Fig. 3B). On the summary page, an overview of each pathway is represented by edges (protein, protein complex, small molecule and phenomena) and nodes (several kinds of interactions) (Fig. 3C). Clicking on each edge or node, a user can directly obtain a variety of supplementary data such as detailed information for the molecules and a PubMed citation for the interaction (Fig. 3D). mKIAA protein interactions developed from in-house assays are also displayed by simple clicking of the corresponding node. At this time, the node links to the mKIAA knowledge page, which contains all information about the mKIAA gene and protein on InGaP database.

View larger version (76K):
[in this window]
[in a new window]
|
Actual records of the InCeP database. (A) CREATE portal page (http://www.kazusa.or.jp/create/). Clicking on InCeP, users can access the top page of the InCepP database. (B) The top page of the InCepP database. To locate pathways, a user can simply search using the mKIAA number, the accession number, or the clone name. (C) One view of the summary page showing a pathway related to mKIAA0202. The pathway is represented by edges (protein, protein complex, small molecule and phenomena) and nodes (several kinds of interactions). (D) An example of hyperlinked information about each node. Clicking on a node, a user can directly obtain a variety of supplementary data. In this example, the page represents an abstract of the PubMed citation experimentally verifying the interaction. All of the records have been manually curated by an MD or PhD-level scientist.
|
|
 |
4. Concluding Remarks
|
|---|
We have previously placed a strong emphasis on integration of
several kinds of biological knowledge.13

With respect to this
issue, we have launched the InGaP database, a comprehensive
database of gene/protein expression profiles using cDNA microarray
and anti-mKIAA antibodies. The InCeP is defined as a sister
database of the InGaP; concomitant use of these databases may
therefore accelerate our appreciation of the function of mKIAA/KIAA
proteins. Furthermore, a bidirectional accumulation of information
is the expected next step of InCeP development. We are currently
developing a graphical analysis tool using the SVG format that
will enable us to supplement user's data on the internet through
our resources. The user's data as well as the records of our
novel interactions would facilitate consolidation of the intracellular
pathway involving mKIAA/KIAA proteins. We expect that our strategy
will be an InCePtive approach to discovering molecular
mechanisms related to other hypothetical proteins.
 |
Acknowledgements
|
|---|
We are grateful to all the staff who have worked on our project.
This study was supported by the CREATE Program (Collaboration
of Regional Entities for the Advancement of Technological Excellence)
from the JST (Japan Science and Technology Corporation).
 |
Footnotes
|
|---|
*To whom correspondence should be addressed. Tel. +81-438-52-3919, Fax. +81-438-52-3918, Email:
hkoga{at}kazusa.or.jp
Communicated by Michio Oishi
 |
References
|
|---|
- Alfarano, C., Andrade, C. E., Anthony, K., et al. 2005, The Biomolecular interaction network database and related tools 2005 update, Nucleic Acids Res., 33, D418D424.[Abstract/Free Full Text]
- Bader, G. D. and Hogue, C. W. 2000, BINDa data specification for storing and describing biomolecular interactions, molecular complexes and pathways, Bioinformatics, 16, 465477.[Abstract/Free Full Text]
- Bader, G. D., Betel, D., Hogue, C. W. 2003, BIND: the biomolecular interaction network database, Nucleic Acids Res., 31, 248250.[Abstract/Free Full Text]
- Kanehisa, M. and Goto, S. 2000, KEGG: kyoto encyclopedia of genes and genomes, Nucleic Acids Res., 28, 2730.[Abstract/Free Full Text]
- Kanehisa, M., Goto, S., Kawashima, S., Nakaya, A. 2002, The KEGG databases at GenomeNet, Nucleic Acids Res., 30, 4246.[Abstract/Free Full Text]
- Kanehisa, M., Goto, S., Kawashima, S., Okuno, Y., Hattori, M. 2004, The KEGG resource for deciphering the genome, Nucleic Acids Res., 32, D277D280.[Abstract/Free Full Text]
- Nakajima, D., Okazaki, N., Yamakawa, H., Kikuno, R., Ohara, O., Nagase, T. 2002, Construction of expression-ready cDNA clones for KIAA genes: manual curation of 330 KIAA cDNA clones, DNA Res., 9, 99106.[Abstract]
- Nomura, N., Miyajima, N., Sazuka, T., et al. 1994, Prediction of the coding sequences of unidentified human genes. I. The coding sequences of 40 new genes (KIAA0001-KIAA0040) deduced by analysis of randomly sampled cDNA clones from human immature myeloid cell line KG-1, DNA Res., 1, 2735.[Abstract/Free Full Text]
- Hara, Y., Shimada, K., Kohga, H., Ohara, O., Koga, H. 2003, High-throughput production of recombinant antigens for mouse KIAA proteins in Escherichia coli: computational allocation of possible antigenic regions, and construction of expression plasmids of glutathione-S-transferase-fused antigens by an in vitro recombination-assisted method, DNA Res., 10, 129136.[Abstract]
- Koga, H., Shimada, K., Hara, Y., et al. 2004, A comprehensive approach for establishment of the platform to analyze functions of KIAA proteins: generation and evaluation of anti-mKIAA antibodies, Proteomics, 4, 14121416.[CrossRef][ISI][Medline]
- Fernandez-Patron, C., Madrazo, J., Hardy, E., Mendez, E., Frank, R., Castellanos-Serra, L. 1995, Single-step electrotransfer of reverse-stained proteins from sodium dodecyl sulfate-polyacrylamide gel onto reversed-phase minicartridge and subsequent desalting and elution with a conventional high-performance liquid chromatography gradient system for analysis, Electrophoresis, 16, 911920.[Medline]
- Fernandez-Patron, C., Hardy, E., Sosa, A., Seoane, J., Castellanos, L. 1995, Double staining of coomassie blue-stained polyacrylamide gels by imidazole-sodium dodecyl sulfate-zinc reverse staining: sensitive detection of coomassie blue-undetected proteins, Anal. Biochem., 224, 263269.[CrossRef][Medline]
- Koga, H., Yuasa, S., Nagase, T., et al. 2004, A comprehensive approach for establishment of the platform to analyze functions of KIAA proteins II: public release of inaugural version of InGaP database containing gene/protein expression profiles for 127 mouse KIAA genes/proteins, DNA Res., 11, 293304.[Abstract]
- Campbell, I. D. and Ginsberg, M. H. 2004, The talin-tail interaction places integrin activation on FERM ground, Trends Biochem. Sci., 29, 429435.[CrossRef][Medline]
- Calderwood, D. A. 2004, Integrin activation, J. Cell Sci., 117, 657666.[Abstract/Free Full Text]
- Nayal, A., Webb, D. J., Horwitz, A. F. 2004, Talin: an emerging focal point of adhesion dynamics, Curr. Opin. Cell Biol., 16, 9498.[CrossRef][Medline]
- Goddard, A. D., Yuan, J. Q., Fairbairn, L., et al. 1995, Cloning of the murine homolog of the leukemia-associated PML gene, Mamm. Genome, 6, 732737.[CrossRef][Medline]
- Salomoni, P. and Pandolfi, P. P. 2002, The role of PML in tumor suppression, Cell, 108, 165170.[CrossRef][ISI][Medline]
- Fagioli, M., Alcalay, M., Pandolfi, P. P., et al. 1992, Alternative splicing of PML transcripts predicts coexpression of several carboxy-terminally different protein isoforms, Oncogene, 7, 10831091.[ISI][Medline]
- Jensen, K., Shiels, C., Freemont, P. S. 2001, PML protein isoforms and the RBCC/TRIM motif, Oncogene, 20, 72237233.[CrossRef][ISI][Medline]
- Lin, H. K., Bergmann, S., Pandolfi, P. P. 2004, Cytoplasmic PML function in TGF-beta signalling, Nature, 431, 205211.[CrossRef][Medline]

CiteULike
Connotea
Del.icio.us What's this?