Jump to content
Main menu
Main menu
move to sidebar
hide
Navigation
Main page
Recent changes
Random page
Help about MediaWiki
Special pages
Niidae Wiki
Search
Search
Appearance
Create account
Log in
Personal tools
Create account
Log in
Pages for logged out editors
learn more
Contributions
Talk
Editing
CiteSeerX
(section)
Page
Discussion
English
Read
Edit
View history
Tools
Tools
move to sidebar
hide
Actions
Read
Edit
View history
General
What links here
Related changes
Page information
Appearance
move to sidebar
hide
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
== History == === CiteSeer and CiteSeer.IST === CiteSeer was created by researchers [[Lee Giles]], [[Kurt Bollacker]] and [[Steve Lawrence (computer scientist)|Steve Lawrence]] in 1997 while they were at the [[NEC Research Institute]] (now [[NEC Laboratories America|NEC Labs]]), [[Princeton, New Jersey]], US. CiteSeer's goal was to actively crawl and harvest academic and scientific documents on the web and use autonomous [[citation index]]ing to permit querying by citation or by document, ranking them by [[citation impact]]. At one point, it was called ResearchIndex. CiteSeer became public in 1998 and had many new features unavailable in academic search engines at that time. These included: * Autonomous Citation Indexing automatically created a citation index that can be used for literature search and evaluation. * Citation statistics and related documents were computed for all articles cited in the database, not just the indexed articles. * Reference linking, allowing browsing of the database using citation links. * Citation context showed the context of citations to a given paper, allowing a researcher to quickly and easily see what other researchers have to say about an article of interest. * Related documents were shown using citation and word based measures, and an active and continuously updated bibliography is shown for each document. CiteSeer was granted a United States [[patent]] # 6289342, titled "''Autonomous citation indexing and literature browsing using citation context''", on September 11, 2001. The patent was filed on May 20, 1998, and has priority to January 5, 1998. A continuation patent (US Patent # 6738780) was filed on May 16, 2001, and granted on May 18, 2004.{{Citation needed|date=April 2023}} After NEC, in 2004 it was hosted as CiteSeer.IST on the [[World Wide Web]] at the College of Information Sciences and Technology, The [[Pennsylvania State University]], and had over 700,000 documents. For enhanced access, performance and research, similar versions of CiteSeer were supported at universities such as the [[Massachusetts Institute of Technology]], [[University of Zürich]] and the [[National University of Singapore]]. However, these versions of CiteSeer proved difficult to maintain and are no longer available. Because CiteSeer only indexes freely available papers on the web and does not have access to publisher metadata, it returns fewer citation counts than sites, such as [[Google Scholar]], that have publisher metadata. <!-- for historical reference: Versions of CiteSeer have been or are available at the following links: * [https://citeseer.ittc.ku.edu/ Univ. of Kansas] * [https://citeseer.csail.mit.edu/ MIT] * [https://sherry.ifi.unizh.ch/ Univ. of Zürich] * [https://citeseer.comp.nus.edu.sg/cs National Univ. of Singapore] --> CiteSeer had not been comprehensively updated since 2005 due to limitations in its architecture design. It had a representative sampling of research documents in computer and information science but was limited in coverage because it was limited to papers that are publicly available, usually at an author's homepage, or those submitted by an author. To overcome some of these limitations, a modular and open source architecture for CiteSeer was designed – CiteSeer<sup>X</sup>. === CiteSeer<sup>X</sup> === '''CiteSeer<sup>X</sup>''' replaced CiteSeer and all queries to CiteSeer were redirected. CiteSeer<sup>X</sup><ref name="about-page">{{Cite web |title=About CiteSeerX |access-date=2010-05-07 |url=https://citeseerx.ist.psu.edu/about/site |archive-date=2010-07-22 |archive-url=https://web.archive.org/web/20100722162718/https://citeseerx.ist.psu.edu/about/site |url-status=live }}</ref> is a public [[search engine]] and [[digital library]] and [[Disciplinary repository|repository]] for scientific and academic papers, primarily with a focus on [[computer science|computer]] and [[information science]].<ref name="about-page" /> However, recently CiteSeer<sup>X</sup> has been expanding into other scholarly domains such as economics, physics and others. Released in 2008, it was loosely based on the previous CiteSeer search engine and digital library and is built with a new [[Open-source software|open source]] infrastructure, SeerSuite, and new algorithms and their implementations. It was developed by researchers Isaac Councill and C. [[Lee Giles]] at [[Penn State College of Information Sciences and Technology|the College of Information Sciences and Technology]], [[Pennsylvania State University]]. It continues to support the goals outlined by CiteSeer to actively crawl and harvest academic and scientific documents on the public web and to use a citation inquiry by citations and ranking of documents by the impact of citations. Currently, Lee Giles, Prasenjit Mitra, Susan Gauch, Min-Yen Kan, Pradeep Teregowda, Juan Pablo Fernández Ramírez, Pucktada Treeratpituk, Jian Wu, Douglas Jordan, Steve Carman, Jack Carroll, Jim Jansen, and Shuyi Zheng are or have been actively involved in its development. Recently, a table search feature was introduced.<ref>{{Cite web |title=The CiteSeerX Team |url=https://csxstatic.ist.psu.edu:80/about/team |archive-url=https://web.archive.org/web/20180726034438/https://csxstatic.ist.psu.edu/about/team |archive-date=2018-07-26 |publisher=Pennsylvania State University |df=mdy |access-date=2018-05-01 }}</ref> It has been funded by the [[National Science Foundation]], [[NASA]], and [[Microsoft Research]]. CiteSeer<sup>X</sup> continues to be rated as one of the world's top repositories, and was rated number 1 in July 2010.<ref>{{Cite web |title=Ranking Web of World Repositories: Top 800 Repositories |publisher=Cybermetrics Lab |date=July 2010 |url=https://repositories.webometrics.info/top800_rep.asp |access-date=2010-07-24 |archive-url=https://web.archive.org/web/20100724004342/https://repositories.webometrics.info/top800_rep.asp |archive-date=2010-07-24 |df=mdy}}</ref> It currently has over 6 million documents with nearly 6 million unique authors and 120 million citations.{{clarify timeframe|date = June 2023}} CiteSeer<sup>X</sup> also shares its software, data, databases and metadata with other researchers, currently by [[Amazon S3]] and by [[rsync]].<ref>{{Cite web |title=About CiteSeerX Data |publisher=Pennsylvania State University |access-date=2012-01-25 |url=https://csxstatic.ist.psu.edu/about/data |archive-url=https://web.archive.org/web/20120105193216/https://csxstatic.ist.psu.edu/about/data |archive-date=2012-01-05 }}</ref> Its new modular open source architecture and software (available previously on [[SourceForge]] but now on [[GitHub]]) is built on [[Apache Solr]] and other [[Apache Software Foundation|Apache]] and open source tools, which allows it to be a testbed for new algorithms in document harvesting, ranking, indexing, and information extraction. CiteSeer<sup>X</sup> caches some PDF files that it has scanned. As such, each page includes a [[DMCA]] link which can be used to report copyright violations.<ref>For example, {{Cite web |citeseerx=10.1.1.604.4916 |quote="The document with the identifier "10.1.1.604.4916" has been removed due to a DMCA takedown notice. If you believe the removal has been in error, please contact us through the feedback page, along with the identifier mentioned in this page."|title=CiteSeerx – DMCA Notice|url=https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.604.4916 |url-status=deviated |archive-url=https://web.archive.org/web/20220318075713/https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.604.4916|archive-date=2022-03-18 }}</ref>
Summary:
Please note that all contributions to Niidae Wiki may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see
Encyclopedia:Copyrights
for details).
Do not submit copyrighted work without permission!
Cancel
Editing help
(opens in new window)
Search
Search
Editing
CiteSeerX
(section)
Add topic