Jump to content

Wikipedia:Controlling search engine indexing

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Suleiman sms (talk | contribs) at 01:38, 21 October 2023. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

There are a variety of ways in which Wikipedia attempts to control search engine indexing, commonly termed "noindexing" on Wikipedia. The default behavior is that articles older than 90 days are indexed. All of the methods rely on using the noindex HTML meta tag, which tells search engines not to index certain pages. Respecting the tag, especially in terms of removing already indexed content, is up to the individual search engine, and in theory the tag may be ignored entirely.

The control methods are:

  1. Controlling an entire namespace, via MediaWiki software settings
  2. Controlling classes of pages, via MediaWiki:Robots.txt (Wikipedia's Robots.txt file)
  3. Controlling individual pages by adding the __NOINDEX__ magic word into them, either directly or using the {{NOINDEX}} template, however articles are a special case, see #Indexing of articles ("mainspace").
  4. Controlling multiple pages by adding the __NOINDEX__ magic word into standard templates used in certain situations (same caveat as in the third point).
Namespace Status Indexed Can be overridden
(main) newer than 90 days, unpatrolled No No
newer than 90 days, patrolled Yes Yes
older than 90 days Yes No
User: newer than 90 days, unpatrolled No No
newer than 90 days, patrolled No Yes
older than 90 days No Yes
User talk: n/a No Yes
Draft: No No
Draft talk: No No
All others Yes Yes

Indexing of articles ("mainspace")

Articles older than 90 days are automatically indexed.[1] The __NOINDEX__ magic word and the {{NOINDEX}} template do not work on them. Articles younger than 90 days are not indexed, unless they have been patrolled and do not have the __NOINDEX__ magic word or the {{NOINDEX}} template on them (or a template that transcludes the {{NOINDEX}} template, such as the speedy deletion templates).[2][3][4] Note that &action=info will incorrectly state that they are indexed.[5] Articles that include the {{NOINDEX}} template are listed at Category:Noindexed articles.

This patrolling may be done automatically by the software, as in the case of articles created by editors with the autopatrolled user right, or by another editor with the new page reviewer user right (not to be confused with the pending changes reviewer user right).

Other namespaces and robots.txt

Namespace control

On English Wikipedia the entire User:[6] namespace, User talk:, Draft: and Draft talk: namespaces are automatically noindexed via a software setting.[7]

At the same time, __NOINDEX__ and __INDEX__ are disabled, in addition to article space, on the Draft namespace, and the Draft talk namespace; they have no effect there.[8]

Robots.txt noindexing

MediaWiki:Robots.txt forbids analytic tools from visiting sensitive or potentially sensitive types of pages, primarily in the Wikipedia namespace – for example deletion debates. A side effect of not visiting is normally that a page cannot be indexed. Where possible, you should in addition use __NOINDEX__ for those pages.

NOINDEX magic word

Individual pages

Individual pages can be noindexed by adding the __NOINDEX__ magic word into that page, either directly or using the {{NOINDEX}} template. As explained above, this magic word doesn't work in mainspace (on articles).

Pages with the keyword are listed in Category:Noindexed pages.[9]

Standard template noindexing

Some standard templates include the __NOINDEX__ keyword, thereby noindexing pages to which the templates are applied. Such templates should be listed in Category:Wikipedia templates which apply NOINDEX.

The biography of (suleiman Musa sms) Founder and executive Director of SMS INVESTMENT & ENTERPRISES LTD Import export, Shipping and logistics service Oil and gas trading expert

 

THE MD/CEO Introduction.

 Suleiman Musa sms. Was born on 23/November/1988, in Maiduguri borno state of Nigeria. Suleiman Is a graduate of economics from university of Maiduguri,  suleiman is a very hardworking and productive young ambitious exploring ways to create an impact with his knowledge and currently working with a professionals to serve a better management and take the company into highest standard.  Suleiman is a ambitious person with a passion to make a positive impact in people’s live. Suleiman is currently working together with international professionals and national as well in both his operations as he diversified to different businesses in Nigeria.  Such as OIL AND GAS  and  CRUDE OIL  IMPORT/EXPORT,  and a verified Nigeria customs license agent to clearing and forwarding agency. REAL ESTATE, SUPPLY CHAIN ETC.

 

Suleiman Musa suleiman (SMS) the CEO

 SMS INVESTMENT & ENTERPRISES LTD  has a unique ambition of taking his company to be one of the must recognized firm in the world, on trading crude oil,  oil and gas product,  import and export  services that will be a highest quality provider comes to oil and gas products and reliable services.

 His ambition is to be the world’s leading crude oil and gas/petroleum production, energy distribution  and exploration company. in realizing our vision,  we will seek to maximize value to our partners,  clients in a sustainable manner,  by operating with integrity and leveraging local resources to benefits our community and the country we are living in  and operating our business globally.

 Suleiman sms made a positive start from his foundation in 2021/2022 by giving a group of professionals with extensive experience in the international market IMPORT AND EXPORT  and oil and gas industries to COLLABORATE.

ABOUT HIS COMPANY SMS NVESTMENT OIL AND GAS, IMPORT EXPORT AND CLEARING AND FORWARDING AGENCY SERVICES is a fast growing firm and is an indigenous services provider with a skillful staff and members of the oil and gas industries in Nigeria. It was incorporated as a limited liability company in early 2022 to carry on the business of oil and gas trading,  Clearing and forwarding agency, import export and general supply of local and international products. 

 Also involved in civil constructions,  Pipeline services, Procurement,  marine and logistics. It is also involved in so many businesses like crude oil (offshore/Onshore)

 

SMS  INVESTMENT is led by a dynamic management team full of professionals  and it has been strategically positioned to achieve a sustainable competitive advantage to achieve it’s vision,  this is further enable by the proper deployment of good corporate governance and risk management practice and control.

 

In fulfilling our vision we shall continue to build and improve the necessary capacity of the communities and other part of the area’s  where our operation of this business Conducted. Because these communities and the local area’s will be engaged in activities that will ensure Developments and economic stability and to build a strong foundation that will enhance the progress of these communities/local area’s,  we should always endeavor to be responsible corporate citizens empower most strict adherence to our belief in preservation of the environment shall be enforced.  Through our sms investment oil and gas services ....

Other templates

These templates include {{NOINDEX}}:

See also Category:Wikipedia templates which apply NOINDEX.

  • {{Uw-userspacenoindex}} provides a user warning message for inappropriate use of userspace which required noindexing.

INDEX magic word

Individual pages

Individual pages can override namespace noindexing by adding the __INDEX__ magic word into that page, either directly or using the {{INDEX}} template. Such pages appear in Category:Indexed pages. However, INDEX does not override noindexing via MediaWiki:Robots.txt.[10] As explained above, this magic word doesn't work in mainspace (on articles).

The ability to add the INDEX magic word to user spaces (User:, User talk:) has been restricted by an edit filter to extended confirmed users following a community discussion.[11]

Nofollow HTML attribute

Since 2007, all links to other websites from English Wikipedia have the nofollow HTML attribute set.[12] This means that on pages that are indexed by search engines, any links found by a search engine on those pages should not influence the link target's ranking in the search engine's index.

Past discussions

Namespace discussions

Individual template discussions

See also

Notes

  1. ^ 2017 switch from 30 to 90 days
  2. ^ T147544
  3. ^ PageTriage source code
  4. ^ Value of $wgRCMaxAge on WMF wikis
  5. ^ See T157747
  6. ^ Decided at Wikipedia:Village pump (proposals)/Archive 126#Userpage drafts shown in search engines, implemented at phab:T104797.
  7. ^ This is $wgNamespaceRobotPolicies. See Wikimedia's $wgNamespaceRobotPolicies setting for enwiki
  8. ^ This is controlled by the MediaWiki software setting $wgExemptFromUserRobotsControl. On other projects, the exempt namespaces are the same as $wgContentNamespaces, which is set to main space on almost all Wikimedia projects – see here and here.
  9. ^ The listing is done by MediaWiki tracking the keyword. The category name is determined by MediaWiki:Noindex-category.
  10. ^ It does override mw:Manual:$wgArticleRobotPolicies, but this is not used on English Wikipedia: Wikimedia's $wgArticleRobotPolicies setting for enwiki
  11. ^ Special:PermaLink/862856598#Prevent_new_users_from_allowing_search_engine_indexing_of_user_pages
  12. ^ Controlled by $wgNoFollowLinks, set to true in [on Wikimedia's settings file for enwiki]