Jump to content

Talk:R (programming language)

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
Former good article nomineeR (programming language) was a Engineering and technology good articles nominee, but did not meet the good article criteria at the time. There may be suggestions below for improving the article. Once these issues have been addressed, the article can be renominated. Editors may also seek a reassessment of the decision if they believe there was a mistake.
Article milestones
DateProcessResult
November 4, 2010Good article nomineeNot listed

Milestones

The table in the Milestones section show R version with format x.y, e.g. R 3.6. However, except for some of the historical releases, the formal version format is x.y.z, e.g. R 3.6.0. The dates associated with each entry appears to point to when the x.y.0 release was done. Should the 'Release' version be updated to use x.y.0 format?


useR! conference list

I had just done some work to try to improve the useR! section but I am not sure if the conference needs so much space, especially with the list of previous venues. The initial addition of the useR! section did not include the list[1] and when it was added, it was shorter only listing conferences up until 2012.[2] Since the article does not highlight specific useR! conferences, it might be best to remove the list entirely. What do you all think? Moon motif (talk) 10:43, 26 December 2022 (UTC)[reply]

Separating out Milestones section content?

Should the content of the Milestones section be separated out? I feel like parts of it are more suited to the History section like R becoming part of the official GNU project in release 0.60 or the Features section like the pipe operator |> and anonymous function shortcut syntax in release 4.1.0. This is even more opinion-based but some of the included milestones don't seem like actual milestones? I really don't mean to offend anyone but including release 3.3.3's deprecation of Windows XP support or 2.11's 64-bit Windows support does not seem historic? (I had not heard of R then so I do not know for sure but it seems normal to deprecate old operating systems that have been succeeded and support new architectures) Considering how much real estate the Milestones section takes up, I'm unsure of how to navigate this. Also, I don't feel comfortable being single judge to decide what releases to or not to keep. Moon motif (talk) 07:58, 17 January 2023 (UTC)[reply]

I support trimming it and moving it to the history section. The selection of things to include seems like blatant original research right now. We should be basing it on what changes are covered in sources beyond the R changelog. – Joe (talk) 08:27, 17 January 2023 (UTC)[reply]

Add .rhistory

.rhistory is another filetype that stores the history of the code executed in a R session. I want to add it to the file types list but I am new to Wikipedia and I don't know how. AHWikipedian (talk) 11:55, 13 November 2023 (UTC)[reply]

I have added that in for you Pansydyke (talk) 16:55, 19 January 2024 (UTC)[reply]

Moved Comparison with alternatives/Python to talk

@Newystats: I moved this paragraph to talk:

Comparison with alternatives/Python

Python and R are interpreted, dynamically typed programming languages with duck typing that can be extended by importing packages. Python is a general-purpose programming language while R is specifically designed for doing statistical analysis. Python has a BSD-like license in contrast to R's GNU General Public License but still permits modifying language implementation and tools.[1]

Why is R being compared with Python? Python is a general-purpose programming language, but R is a specific-purpose programming language. This paragraph is comparing an apple with an orange. R_(programming_language)#Interfaces says you can embed R to Python by installing Rpy2. The implication is you can have both full Python and full R.

You can also embed full Python and other languages in R, as described in Yihui Xie; Joseph J. Allaire; Garrett Grolemund (30 December 2023), R Markdown: The Definitive Guide, Chapman & Hall, Wikidata Q76441281

Regarding Python has a BSD-like license in contrast to R's GNU General Public License but still permits modifying language implementation and tools.:

  1. This contrast is immaterial.
  2. This sentence is the only one that is cited. The book title of the citation is intriguing: "Python vs. R for Data Science." However, the paragraph doesn't paraphrase the book's thesis.

Timhowardriley (talk) 20:32, 3 January 2024 (UTC)[reply]

@Timhowardriley: I object to removing the section on "Comparison with alternatives". If you think it's biased, please propose changes that remove the bias.
Wikipedia has many comparisons like this that provide a valuable service. Only yesterday I got substantial help with something I was doing from a crudely similar comparison on Wikipedia. In my judgment deleting the entire "Comparison with alternatives" section degrades the quality of this article.
I'm restoring that entire section including the discussion of Python. I plan to add other material, but I'm not exactly certain what just yet.
DavidMCEddy (talk) 14:24, 5 January 2024 (UTC)[reply]
Python and R are the two leading programming languages in data science and the comparison is very frequently discussed in relevant sources, so I think it makes sense to include it here. However I agree that the section as it stands is pretty shallow. – Joe (talk) 14:42, 5 January 2024 (UTC)[reply]
Regarding If you think it's biased, please propose changes that remove the bias.: Comparisons between products and services are best handled through a table. For a narrative comparison to be unbiased, it requires a lot of words to fairly describe each differentiating characteristic. Most importantly, Wikipedia articles need to be reliably sourced. As Wikipedia editors of this product, we are inherently biased. Instead, a reliable source (like Consumer Reports) needs to compare R with a competitor, then we can paraphrase that material. On the other hand, simply name-dropping the NY Times is misleading. I got past the pay-lock once to read the article. I remember it being very supportive R and having only a mention of SAS. Moreover, it quoted SAS's marketing manage who refuted the SAS disparagements. The Comparison of statistical packages link in the "See also" section is the proper way to compare R with its competitors. Regarding I'm restoring the ... discussion of Python: Please refute any of my claims that this is a lousy paragraph. Timhowardriley (talk) 23:28, 5 January 2024 (UTC)[reply]
Regarding I plan to add other material, but I'm not exactly certain what just yet.: The cart is in front of the horse. Wikipedia articles need to be reliably sourced. Step one is to discover something relevant in your secondary research. Step two is to paraphrase that material into the Wikipedia article. Otherwise, it's original research. Timhowardriley (talk) 23:55, 5 January 2024 (UTC)[reply]
Accepted re. deleting the "Comparison with alternatives". Thanks, DavidMCEddy (talk) 16:24, 6 January 2024 (UTC)[reply]

Removing the description was "as a programming language to teach introductory statistics at the University of Auckland."

The introduction says "was started by professors Ross Ihaka and Robert Gentleman as a programming language to teach introductory statistics at the University of Auckland." This struck me as a bit odd, as R is not generally considered a tool to teach introductory statistics. Anyway, there's a hyperlink that references this introductory statistics comment. This is what it actually says, with no reference whatsoever to teach introductory statistics.


Early History - 1990 • Ross Ihaka joins the Department of Statistics at the University of Auckland. • Robert Gentleman spends 1990 in Auckland on sabbatical from the University of Waterloo. • During a chance encounter in the corridor, the following exchange takes place: Gentleman: “Let’s write some software.” Ihaka: “Sure, that sounds like fun.” • The initial goal is to build a testbed for trying out ideas and to publish a paper or two.Early History - 1990 Drkirkby (talk) 15:15, 3 February 2024 (UTC)[reply]

The quote from the PDF link is, "We set a goal of developing enough of a language to teach introductory statistics courses at Auckland." I added this quote to the citation. Timhowardriley (talk) 17:39, 3 February 2024 (UTC)[reply]
The "Let's write some software" quote is on page 10 of the PDF. The citation points to page 12 of the PDF. Timhowardriley (talk) 17:45, 3 February 2024 (UTC)[reply]
My error, I missed that.
I'm having the misfortune of having to learn some introductory statistics with Minitab. I wish we were using R, but the university does not consider R an appropriate language to teach introductory statistics. I can see their point to be honest. Drkirkby (talk) 22:23, 4 February 2024 (UTC)[reply]
The best way to learn statistics is to get a good eraser. ;-) Timhowardriley (talk) 04:38, 5 February 2024 (UTC)[reply]
  1. ^ Grogan, Michael (2018). Python vs. R for Data Science. O'Reilly Media, Inc.