Jump to content

Talk:R (programming language)/Archive 3

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
Archive 1Archive 2Archive 3

Weight on language over environment in opening sentence(s)

Current article: "R is a programming language".

First line of the (official) R Project's main page: "R is a free software environment for statistical computing and graphics."
First line of the (official) R Project's about page: "R is a language and environment for statistical computing and graphics."

(Note: emphasis added by me in the above quotations.)

The current introduction exclusively mentions the language identity in the opening definition, and first paragraphs. Only later is the environment mentioned, seemingly as a subordinate issue. That doesn't fit with the definitions/descriptions given by the R Project themselves.

—DIV (137.111.13.4 (talk) 01:42, 11 February 2022 (UTC))

Of course, this article is explicitly about the programming language aspect, but maybe that shouldn't be the case. Maybe it should be moved to "R (computing)", say. Alternatively, what about adding a new "R (software environment)" article to complement the existing one??
—DIV (137.111.13.4 (talk) 01:45, 11 February 2022 (UTC))

typical

A recent edit summary says it is not typical to use R from the command line. Is there a WP:RS for that? Does that mean by number of people, or amount of time it is used by different people? (That is, it is weighted by usage/person.) This might also affect the above discussion about language vs. environment. Note, for example, C is a programming language, but Visual C is an environment for developing C programs. Gah4 (talk) 21:42, 15 February 2022 (UTC)

Seems to me that the intent of the original phrasing was that R is typically used interactively in a REPL; whether that REPL is launched through the main CLI executable or RStudio doesn't seem particularly important. bonob (talk) 10:51, 16 February 2022 (UTC)
I know about REPL, but didn't recognize the acronym, so had to look it up. I didn't read the original that way, but maybe. Gah4 (talk) 14:09, 16 February 2022 (UTC)

Ok

Function Sjsjsksjdh Jddjj 2409:4063:6C8D:C9D:0:0:B28B:4808 (talk) 13:02, 11 April 2022 (UTC)

Hi

Yeah 86.2.214.41 (talk) 05:33, 31 May 2022 (UTC)

footnote numbering

The first footnote I see is #6. What happened to #1-#5? Socialresearch (talk) 15:10, 12 June 2022 (UTC)

@Socialresearch: They're in the infobox. – Joe (talk) 16:05, 12 June 2022 (UTC)
Ah, okay, thanks. Socialresearch (talk) 22:06, 12 June 2022 (UTC)

useR! conference list

I had just done some work to try to improve the useR! section but I am not sure if the conference needs so much space, especially with the list of previous venues. The initial addition of the useR! section did not include the list[1] and when it was added, it was shorter only listing conferences up until 2012.[2] Since the article does not highlight specific useR! conferences, it might be best to remove the list entirely. What do you all think? Moon motif (talk) 10:43, 26 December 2022 (UTC)

Separating out Milestones section content?

Should the content of the Milestones section be separated out? I feel like parts of it are more suited to the History section like R becoming part of the official GNU project in release 0.60 or the Features section like the pipe operator |> and anonymous function shortcut syntax in release 4.1.0. This is even more opinion-based but some of the included milestones don't seem like actual milestones? I really don't mean to offend anyone but including release 3.3.3's deprecation of Windows XP support or 2.11's 64-bit Windows support does not seem historic? (I had not heard of R then so I do not know for sure but it seems normal to deprecate old operating systems that have been succeeded and support new architectures) Considering how much real estate the Milestones section takes up, I'm unsure of how to navigate this. Also, I don't feel comfortable being single judge to decide what releases to or not to keep. Moon motif (talk) 07:58, 17 January 2023 (UTC)

I support trimming it and moving it to the history section. The selection of things to include seems like blatant original research right now. We should be basing it on what changes are covered in sources beyond the R changelog. – Joe (talk) 08:27, 17 January 2023 (UTC)

Moved Comparison with alternatives/Python to talk

@Newystats: I moved this paragraph to talk:

Comparison with alternatives/Python

Python and R are interpreted, dynamically typed programming languages with duck typing that can be extended by importing packages. Python is a general-purpose programming language while R is specifically designed for doing statistical analysis. Python has a BSD-like license in contrast to R's GNU General Public License but still permits modifying language implementation and tools.[1]

Why is R being compared with Python? Python is a general-purpose programming language, but R is a specific-purpose programming language. This paragraph is comparing an apple with an orange. R_(programming_language)#Interfaces says you can embed R to Python by installing Rpy2. The implication is you can have both full Python and full R.

You can also embed full Python and other languages in R, as described in Yihui Xie; Joseph J. Allaire; Garrett Grolemund (30 December 2023), R Markdown: The Definitive Guide, Chapman & Hall, Wikidata Q76441281

Regarding Python has a BSD-like license in contrast to R's GNU General Public License but still permits modifying language implementation and tools.:

  1. This contrast is immaterial.
  2. This sentence is the only one that is cited. The book title of the citation is intriguing: "Python vs. R for Data Science." However, the paragraph doesn't paraphrase the book's thesis.

Timhowardriley (talk) 20:32, 3 January 2024 (UTC)

@Timhowardriley: I object to removing the section on "Comparison with alternatives". If you think it's biased, please propose changes that remove the bias.
Wikipedia has many comparisons like this that provide a valuable service. Only yesterday I got substantial help with something I was doing from a crudely similar comparison on Wikipedia. In my judgment deleting the entire "Comparison with alternatives" section degrades the quality of this article.
I'm restoring that entire section including the discussion of Python. I plan to add other material, but I'm not exactly certain what just yet.
DavidMCEddy (talk) 14:24, 5 January 2024 (UTC)
Python and R are the two leading programming languages in data science and the comparison is very frequently discussed in relevant sources, so I think it makes sense to include it here. However I agree that the section as it stands is pretty shallow. – Joe (talk) 14:42, 5 January 2024 (UTC)
Regarding If you think it's biased, please propose changes that remove the bias.: Comparisons between products and services are best handled through a table. For a narrative comparison to be unbiased, it requires a lot of words to fairly describe each differentiating characteristic. Most importantly, Wikipedia articles need to be reliably sourced. As Wikipedia editors of this product, we are inherently biased. Instead, a reliable source (like Consumer Reports) needs to compare R with a competitor, then we can paraphrase that material. On the other hand, simply name-dropping the NY Times is misleading. I got past the pay-lock once to read the article. I remember it being very supportive R and having only a mention of SAS. Moreover, it quoted SAS's marketing manage who refuted the SAS disparagements. The Comparison of statistical packages link in the "See also" section is the proper way to compare R with its competitors. Regarding I'm restoring the ... discussion of Python: Please refute any of my claims that this is a lousy paragraph. Timhowardriley (talk) 23:28, 5 January 2024 (UTC)
Regarding I plan to add other material, but I'm not exactly certain what just yet.: The cart is in front of the horse. Wikipedia articles need to be reliably sourced. Step one is to discover something relevant in your secondary research. Step two is to paraphrase that material into the Wikipedia article. Otherwise, it's original research. Timhowardriley (talk) 23:55, 5 January 2024 (UTC)
Accepted re. deleting the "Comparison with alternatives". Thanks, DavidMCEddy (talk) 16:24, 6 January 2024 (UTC)

References

  1. ^ Grogan, Michael (2018). Python vs. R for Data Science. O'Reilly Media, Inc.

Removing the description was "as a programming language to teach introductory statistics at the University of Auckland."

The introduction says "was started by professors Ross Ihaka and Robert Gentleman as a programming language to teach introductory statistics at the University of Auckland." This struck me as a bit odd, as R is not generally considered a tool to teach introductory statistics. Anyway, there's a hyperlink that references this introductory statistics comment. This is what it actually says, with no reference whatsoever to teach introductory statistics.


Early History - 1990 • Ross Ihaka joins the Department of Statistics at the University of Auckland. • Robert Gentleman spends 1990 in Auckland on sabbatical from the University of Waterloo. • During a chance encounter in the corridor, the following exchange takes place: Gentleman: “Let’s write some software.” Ihaka: “Sure, that sounds like fun.” • The initial goal is to build a testbed for trying out ideas and to publish a paper or two.Early History - 1990 Drkirkby (talk) 15:15, 3 February 2024 (UTC)

The quote from the PDF link is, "We set a goal of developing enough of a language to teach introductory statistics courses at Auckland." I added this quote to the citation. Timhowardriley (talk) 17:39, 3 February 2024 (UTC)
The "Let's write some software" quote is on page 10 of the PDF. The citation points to page 12 of the PDF. Timhowardriley (talk) 17:45, 3 February 2024 (UTC)
My error, I missed that.
I'm having the misfortune of having to learn some introductory statistics with Minitab. I wish we were using R, but the university does not consider R an appropriate language to teach introductory statistics. I can see their point to be honest. Drkirkby (talk) 22:23, 4 February 2024 (UTC)
The best way to learn statistics is to get a good eraser. ;-) Timhowardriley (talk) 04:38, 5 February 2024 (UTC)

Misuse of print() and return()

In a lot of the code examples the print() and return() functions are used incorrectly. For example R does not require the print(x) function to print the values of a vector x. Simply using the name of the function will do that. There are places where print() should be used (for example in the middle of a function), but not in most of the code shown. This is an important distinction between R and other languages.

Likewise you do not need return() at the end of a function definition. Whatever is on the last line will be returned. You do need to use return() if you are returning from the middle of a function. Again, this is an important distinction between R and other languages.

I would like to edit the examples to reflect this, unless someone has a reason for not doing so. Mcsmom (talk) 15:05, 18 February 2024 (UTC)

*Regarding Misuse of print() and return(): I disagree. print() and return() are not misused.
*Regarding ... the print() and return() functions are used incorrectly.: I disagree. They are used correctly.
*Regarding For example R does not require the print(x) function to print the values of a vector x.: Correct. If x is on a line by itself, then the interpreter will send it to print() for you.
*Regarding This is an important distinction between R and other languages.: I disagree. It's a shortcut and not important.
*Regarding Likewise you do not need return() at the end of a function definition.: Correct. If the last expression is left unassigned, then the interpreter will return it for you. Indeed, R_(programming_language)#Programmer_created_functions explains this in the comments.
*Regarding Again, this is an important distinction between R and other languages.: I disagree. It's a shortcut and not important.
*Regarding I would like to edit the examples to reflect this...: I will revert these edits b/c they will confuse a reader not familiar with R's shortcuts. Indeed, when I was new to the language and encountered these shortcuts in code, I was confused. The article's audience is intended to be as broad as possible. However, a new section titled, "Shortcuts" would be an appropriate place to enlighten a reader new to the language. Timhowardriley (talk) 22:43, 18 February 2024 (UTC)
Additional thought: Just because a language allows for a syntactic construct, it doesn't mean it's wise to use it. For example, COBOL used to allow a function to alter a local variable of another function. See Computer_program#Coupling. Software engineering principles emphasize readability over cryptic syntax constructs. For example, this version of the article has a mistake trying to explain the return() shortcut. The return() is not optional b/c the expression is assigned to the variable z. Timhowardriley (talk) 00:29, 19 February 2024 (UTC)

Add .rhistory

.rhistory is another filetype that stores the history of the code executed in a R session. I want to add it to the file types list but I am new to Wikipedia and I don't know how. AHWikipedian (talk) 11:55, 13 November 2023 (UTC)

I have added that in for you Pansydyke (talk) 16:55, 19 January 2024 (UTC)
Thanks. AHWikipedian (talk) 18:16, 29 February 2024 (UTC)