Jump to content

Talk:Programming with Big Data in R

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by MaxEnt (talk | contribs) at 04:49, 22 May 2020 (Trimmed from lead: sp). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
WikiProject iconComputing Start‑class
WikiProject iconThis article is within the scope of WikiProject Computing, a collaborative effort to improve the coverage of computers, computing, and information technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
StartThis article has been rated as Start-class on Wikipedia's content assessment scale.
???This article has not yet received a rating on the project's importance scale.
WikiProject iconStatistics Start‑class Low‑importance
WikiProject iconThis article is within the scope of WikiProject Statistics, a collaborative effort to improve the coverage of statistics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
StartThis article has been rated as Start-class on Wikipedia's content assessment scale.
LowThis article has been rated as Low-importance on the importance scale.

Trimmed from lead

These afterthoughts at the bottom of the lead are not adding clarity IMO, so I've trimmed them off, until someone figures out a way to sort this out enough to make it a value add.

It is clear that pbdR is not only suitable for small clusters, but is also more stable for analyzing big data and more scalable for supercomputers.[third-party source needed] In short, pbdR

  • does not like Rmpi, snow, snowfall, do-like,[clarification needed] nor parallel packages in R,
  • does not focus on interactive computing nor master/workers,
  • but is able to use both SPMD and task parallelisms.

MaxEnt 04:44, 22 May 2020 (UTC)[reply]

The actual markup:

It is clear that pbdR is not only suitable for small [[Computer cluster|clusters]], but is also more stable for analyzing [[big data]] and more scalable for [[supercomputer]]s.<ref>{{cite book|author=Schmidt, D., Ostrouchov, G., Chen, W.-C., and Patel, P.|title=Tight Coupling of R and Distributed Linear Algebra for High-Level Programming with Big Data|year=2012|pages=811–815|journal=High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion|url=http://dl.acm.org/citation.cfm?id=2477156|doi=10.1109/SC.Companion.2012.113|isbn=978-0-7695-4956-9}}</ref>{{third-party-inline|date=October 2014}} In short, pbdR
* does ''not'' like Rmpi, {{clarify|text=snow, snowfall, do-like,|date=October 2014}} nor parallel packages in R,
* does ''not'' focus on interactive computing nor master/workers,
* but is able to use ''both'' SPMD and task parallelisms.

Probably the restored version needs to begin "According to D. Schmidt, et al, R is suitable for $purpose.

Then the three verbs 'like', 'focus', and 'able' need to revised into encyclopedia tone. — MaxEnt 04:48, 22 May 2020 (UTC)[reply]