At a high level, the tidyverse is a language for solving data science challenges with R code. Its primary goal is to facilitate a conversation between a human and a computer about data. Less abstractly, the tidyverse is a collection of R packages that share a high-level design philosophy and low-level grammar and data structures, so that learning one package makes it easier to learn the next.
No abstract
This paper presents the reshape package for R, which provides a common framework for many types of data reshaping and aggregation. It uses a paradigm of 'melting' and 'casting', where the data are 'melted' into a form which distinguishes measured and identifying variables, and then 'cast' into a new shape, whether it be a data frame, list, or high dimensional array. The paper includes an introduction to the conceptual framework, practical advice for melting and casting, and a case study.
Understanding evolutionary relationships among crops, their wild progenitors, and close relatives provides the requisite framework for conserving and using crop genetic diversity (Fielder et al., 2015; Dempewolf et al., 2017; Migicovsky and Myles, 2017). While the evolutionary histories of many annual crop species have been reconstructed, well-resolved phylogenies remain elusive for many crop genera, in particular those that include woody perennials (Barakat et al., 2012). Long-lived plants such as woody vines and trees have several basic biological attributes that complicate phylogenetic reconstruction: they are often obligate outcrossers that are highly heterozygous, undergo extensive interspecific hybridization, exhibit little among-population variation, and commonly share haplotypes among species (Petit and Hampe, 2006). Traditional approaches to molecular phylogenetics, including the sequencing of chloroplast and nuclear genes, have contributed to the resolution of relationships in some groups (Soltis et al., 1999; Rokas et al., 2003). The advent of high-throughput sequencing and analysis has greatly enhanced our capacity to analyze hundreds of thousands of sites from across the genome and offers great potential to advance resolution of relationships in groups that have posed challenges to traditional phylogenetic approaches (e.g., Cavender-Bares et al., 2015; Hipp et al., 2014, Uribe-Convers et al., 2016). Approximately 75% of woody perennial crops are clonally propagated, including most fruit and nut trees (
Many data analysis problems involve the application of a split-apply-combine strategy, where you break up a big problem into manageable pieces, operate on each piece independently and then put all the pieces back together. This insight gives rise to a new R package that allows you to smoothly apply this strategy, without having to worry about the type of structure in which your data is stored.The paper includes two case studies showing how these insights make it easier to work with batting records for veteran baseball players and a large 3d array of spatio-temporal ozone measurements.
This article discusses ggplot2, an open source R package, based on a grammatical theory of graphics. The underlying theory has been discussed in depth elsewhere so this article illustrates some of the consequences of the theory for creating new graphics, the importance of programmable graphics, and the rich ecosystem that has grown up around ggplot2.
In spatial statistics the ability to visualize data and models superimposed with their basic social landmarks and geographic context is invaluable. ggmap is a new tool which enables such visualization by combining the spatial information of static maps from Google Maps, OpenStreetMap, Stamen Maps or CloudMade Maps with the layered grammar of graphics implementation of ggplot2. In addition, several new utility functions are introduced which allow the user to access the Google Geocoding, Distance Matrix, and Directions APIs. The result is an easy, consistent and modular framework for spatial graphics with several convenient tools for spatial data analysis.
the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, express or implied, with respect to the material contained herein or for any errors or omissions that may have been made.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.