Hadley Wickham scite author profile

This paper presents the reshape package for R, which provides a common framework for many types of data reshaping and aggregation. It uses a paradigm of 'melting' and 'casting', where the data are 'melted' into a form which distinguishes measured and identifying variables, and then 'cast' into a new shape, whether it be a data frame, list, or high dimensional array. The paper includes an introduction to the conceptual framework, practical advice for melting and casting, and a case study.
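The melt-then-cast idea described in this abstract can be sketched in a few lines. This is a minimal illustration in plain Python, not the reshape package's R API: the function names `melt` and `cast`, their parameters, and the sample data are all assumptions made for this example.

```python
from collections import defaultdict


def melt(rows, id_vars, measure_vars):
    """Turn wide rows into long records: identifying variables + (variable, value)."""
    molten = []
    for row in rows:
        ids = {name: row[name] for name in id_vars}
        for var in measure_vars:
            molten.append({**ids, "variable": var, "value": row[var]})
    return molten


def cast(molten, row_var, col_var, aggregate):
    """Cast molten records into a nested table, aggregating values that share a cell."""
    cells = defaultdict(list)
    for rec in molten:
        cells[(rec[row_var], rec[col_var])].append(rec["value"])
    table = defaultdict(dict)
    for (row_key, col_key), values in cells.items():
        table[row_key][col_key] = aggregate(values)
    return {row_key: dict(cols) for row_key, cols in table.items()}


# Hypothetical sample data: two identifying variables, two measured variables.
wide = [
    {"subject": "a", "time": 1, "height": 10, "weight": 5},
    {"subject": "a", "time": 2, "height": 12, "weight": 6},
    {"subject": "b", "time": 1, "height": 9, "weight": 4},
]

molten = melt(wide, id_vars=["subject", "time"], measure_vars=["height", "weight"])
by_subject = cast(molten, row_var="subject", col_var="variable", aggregate=max)
print(by_subject)
```

Once the data are molten, the same records can be cast into other shapes by choosing different row and column variables or a different aggregation function.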


ggplot2

Wickham

2009

12,303

1,343


Understanding evolutionary relationships among crops, their wild progenitors, and close relatives provides the requisite framework for conserving and using crop genetic diversity (Fielder et al., 2015; Dempewolf et al., 2017; Migicovsky and Myles, 2017). While the evolutionary histories of many annual crop species have been reconstructed, well-resolved phylogenies remain elusive for many crop genera, in particular those that include woody perennials (Barakat et al., 2012). Long-lived plants such as woody vines and trees have several basic biological attributes that complicate phylogenetic reconstruction: they are often obligate outcrossers that are highly heterozygous, undergo extensive interspecific hybridization, exhibit little among-population variation, and commonly share haplotypes among species (Petit and Hampe, 2006). Traditional approaches to molecular phylogenetics, including the sequencing of chloroplast and nuclear genes, have contributed to the resolution of relationships in some groups (Soltis et al., 1999; Rokas et al., 2003). The advent of high-throughput sequencing and analysis has greatly enhanced our capacity to analyze hundreds of thousands of sites from across the genome and offers great potential to advance resolution of relationships in groups that have posed challenges to traditional phylogenetic approaches (e.g., Cavender-Bares et al., 2015; Hipp et al., 2014; Uribe-Convers et al., 2016). Approximately 75% of woody perennial crops are clonally propagated, including most fruit and nut trees (


The Split-Apply-Combine Strategy for Data Analysis

Wickham

2011

J. Stat. Soft.

2,105

1,311


Many data analysis problems involve the application of a split-apply-combine strategy, where you break up a big problem into manageable pieces, operate on each piece independently and then put all the pieces back together. This insight gives rise to a new R package that allows you to smoothly apply this strategy, without having to worry about the type of structure in which your data is stored. The paper includes two case studies showing how these insights make it easier to work with batting records for veteran baseball players and a large 3d array of spatio-temporal ozone measurements.
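The three steps named in this abstract (split, apply, combine) can be sketched as follows. This is a minimal plain-Python illustration, not the API of the R package the paper introduces: the helper `split_apply_combine`, its parameters, and the batting records are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def split_apply_combine(records, key, apply):
    """Split records by `key`, apply a function to each piece, combine into a dict."""
    # Split: group records that share the same key value.
    pieces = defaultdict(list)
    for rec in records:
        pieces[rec[key]].append(rec)
    # Apply + combine: run the function on each piece and collect the results.
    return {group: apply(piece) for group, piece in pieces.items()}


# Hypothetical batting records, invented for this example.
batting = [
    {"player": "p1", "year": 2001, "hits": 100},
    {"player": "p1", "year": 2002, "hits": 120},
    {"player": "p2", "year": 2001, "hits": 80},
]

mean_hits = split_apply_combine(
    batting, key="player", apply=lambda piece: mean(r["hits"] for r in piece)
)
print(mean_hits)
```

Because each piece is processed independently, the applied function never needs to know how the full dataset is stored.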


ggplot2

Wickham

2011

WIREs Computational Stats

2,325

1,234


ggmap: Spatial Visualization with ggplot2

Kahle,

Wickham

2013

The R Journal

1,636

1,015


In spatial statistics the ability to visualize data and models superimposed with their basic social landmarks and geographic context is invaluable. ggmap is a new tool which enables such visualization by combining the spatial information of static maps from Google Maps, OpenStreetMap, Stamen Maps or CloudMade Maps with the layered grammar of graphics implementation of ggplot2. In addition, several new utility functions are introduced which allow the user to access the Google Geocoding, Distance Matrix, and Directions APIs. The result is an easy, consistent and modular framework for spatial graphics with several convenient tools for spatial data analysis.


ggplot2

Wickham

2016

20,804

957





Hadley Wickham

Welcome to the Tidyverse

Data Analysis

Reshaping Data with the reshape Package

