sfba.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A Mastodon instance for the San Francisco Bay Area. Come on in and join us!

Server stats:

2.4K
active users

#tidyverse

9 posts9 participants0 posts today

ggplot2 is the gold standard when it comes to data visualization.

The image in this post showcases examples of ggplot2 visualizations, demonstrating its versatility to create a wide range of plots with nearly limitless customization options.

Check out my online course, "Data Visualization in R Using ggplot2 & Friends," for a deeper dive into creating stunning plots with ggplot2.

More info: statisticsglobe.com/online-cou

gganimate is a powerful extension for ggplot2 that transforms static visualizations into dynamic animations. By adding a time dimension, it allows you to illustrate trends, changes, and patterns in your data more effectively.

The attached animated visualization, which I created with gganimate, showcases a ranked bar chart of the top 3 countries for each year based on inflation since 1980.

More information: statisticsglobe.com/online-cou

Understanding probability distributions is key to making informed decisions in statistics and data science. Probability distributions describe how the values of a variable are expected to behave, making them crucial for interpreting data and predicting outcomes.

The visualization shown in this post illustrates the distributions.

Further details: statisticsglobe.com/online-cou

Visualizing gene structures in R? gggenes, an extension of ggplot2, simplifies the process of creating clear and informative gene diagrams, making genomic data easier to interpret and share.

Visualization: cran.r-project.org/web/package

Click this link for detailed information: statisticsglobe.com/online-cou

Local regression is a non-parametric method for fitting smooth curves to data by applying multiple localized regressions. It is useful for uncovering non-linear relationships when the data’s exact form is unknown. Proper use of local regression can reveal trends in noisy data, but poor implementation might lead to misleading results.

Image: en.wikipedia.org/wiki/Local_re

More details: eepurl.com/gH6myT

There are now so many "backends" for {dplyr}—duckdb with {duckplyr}, polars with {tidypolars}, various database engines with {dbplyr}, {data.table} with {dtplyr}. Is there a blog post or flow chart somewhere with pros and cons of each? Like, comparisons of memory requirements, speed, and how likely they are to "just work"?