riva quiroga · mauricio vargas riva quiroga. polls! ... 2 326 0.21 premium e si1 59.8 61 3.89 3.84...

@rivaquirogaRiva Quiroga

Translating datasets using {datalang}: the development of {datos} package for the R4DS Spanish translation

#useR2019: the French keyboard experience

> when technologyis not built around yourown language

language gap

500 dollarsenglish course

450 dollarsminimum wage

language gap

happy R users

sticker-drivenproject

the translators

translators + reviewers

≠ varieties of Spanish

≠ backgrounds

≠ R experience

≠ github experience

≠ github experience��

2017 2018

translation: text

2017 2018

translation: text + datasets

2017 2018

infrastructure

github.com/cienciadedatos

Edgar Ruiz

Edgar Ruiz Mauricio Vargas

Edgar Ruiz Riva QuirogaMauricio Vargas

polls!

mtcars🚗

mtcars🚗 -> mtautos

2017 2018

programminghistorian.org / @ProgHist

2017 2018

> head(diamonds)

# A tibble: 6 x 10

price carat cut color clarity depth table x y z

1 326 0.23 Ideal E SI2 61.5 55 3.95 3.98 2.43

2 326 0.21 Premium E SI1 59.8 61 3.89 3.84 2.31

3 327 0.23 Good E VS1 56.9 65 4.05 4.07 2.31

4 334 0.29 Premium I VS2 62.4 58 4.2 4.23 2.63

5 335 0.31 Good J SI2 63.3 58 4.34 4.35 2.75

6 336 0.24 Very good J VVS2 62.8 57 3.94 3.96 2.48

> head(diamantes)

# A tibble: 6 x 10

precio quilate corte color claridad profundidad tabla x y z

1 326 0.23 Ideal E SI2 61.5 55 3.95 3.98 2.43

2 326 0.21 Premium E SI1 59.8 61 3.89 3.84 2.31

3 327 0.23 Bueno E VS1 56.9 65 4.05 4.07 2.31

4 334 0.29 Premium I VS2 62.4 58 4.2 4.23 2.63

5 335 0.31 Bueno J SI2 63.3 58 4.34 4.35 2.75

6 336 0.24 Muy bueno J VVS2 62.8 57 3.94 3.96 2.48

> diamantes <- rename(diamonds, quilate = carat, corte = cut, …

one possible approach

> diamantes <- rename(diamonds, quilate = carat, corte = cut, … %>%

mutate(corte = case_when(cut == “Good” ~ “Bueno” …

{datalang}

another approach

github.com/edgararuiz/datalang

> library(datalang)> my_spec <- system.file("specs/diamonds.yml", package = "datos")> translate_data(my_spec)

1 326 0.23 Ideal E SI2 61.5 55 3.95 3.98 2.43

2 326 0.21 Premium E SI1 59.8 61 3.89 3.84 2.31

3 327 0.23 Bueno E VS1 56.9 65 4.05 4.07 2.31

4 334 0.29 Premium I VS2 62.4 58 4.2 4.23 2.63

5 335 0.31 Bueno J SI2 63.3 58 4.34 4.35 2.75

6 336 0.24 Muy bueno J VVS2 62.8 57 3.94 3.96 2.48

source: ggplot2::diamonds

name: diamantes

variables:

price:

trans: precio

desc: Precio en dólares americanos

values:

Good: Bueno

Very Good: Muy bueno

Fair: Regular

> head(diamantes)

# A tibble: 6 x 10

1 326 0.23 Ideal E SI2 61.5 55 3.95 3.98 2.43

2 326 0.21 Premium E SI1 59.8 61 3.89 3.84 2.31

3 327 0.23 Bueno E VS1 56.9 65 4.05 4.07 2.31

4 334 0.29 Premium I VS2 62.4 58 4.2 4.23 2.63

5 335 0.31 Bueno J SI2 63.3 58 4.34 4.35 2.75

6 336 0.24 Muy bueno J VVS2 62.8 57 3.94 3.96 2.48

> ?diamantes

{datos}

translating book + datasets

{datos}

it’s easy to contribute!

2017 2018 2019

conference-driven package

{datos}

too bulky!

{datos}

create_promise <- function(path, package = "datos") {

spec_loc <- system.file(path, package = package)

spec <- read_yaml(spec_loc)

new_name <- spec$df$name

origin <- spec$df$source

envir <- as.environment(paste0("package:", package))

rlang::env_bind_lazy(

envir,

!! new_name := translate_data(spec_loc, eval(parse_expr(origin))

{datos}

> datos::diamantesError: diamantes is not an exported object from 'namespace:datos'

{datos}

delayedAssign('diamantes',

eval(parse(file.path(system.file('scripts','diamantes.txt', package = 'datos')))))

cienciadedatos.github.io/datos

Edgar Ruiz

Edgar Ruiz Riva Quiroga Mauricio Vargas

Mauro Lepore

Edgar Ruiz Riva Quiroga Mauricio Vargas

Daniela VázquezMauro Lepore Rayna Harris

datasets

translations

reviews

datasets

translations

reviews

datasets

translations

reviews

🎉🎉

datasets

translations

reviews

🎉🎉

datasets

translations

reviews

🎉🎉

but are we near the end of this project?

2018 ∞

what we have created is not just a book translation

and a package

human and technical infrastructure

our biggest contribution developing {datos}

> brand new errors

2018 ∞

Riva Quiroga / @rivaquiroga

riva quiroga · mauricio vargas riva quiroga. polls! ... 2 326 0.21 premium e si1 59.8 61 3.89 3.84...

Documents

presentation riva

stimulate 2005if-personalization / luz. m. quiroga...

byron quiroga

overview – version 2.31

astrologia real - oscar quiroga

disonancias en arqueología histórica (quiroga)

alejandro quiroga - making spaniards

porfolio german quiroga

doctoring . horacio quiroga

operating guide - riva real easy – riva real easy

riva woodburning & multi-fuel - wordpress.com · riva...

latinoamericana - quiroga

steward: diego quiroga

riva hawthorn

quiroga f. power point

riva compact

riva ansbacher

riva bella

2.31 geo 530

riva riva 75 venere, 2006, 1.990.000 € for sale