Type: Package
Package: sparklyr
Title: R Interface to Apache Spark
Version: 1.9.5.9000
Authors@R: c(
    person(given = "Javier", family = "Luraschi", role = "aut", email = "jluraschi@gmail.com"),
    person(given = "Kevin", family = "Kuo", role = "aut", email = "kevin.kuo@rstudio.com", comment = c(ORCID = "0000-0001-7803-7901")),
    person(given = "Kevin", family = "Ushey", role = "aut", email = "kevin@rstudio.com"),
    person(given = "JJ", family = "Allaire", role = "aut", email = "jj@rstudio.com"),
    person(given = "Samuel", family = "Macedo", role = "ctb", email = "samuelmacedo@recife.ifpe.edu.br"),
    person(given = "Hossein", family = "Falaki", role = "aut", email = "hossein@databricks.com"),
    person(given = "Lu", family = "Wang", role = "aut", email = "lu.wang@databricks.com"),
    person(given = "Andy", family = "Zhang", role = "aut", email = "yue.zhang@databricks.com"),
    person(given = "Yitao", family = "Li", role = "aut", email = "yitaoli1990@gmail.com", comment = c(ORCID = "0000-0002-1261-905X")),
    person(given = "Jozef", family = "Hajnala", role = "ctb", email = "jozef.hajnala@gmail.com"),
    person(given = "Maciej", family = "Szymkiewicz", role = "ctb", email = "mszymkiewicz@gmail.com", comment = c(ORCID = "0000-0003-1469-9396")),
    person(given = "Wil", family = "Davis", role = "ctb", email = "william.davis@worthingtonindustries.com"),
    person(given = "Edgar", family = "Ruiz", role = c("aut", "cre"), email = "edgar@rstudio.com"),
    person(family = "RStudio", role = "cph"),
    person(family = "The Apache Software Foundation", role = c("aut", "cph"))
    )
Maintainer: Edgar Ruiz <edgar@rstudio.com>
Description: R interface to Apache Spark, a fast and general engine for big
    data processing, see <https://spark.apache.org>. This package supports
    connecting to local and remote Apache Spark clusters, provides a 'dplyr'
    compatible back-end, and provides an interface to Spark's built-in
    machine learning algorithms.
License: Apache License 2.0 | file LICENSE
URL: https://spark.posit.co/
BugReports: https://github.com/sparklyr/sparklyr/issues
Depends:
    R (>= 3.2)
Imports:
    config (>= 0.2),
    DBI (>= 1.0.0),
    dbplyr (>= 2.5.0),
    dplyr (>= 1.0.9),
    generics,
    globals,
    glue,
    httr (>= 1.2.1),
    jsonlite (>= 1.4),
    methods,
    openssl (>= 0.8),
    purrr,
    rlang (>= 0.1.4),
    rstudioapi (>= 0.10),
    tidyr (>= 1.2.0),
    tidyselect,
    uuid,
    vctrs,
    withr,
    xml2
Suggests:
    arrow (>= 0.17.0),
    broom,
    diffobj,
    foreach,
    ggplot2,
    iterators,
    janeaustenr,
    Lahman,
    mlbench,
    nnet,
    nycflights13,
    R6,
    r2d3,
    RCurl,
    reshape2,
    shiny (>= 1.0.1),
    parsnip,
    testthat,
    rprojroot
Encoding: UTF-8
SystemRequirements: Spark: 2.x, or 3.x, or 4.x
Collate:
    'core_invoke.R'
    'spark_ide.R'
    'connection_spark.R'
    'spark_data.R'
    'arrow.R'
    'config.R'
    'spark_context.R'
    'connection.R'
    'connection_databricks.R'
    'connection_kubernetes.R'
    'connection_shell.R'
    'connection_livy.R'
    'connection_livy_utils.R'
    'connection_progress.R'
    'connection_qubole.R'
    'connection_test.R'
    'connection_yarn.R'
    'core_gateway.R'
    'core_jobj.R'
    'core_serialize.R'
    'core_utils.R'
    'utils.R'
    'tbl_spark.R'
    'spark_sql.R'
    'dplyr_spark.R'
    'sdf_interface.R'
    'stratified_sample.R'
    'sdf_io.R'
    'dplyr_sql.R'
    'data_copy.R'
    'spark_apply.R'
    'data_read.R'
    'data_write.R'
    'dbi.R'
    'dplyr_hof.R'
    'dplyr_sql_translation.R'
    'dplyr_verbs.R'
    'imports.R'
    'install_spark.R'
    'install_spark_versions.R'
    'install_spark_windows.R'
    'install_tools.R'
    'jobs_api.R'
    'ml_aft_survival_regression.R'
    'ml_als.R'
    'ml_model_constructors.R'
    'ml_kmeans.R'
    'ml_bisecting_kmeans.R'
    'ml_decision_tree.R'
    'ml_evaluation.R'
    'ml_feature_encoders.R'
    'ml_feature_imputer.R'
    'ml_feature_lsh.R'
    'ml_feature_math.R'
    'ml_feature_normalizers.R'
    'ml_feature_scalers.R'
    'ml_feature_selectors.R'
    'ml_feature_sql.R'
    'ml_feature_string_indexer.R'
    'ml_feature_text_vectorizers.R'
    'ml_feature_tokenizers.R'
    'ml_feature_vectors.R'
    'ml_fpm.R'
    'ml_gaussian_mixture.R'
    'ml_gbt.R'
    'ml_generalized_linear_regression.R'
    'ml_isotonic_regression.R'
    'ml_lda.R'
    'ml_linear_regression.R'
    'ml_linear_svc.R'
    'ml_logistic_regression.R'
    'ml_mapping_tables.R'
    'ml_metrics.R'
    'ml_multilayer_perceptron.R'
    'ml_naive_bayes.R'
    'ml_one_vs_rest.R'
    'ml_param_utils.R'
    'ml_persistence.R'
    'ml_pipeline.R'
    'ml_power_iteration.R'
    'ml_print_utils.R'
    'ml_random_forest.R'
    'ml_stat.R'
    'ml_transformer.R'
    'ml_tuning.R'
    'ml_utils.R'
    'precondition.R'
    'project_template.R'
    'sdf_stat.R'
    'tidyr_utils.R'
    'sdf_wrapper.R'
    'sdf_unnest.R'
    'sdf_utils.R'
    'spark_compat.R'
    'spark_compile.R'
    'spark_submit.R'
    'spark_utils.R'
    'stream.R'
    'stream_data.R'
    'stream_ui.R'
    'tidiers_ml_classification.R'
    'tidiers_ml_other.R'
    'tidiers_ml_regression.R'
    'tidiers_utils.R'
    'tidyr_nest.R'
    'tidyr_pivot_utils.R'
    'tidyr_pivot_longer.R'
    'tidyr_pivot_wider.R'
    'tidyr_reshape.R'
    'tune-grid-spark.R'
    'utils_cast.R'
    'worker.R'
    'worker_apply.R'
    'zzz.R'
Config/roxygen2/version: 8.0.0
Config/pak/sysreqs: libicu-dev libxml2-dev libssl-dev
Repository: https://sparklyr.r-universe.dev
Date/Publication: 2026-06-26 14:53:25 UTC
RemoteUrl: https://github.com/sparklyr/sparklyr
RemoteRef: HEAD
RemoteSha: 223f4b1c859d078cf21737e5fff5b4640a87c519
NeedsCompilation: no
Packaged: 2026-06-26 16:36:10 UTC; root
Author: Javier Luraschi [aut],
    Kevin Kuo [aut] (ORCID: <https://orcid.org/0000-0001-7803-7901>),
    Kevin Ushey [aut],
    JJ Allaire [aut],
    Samuel Macedo [ctb],
    Hossein Falaki [aut],
    Lu Wang [aut],
    Andy Zhang [aut],
    Yitao Li [aut] (ORCID: <https://orcid.org/0000-0002-1261-905X>),
    Jozef Hajnala [ctb],
    Maciej Szymkiewicz [ctb] (ORCID: <https://orcid.org/0000-0003-1469-9396>),
    Wil Davis [ctb],
    Edgar Ruiz [aut, cre],
    RStudio [cph],
    The Apache Software Foundation [aut, cph]