This vignette assumes a basic understanding of
define_water
and the S4 water
class. See
vignette("intro", package = "tidywater")
for more
information.
To showcase tidywater’s acid-base equilibrium functions, let’s use a common water treatment problem. In this analysis, a hypothetical drinking water utility wants to know how much their pH will be impacted by varying doses of alum. They also want to ensure that their finished water has a pH of 8.
We can create a quick model by manually inputting the utility’s typical water quality. Then we’ll dose the water with their typical alum dose of 30 mg/L, and then a proposed 20mg/L dose. Finally, we’ll see how much caustic is required to raise the pH back to 8.
# Use define_water to prepare for tidywater analysis
no_alum_water <- define_water(ph = 8.3, temp = 18, alk = 150)
# Dose 30 mg/L of alum
alum_30 <- no_alum_water %>%
chemdose_ph(alum = 30) %>%
solvedose_ph(target_ph = 8, chemical = "naoh")
alum_30 # Caustic dose required to raise pH to 8 when 30 mg/L of alum is added
># [1] 10.3
# Dose 20 mg/L of alum
alum_20 <- no_alum_water %>%
chemdose_ph(alum = 20) %>%
solvedose_ph(target_ph = 8, chemical = "naoh")
alum_20 # Caustic dose required to raise pH to 8 when 20 mg/L of alum is added
># [1] 6.2
As expected, a lower alum dose requires a lower caustic dose to reach the target pH.
Note: How can you remember the difference between
solvedose_ph
vs chemdose_ph
? Any function
beginning with “solve” is named for what it is solving for based on one
input: SolveWhatItReturns_Input. So, solvedose_ph
is
solving for a dose based on a target pH.
Other treatment functions are set up as
WhatHappensToTheWater_WhatYouSolveFor. So with chemdose_ph
,
chemicals are being dosed, and we’re solving for the resulting pH (and
other components of acid/base chemistry). chemdose_toc
models the resulting TOC after chemicals are added, and
dissolve_pb
calculates lead solubility in the distribution
system.
_chain
functionsBut what if the utility wants to test a variety of alum doses on a
range of their water quality? Here, we’ll use the power of tidywater’s
_chain
functions to extend this analysis to a full
dataframe.
We’ll use tidywater’s built-in water quality data,
water_df
, then apply define_water_chain
to
convert the data to a water
object. We use
define_water_chain
so that other models can be added to the
dataframe. This function takes a dataframe input, then outputs all
parameters in a water
class column. This is true for all
tidywater functions with the _chain
suffix.
_chain
functions are handy in a piped code block where
you’ll need to use many tidywater functions, such as
chemdose_ph
, chemdose_toc
, etc. After applying
define_water_chain
, we’ll also use
balance_ions_chain
to create a new variable with the ions
balanced for all the “raw” water
objects in the
dataframe.
We’ll also set a range of alum doses to see how they affect each water quality scenario.
# Set a range of alum doses
alum_doses <- tibble(alum_dose = seq(20, 60, 10))
# use tidywater's built-in synthetic data water_df, for this example
raw_water <- water_df %>%
slice_head(n = 2) %>%
define_water_chain(output_water = "raw") %>%
balance_ions_chain(input_water = "raw") %>%
# join alum doses to create several dosing scenarios
cross_join(alum_doses)
chemdose_ph_chain
and pluck_water
Now that we’re set up, let’s dose some alum! To do this, we’ll use
chemdose_ph_chain
, a function with the _chain
suffix introduced earlier but whose tidywater base is
chemdose_ph
. The chemdose_ph_chain
function
requires dosed chemicals to match the argument’s notation. In this case,
our chemical is already properly named. Other chemicals, such as
caustic, ferric sulfate, soda ash and more would need to be named
naoh
, fe2so43
, and na2co3
,
respectively. Most tidywater chemicals are named with their chemical
formula, all lowercase and no special characters.
There are two ways to dose chemicals.
You can pass an appropriately named column into the function, or
You can specify the chemical in the function.
Let’s look at both options using the alum doses from before, and adding hydrochloric acid. You should notice that the ouputs of both methods are the same.
# 1. Use existing column in data frame to dose a chemical
dose_water <- raw_water %>%
mutate(hcl = 5) %>%
chemdose_ph_chain(input_water = "raw", alum = alum_dose) %>%
pluck_water(input_water = c("raw", "dosed_chem_water"), parameter = "ph") %>%
select(-c(raw, dosed_chem_water))
head(dose_water)
># balanced_water alum_dose hcl raw_ph
># 1 <S4 class 'water' [package "tidywater"] with 62 slots> 20 5 7.9
># 2 <S4 class 'water' [package "tidywater"] with 62 slots> 30 5 7.9
># 3 <S4 class 'water' [package "tidywater"] with 62 slots> 40 5 7.9
># 4 <S4 class 'water' [package "tidywater"] with 62 slots> 50 5 7.9
># 5 <S4 class 'water' [package "tidywater"] with 62 slots> 60 5 7.9
># 6 <S4 class 'water' [package "tidywater"] with 62 slots> 20 5 8.5
># dosed_chem_water_ph
># 1 6.60
># 2 6.42
># 3 6.25
># 4 6.07
># 5 5.87
># 6 6.94
# 2. Dose a chemical in the function
dose_water <- raw_water %>%
chemdose_ph_chain(input_water = "raw", alum = alum_dose, hcl = 5) %>%
pluck_water(input_water = c("raw", "dosed_chem_water"), parameter = "ph") %>%
select(-c(raw, dosed_chem_water))
head(dose_water)
># balanced_water alum_dose hcl raw_ph
># 1 <S4 class 'water' [package "tidywater"] with 62 slots> 20 5 7.9
># 2 <S4 class 'water' [package "tidywater"] with 62 slots> 30 5 7.9
># 3 <S4 class 'water' [package "tidywater"] with 62 slots> 40 5 7.9
># 4 <S4 class 'water' [package "tidywater"] with 62 slots> 50 5 7.9
># 5 <S4 class 'water' [package "tidywater"] with 62 slots> 60 5 7.9
># 6 <S4 class 'water' [package "tidywater"] with 62 slots> 20 5 8.5
># dosed_chem_water_ph
># 1 6.60
># 2 6.42
># 3 6.25
># 4 6.07
># 5 5.87
># 6 6.94
Notice in the above code that we used the pluck_water
helper function. This function uses purrr::pluck
to create
a new column for one selected parameter from a water
class
object. You can choose which water
column to pluck from
using the input_water
argument. Next, select the parameter
of interest (which must match the water slot’s name). Finally, the
output column’s name will default to the form
water_parameter
, but there is an option to name it yourself
using the output_column
argument.
solvedose_ph_once
Remember, our original task is to see how alum addition affects the
pH, but the finished water pH needs to be 8. First, we’ll use caustic to
raise the pH to 8. solvedose_ph_once
uses
solvedose_ph
to calculate the required chemical dose (as
chemical, not product) based on a target pH. Similar to
chemdose_ph_chain
, solvedose_ph_once
can
handle chemical selection and target pH inputs as a column or function
arguments. Helpers with the _once
suffix are for tidywater
functions that output numbers instead of waters, including the base
function solvedose_ph
, and will output numeric doses, not
water
objects. Thus, solvedose_ph_chain
doesn’t exist because the water
isn’t changing, so chaining
this function to a downstream tidywater function can be done using
normal tidywater operations.
solve_ph <- raw_water %>%
chemdose_ph_chain("raw", alum = alum_dose) %>%
mutate(target_ph = 8) %>%
solvedose_ph_once(input_water = "dosed_chem_water", chemical = c("naoh", "mgoh2")) %>%
select(-c(raw, dosed_chem_water))
head(solve_ph)
># balanced_water alum_dose target_ph
># 1 <S4 class 'water' [package "tidywater"] with 62 slots> 20 8
># 2 <S4 class 'water' [package "tidywater"] with 62 slots> 20 8
># 3 <S4 class 'water' [package "tidywater"] with 62 slots> 30 8
># 4 <S4 class 'water' [package "tidywater"] with 62 slots> 30 8
># 5 <S4 class 'water' [package "tidywater"] with 62 slots> 40 8
># 6 <S4 class 'water' [package "tidywater"] with 62 slots> 40 8
># chemical dose_required
># 1 naoh 8.3
># 2 mgoh2 6.1
># 3 naoh 12.3
># 4 mgoh2 9.0
># 5 naoh 16.5
># 6 mgoh2 12.0
Now that we have the dose required to raise the pH to 8, let’s dose caustic into the water!
dosed_caustic_water <- raw_water %>%
chemdose_ph_chain(input_water = "raw", output_water = "alum_dosed", alum = alum_dose) %>%
solvedose_ph_once(input_water = "alum_dosed", target_ph = 8, chemical = "naoh") %>%
chemdose_ph_chain(input_water = "alum_dosed", output_water = "caustic_dosed", naoh = dose_required) %>%
pluck_water(input_water = "caustic_dosed", "ph") %>%
select(-c(raw:balanced_water, alum_dosed))
head(dosed_caustic_water)
># alum_dose target_ph chemical dose_required
># 1 20 8 naoh 8.3
># 2 30 8 naoh 12.3
># 3 40 8 naoh 16.5
># 4 50 8 naoh 20.5
># 5 60 8 naoh 24.4
># 6 20 8 naoh 6.2
># caustic_dosed caustic_dosed_ph
># 1 <S4 class 'water' [package "tidywater"] with 62 slots> 7.99
># 2 <S4 class 'water' [package "tidywater"] with 62 slots> 7.98
># 3 <S4 class 'water' [package "tidywater"] with 62 slots> 8.00
># 4 <S4 class 'water' [package "tidywater"] with 62 slots> 8.02
># 5 <S4 class 'water' [package "tidywater"] with 62 slots> 8.01
># 6 <S4 class 'water' [package "tidywater"] with 62 slots> 7.99
You can see the resulting pH from dosing caustic has raised the pH to 8 +/- 0.02 SU. Doses are rounded to the nearest 0.1 mg/L to make the calculations go a little faster.
As you use more tidywater helper functions with larger data sets,
you’ll notice the code can take a few minutes to run. All helper
functions use functions from the
furrr package. To reduce
processing time, you can activate furrr
’s parallel
processing power by using plan()
at the beginning of your
script. plan()
depends on what type of operating system you
have, more info on that in the
Controlling How Futures are
Resolved table.
# For most operating systems, especially Windows, use this at the beginning of your script
# We recommend removing the `workers` argument to use your computer's full power.
plan(multisession, workers = 2)
# rest of script
# At the end of the script, here's an option to explicitly close the multisession processing
plan(sequential)
In this tutorial, we were introduced to tidywater helper functions
_chain
and _once
, which can be used to apply
base functions to a dataframe. Outputs of _chain
functions
are water
objects, meanwhile outputs of _once
functions are numerical. We also used the pluck_water
helper function to extract parameters of interest from our
dataframes.
We implemented these helper functions to complete an example dosing
water with coagulant (alum) and adjusting the resulting pH to a target
pH of 8 using solvedose_ph
and chemdose_ph
functions. To try another example with helper functions and learn about
the blend_waters
function, see
vignette("blend_waters", package = "tidywater")
.