Julia for Data Science

# for reproducibility
versioninfo()

Julia Version 1.7.3
Commit 742b9abb4d (2022-05-06 12:58 UTC)
Platform Info:
  OS: macOS (x86_64-apple-darwin21.4.0)
  CPU: Intel(R) Core(TM) i9-9980HK CPU @ 2.40GHz
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-12.0.1 (ORCJIT, skylake)
Environment:
  JULIA_NUM_THREADS = 8
  JULIA_EDITOR = code

In the previous two tutorials, we practiced a few essential data wrangling steps in R and Python:

Pipes
Data ingestion
Data filtering (rows) and selection (columns)
Data sorting and ranking
Data merging (joins)
Mutate (dplyr) or transform (Julia)
Pivot (tidyr) or reshape with stack/unstack (Julia)
Group by
Data summaries
Visualization

The Julia package DataFrames.jl is the analog of dplyr and data.table in R and pandas in Python.

Optional reading: Comparison of DataFrames.jl with Python/R/Stata.

using AlgebraOfGraphics, CairoMakie, CSV, DataFrames, Dates, Pipe

# path to MIMIC data
mimic_path = Sys.islinux() ? "/home/shared/1.0" : "/Users/huazhou/Desktop/mimic-iv-1.0"

"/Users/huazhou/Desktop/mimic-iv-1.0"

# for printing all columns of DataFrame
ENV["COLUMNS"] = 1000

1000

Data ingestion

Plain text files can be parsed by the CSV.jl package.

icustays_tbl

We use the dateformat argument to correctly parse intime and outtime as DateTime.

icustays_tbl = CSV.File(
    mimic_path * "/icu/icustays.csv.gz",
    dateformat = "yyyy-mm-dd HH:MM:SS"
    ) |> DataFrame
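As a small check of what the dateformat pattern means, the sketch below parses a single timestamp with the Dates standard library. The sample string is the first intime from the icustays output, written in the space-separated layout that the pattern implies; that the raw file uses exactly this layout is an assumption.

```julia
using Dates

# Same pattern as the `dateformat` keyword passed to CSV.File.
fmt = dateformat"yyyy-mm-dd HH:MM:SS"

# Hypothetical raw value, assumed to be space-separated as the pattern suggests.
raw = "2154-03-03 04:11:00"

# Parse the string into a DateTime using the explicit format.
parsed = DateTime(raw, fmt)
println(parsed)
```

Once a column is parsed as DateTime, comparisons and subtraction between timestamps work directly, which the later wrangling steps rely on.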

admissions_tbl

admissions_tbl = CSV.File(
    mimic_path * "/core/admissions.csv.gz",
    dateformat = "yyyy-mm-dd HH:MM:SS"
    ) |> DataFrame

patients_tbl

patients_tbl = CSV.File(mimic_path * "/core/patients.csv.gz") |> DataFrame

chartevents_tbl

We use the dateformat argument to correctly parse charttime as DateTime.

@time chartevents_tbl = CSV.File(
    mimic_path * "/icu/chartevents_filtered_itemid.csv.gz", 
    dateformat = "yyyy-mm-dd HH:MM:SS"
    ) |> 
    DataFrame

  1.804059 seconds (32.94 k allocations: 777.766 MiB, 2.18% gc time, 3.87% compilation time)

Let's visualize the heart rate readings for a specific stay.

#filter(row -> row.stay_id == 30600691 && row.itemid == 220045, chartevents_tbl) |> 
chartevents_subset = @pipe chartevents_tbl |> 
    filter(row -> row.stay_id == 30600691 && row.itemid == 220045, _) |> 
    select(_, [:charttime, :valuenum]) |>
    DataFrame
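The @pipe macro (from Pipe.jl) substitutes the value on the left of each |> for the _ placeholder on the right. A rough base-Julia equivalent, using anonymous functions and a hypothetical vector of named tuples standing in for the data frame, might look like this:

```julia
# Hypothetical stand-in for a few chartevents rows (values taken from the sample output).
readings = [
    (stay_id = 30600691, itemid = 220045, valuenum = 65.0),
    (stay_id = 30600691, itemid = 223761, valuenum = 97.6),
    (stay_id = 30600691, itemid = 220045, valuenum = 56.0),
    (stay_id = 34100191, itemid = 220045, valuenum = 136.0),
]

# Base `|>` passes its left-hand side as the single argument of the function on its right.
heart_rates = readings |>
    (rs -> filter(r -> r.stay_id == 30600691 && r.itemid == 220045, rs)) |>
    (rs -> map(r -> r.valuenum, rs))

println(heart_rates)
```

The placeholder form saves writing one anonymous function per step, which is why the tutorial code uses @pipe throughout.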

# be patient: time-to-first-plot is long!
x = chartevents_subset[!, :charttime]
y = chartevents_subset[!, :valuenum]
df = (; x, y)
plt = data(df) *
    mapping(:x, [:y] .=> "heart rate") *
    visual(Scatter)
draw(plt)

Target cohort (from R session)

Let's continue the task we started in R. We aim to develop a predictive model that estimates the probability of dying within 30 days of ICU stay intime, based on the following baseline features:

first_careunit
age at intime
gender
ethnicity
first measurement of the following vitals since ICU stay intime
- 220045 for heart rate
- 223761 for Temperature Fahrenheit

We restrict to the first ICU stays of each unique patient.

Wrangling and merging data frames

Our strategy is:

Identify and keep the first ICU stay of each patient.
Identify and keep the first vital measurements during the first ICU stay of each patient.
Join four data frames into a single data frame.

Important data wrangling concepts: group_by, sort, slice, joins, and pivot.
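Joins do most of the work in the last step of the strategy. Here is a minimal sketch of a left join, assuming DataFrames.jl is installed; the two toy tables and their values are invented for illustration and are not MIMIC data.

```julia
using DataFrames

# Hypothetical toy tables: three stays, but demographic data for only two patients.
toy_stays = DataFrame(subject_id = [1, 2, 3], stay_id = [10, 20, 30])
toy_patients = DataFrame(subject_id = [1, 2], gender = ["F", "M"])

# A left join keeps every row of the left table; unmatched rows get `missing`.
joined = leftjoin(toy_stays, toy_patients, on = :subject_id)

# Sort explicitly so the row order does not depend on the join implementation.
sort!(joined, :subject_id)
println(joined)
```

Because subject 3 has no match in the right table, its gender is missing, and the joined gender column allows missing values.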

Step 1: restrict to the first ICU stay of each patient

icustays_tbl has 76,540 rows; keeping only the first ICU stay of each patient reduces it to 53,150 rows.

icustays_tbl_1ststay = @pipe icustays_tbl |>
    sort(_, [:subject_id, :intime]) |>
    unique(_, :subject_id)
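The sort-then-unique idiom works because unique keeps the first row it encounters for each key. A small sketch on a hypothetical toy table (invented values, assuming DataFrames.jl is installed) makes the behavior concrete:

```julia
using DataFrames, Dates

# Hypothetical toy table: two patients with two ICU stays each, in arbitrary order.
toy_stays = DataFrame(
    subject_id = [2, 1, 1, 2],
    stay_id    = [21, 12, 11, 22],
    intime     = [DateTime(2150, 6, 22), DateTime(2151, 1, 5),
                  DateTime(2150, 12, 20), DateTime(2150, 6, 19)],
)

# Sort by patient and admission time, then keep the first row per patient.
sorted = sort(toy_stays, [:subject_id, :intime])
first_stays = unique(sorted, :subject_id)
println(first_stays)
```

Without the sort, unique would keep whichever stay happens to appear first in the file, which is not necessarily the earliest one.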

Step 2: restrict to the first vital measurements during the ICU stay

Key data wrangling concepts: select, left_join, right_join, group_by, arrange, pivot.

@time chartevents_tbl_1ststay = @pipe chartevents_tbl |>
    # pull in the intime/outtime of each ICU stay
    rightjoin(_, select(icustays_tbl_1ststay, :stay_id, :intime, :outtime), on = :stay_id) |> 
    # only keep measurements charted between ICU intime and outtime
    filter(row -> ismissing(row.charttime) ? false : (row.charttime ≥ row.intime && row.charttime ≤ row.outtime), _) |>
    # only keep the first charttime for each stay_id x item
    sort(_, [:stay_id, :itemid, :charttime]) |>
    unique(_, [:stay_id, :itemid]) |>
    # do not need charttime, intime and outtime anymore
    select(_, Not([:charttime, :intime, :outtime])) |>
    # pivot_wider (R) or reshape (Julia)
    unstack(_, [:subject_id, :hadm_id, :stay_id], :itemid, :valuenum) |>
    # more informative column names
    rename(_, Dict(
        "220045" => "heart_rate", 
        "223761" => "temp_f",
        ))

  8.693733 seconds (96.13 M allocations: 4.301 GiB, 12.29% gc time, 58.83% compilation time)
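The pivot at the end of this step may be easier to follow on a tiny example. The sketch below, assuming DataFrames.jl is installed and using a hypothetical three-row long table, shows how unstack turns itemid values into columns and fills absent combinations with missing.

```julia
using DataFrames

# Hypothetical long-format table: one row per (stay, item) measurement.
long = DataFrame(
    stay_id  = [1, 1, 2],
    itemid   = [220045, 223761, 220045],
    valuenum = [65.0, 97.6, 136.0],
)

# Row key :stay_id, column key :itemid, cell values :valuenum.
wide = unstack(long, :stay_id, :itemid, :valuenum)

# The new columns are named after the itemid values, as strings.
rename!(wide, "220045" => "heart_rate", "223761" => "temp_f")
println(wide)
```

Stay 2 has no temperature reading in the toy data, so its temp_f entry is missing, which is the same reason the real output has missing values in temp_f.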

Step 3: merge data frames

New data wrangling concept: mutate.

@time mimic_icu_cohort = @pipe icustays_tbl_1ststay |>
    # merge data frames
    leftjoin(_, admissions_tbl, on = [:subject_id, :hadm_id]) |>
    leftjoin(_, patients_tbl, on = [:subject_id]) |>
    leftjoin(_, chartevents_tbl_1ststay, on = [:stay_id, :subject_id, :hadm_id]) |>
    # age_intime is the age at ICU stay intime
    insertcols!(_, :age_intime => _.anchor_age .+ year.(_.intime) .- _.anchor_year) |>
    # whether the patient died within 30 days of ICU stay intime
    insertcols!(_, :hadm_to_death => _.deathtime .- _.intime) |>
    insertcols!(_, :thirty_day_mort => _.hadm_to_death .≤ Millisecond(2592000000))
# a missing thirty_day_mort (no recorded deathtime) means the patient did not die
replace!(mimic_icu_cohort.thirty_day_mort, missing => false)
mimic_icu_cohort

  6.223667 seconds (9.67 M allocations: 1.919 GiB, 3.31% gc time, 67.03% compilation time)
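The constant 2592000000 is 30 days expressed in milliseconds, and subtracting two DateTime values yields a Millisecond period. The following sketch uses only the Dates standard library; the deathtime value is hypothetical and chosen for illustration.

```julia
using Dates

# 30 days = 30 * 24 * 60 * 60 * 1000 milliseconds.
threshold = convert(Millisecond, Day(30))

# Subtracting DateTime values gives a Millisecond period.
intime = DateTime(2154, 3, 3, 4, 11)
deathtime = DateTime(2154, 3, 20, 12, 0)   # hypothetical value
elapsed = deathtime - intime

# Fixed periods of different units can be compared directly.
died_within_30_days = elapsed <= Day(30)

# Comparing `missing` propagates `missing`; coalesce maps it to false.
no_death_recorded = coalesce(missing <= threshold, false)

println(threshold, "  ", died_within_30_days, "  ", no_death_recorded)
```

Writing the threshold as Day(30) instead of a raw millisecond count could make the intent of the comparison easier to read, and the last line mirrors what the replace! call does for patients without a deathtime.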

Data visualization

It is always a good idea to visualize data as much as possible before any statistical analysis.

Remember we want to model:

thirty_day_mort ~ first_careunit + age_intime + gender + ethnicity + heart_rate + temp_f

Let's start with a numerical summary of variables of interest.

@pipe mimic_icu_cohort |>
    select(_, [
        :first_careunit, 
        :gender, 
        :ethnicity, 
        :age_intime, 
        :heart_rate, 
        :temp_f, 
        :thirty_day_mort
        ]) |> 
    describe(_)

Univariate summaries

Bar plot of first_careunit.

@pipe mimic_icu_cohort |> 
    groupby(_, :first_careunit) |> 
    combine(_, nrow) |>
    barplot(
        _.first_careunit.refs, 
        _.nrow,
        axis = (xticks = (1:size(_, 1), _.first_careunit), title = "First Care Unit", xticklabelrotation = 45.0)
)

Bivariate summaries

Tally of thirty_day_mort vs first_careunit.

@pipe mimic_icu_cohort |> 
    groupby(_, [:first_careunit, :thirty_day_mort]) |> 
    combine(_, nrow) |>
    disallowmissing(_, :thirty_day_mort) |>
    barplot(
        _.first_careunit.refs, 
        _.nrow, 
        stack = _.thirty_day_mort,
        color = _.thirty_day_mort,
        axis = (xticks = (1:size(_, 1), _.first_careunit), title = "First Care Unit", xticklabelrotation = 45.0)
    )
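The counts behind both bar plots come from groupby followed by combine with nrow. A minimal sketch of the two-key tally on a hypothetical toy table (invented values, assuming DataFrames.jl is installed):

```julia
using DataFrames

# Hypothetical toy cohort: care unit and 30-day mortality flag per patient.
toy = DataFrame(
    first_careunit  = ["MICU", "MICU", "CCU", "MICU", "CCU"],
    thirty_day_mort = [false, true, false, false, false],
)

# One output row per (unit, outcome) combination that occurs, with its count in :n.
tally = combine(groupby(toy, [:first_careunit, :thirty_day_mort]), nrow => :n)

# Collect the counts in a Dict so lookups do not depend on group order.
counts = Dict((r.first_careunit, r.thirty_day_mort) => r.n for r in eachrow(tally))
println(counts)
```

Combinations that never occur (here, CCU with a death) produce no row at all, so a stacked bar for that unit has a single segment.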

Pros and Cons of Julia

Pros

Julia solves the notorious two-language problem in scientific computing: it combines the functionality and ease of use of Python, R, Matlab, SAS, and Stata with the speed of C/C++ and Java. News: Julia Joins Petaflop Club.
As a new language, Julia integrates well with modern hardware (GPUs, parallel and distributed computing).
Excels in domains such as differential equations, automatic differentiation, and optimization.
Interoperability with other languages (Python, R, Matlab, C, C++, Fortran).

Cons

Smaller ecosystem? Not anymore. On the contrary, some ecosystems (e.g., plotting, auto-diff, DL) are so rich that users may find it confusing to choose among them.
Smaller user base, compared to Python and R.
Lack of IDEs as feature-rich as RStudio.
Compilation time of some packages (Plots.jl, LoopVectorization.jl, etc.) can be long. This is the time-to-first-plot issue.

Output: icustays_tbl (first 30 rows shown)
	subject_id	hadm_id	stay_id	first_careunit	last_careunit	intime	outtime	los
	Int64	Int64	Int64	String	String	DateTime	DateTime	Float64
1	17867402	24528534	31793211	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2154-03-03T04:11:00	2154-03-04T18:16:56	1.58745
2	14435996	28960964	31983544	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2150-06-19T17:57:00	2150-06-22T18:33:54	3.02562
3	17609946	27385897	33183475	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2138-02-05T18:54:00	2138-02-15T12:42:05	9.74172
4	18966770	23483021	34131444	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2123-10-25T10:35:00	2123-10-25T18:59:47	0.350544
5	12776735	20817525	34547665	Neuro Stepdown	Neuro Stepdown	2200-07-12T00:33:00	2200-07-13T16:44:40	1.67477
6	10215159	24283593	34569476	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2124-09-20T15:05:29	2124-09-21T22:06:58	1.2927
7	14489052	26516390	35056286	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2118-10-26T10:33:56	2118-10-26T20:28:10	0.412662
8	15914763	28906020	36909804	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2176-12-14T12:00:00	2176-12-17T11:47:01	2.99098
9	16256226	20013290	39289362	Neuro Stepdown	Neuro Stepdown	2150-12-20T16:09:08	2150-12-21T14:58:40	0.951065
10	19194449	21641999	39387567	Coronary Care Unit (CCU)	Coronary Care Unit (CCU)	2123-11-12T02:53:35	2123-11-12T13:52:03	0.457269
11	15537237	27472769	39467232	Neuro Intermediate	Neuro Intermediate	2156-02-28T17:38:00	2156-02-29T16:57:08	0.97162
12	15332976	29762192	39883649	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2165-08-16T15:20:48	2165-08-17T17:09:47	1.07568
13	16841280	26340268	32550034	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2149-01-14T09:24:00	2149-01-15T23:56:41	1.60603
14	12974563	29618057	32563675	Neuro Stepdown	Neuro Stepdown	2138-11-13T23:30:01	2138-11-15T16:25:19	1.70507
15	18599212	28538226	33267162	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2129-06-01T16:27:39	2129-06-06T17:01:33	5.02354
16	14609218	20606189	34947848	Neuro Stepdown	Neuro Stepdown	2174-06-28T21:13:00	2174-07-05T17:01:32	6.82537
17	10390732	22177535	35370343	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2147-06-20T19:40:57	2147-06-22T11:47:38	1.67131
18	17675016	29235706	36961856	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2173-04-23T17:59:30	2173-05-10T16:06:47	16.9217
19	12687112	26132667	37445058	Neuro Stepdown	Neuro Stepdown	2162-05-31T18:08:45	2162-06-04T10:16:13	3.67185
20	18423151	27753193	30073725	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2169-07-14T19:04:06	2169-07-15T13:21:12	0.761875
21	17216313	22563195	30201049	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2180-03-03T16:20:10	2180-03-04T18:37:42	1.09551
22	17530304	21776160	30655167	Trauma SICU (TSICU)	Trauma SICU (TSICU)	2129-03-21T07:09:00	2129-03-22T15:26:48	1.34569
23	12207593	22795209	30000646	Coronary Care Unit (CCU)	Coronary Care Unit (CCU)	2194-04-29T01:39:22	2194-05-03T18:23:48	4.69752
24	10656173	25778760	30001555	Medical Intensive Care Unit (MICU)	Medical Intensive Care Unit (MICU)	2177-09-27T11:23:13	2177-09-28T18:26:00	1.2936
25	14311522	24622512	30002548	Cardiac Vascular Intensive Care Unit (CVICU)	Cardiac Vascular Intensive Care Unit (CVICU)	2111-08-17T13:13:43	2111-08-18T18:50:31	1.23389
26	10208468	25796414	30002925	Medical Intensive Care Unit (MICU)	Medical Intensive Care Unit (MICU)	2134-06-05T03:37:00	2134-06-05T22:45:15	0.797396
27	10682002	20035892	30003087	Medical Intensive Care Unit (MICU)	Medical Intensive Care Unit (MICU)	2132-12-01T20:58:25	2132-12-07T21:18:19	6.01382
28	16235911	28956560	30003306	Surgical Intensive Care Unit (SICU)	Surgical Intensive Care Unit (SICU)	2188-06-05T23:38:19	2188-06-08T00:32:17	2.03748
29	12509799	25897223	30004530	Cardiac Vascular Intensive Care Unit (CVICU)	Cardiac Vascular Intensive Care Unit (CVICU)	2165-07-31T09:40:35	2165-08-03T16:29:09	3.28373
30	18860233	27978004	30007216	Medical Intensive Care Unit (MICU)	Medical Intensive Care Unit (MICU)	2191-03-10T12:33:00	2191-03-11T18:15:59	1.23818
⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮

Output: admissions_tbl (first 30 rows shown)
	subject_id	hadm_id	admittime	dischtime	deathtime	admission_type	admission_location	discharge_location	insurance	language	marital_status	ethnicity	edregtime	edouttime	hospital_expire_flag
	Int64	Int64	DateTime	DateTime	DateTime?	String31	String?	String31?	String15	String7	String15?	String31	DateTime?	DateTime?	Int64
1	14679932	21038362	2139-09-26T14:16:00	2139-09-28T11:30:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	SINGLE	UNKNOWN	missing	missing	0
2	15585972	24941086	2123-10-07T23:56:00	2123-10-12T11:22:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
3	11989120	21965160	2147-01-14T09:00:00	2147-01-17T14:25:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	UNKNOWN	missing	missing	0
4	17817079	24709883	2165-12-27T17:33:00	2165-12-31T21:18:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	OTHER	missing	missing	0
5	15078341	23272159	2122-08-28T08:48:00	2122-08-30T12:32:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	BLACK/AFRICAN AMERICAN	missing	missing	0
6	19124609	20517215	2169-03-14T12:44:00	2169-03-20T19:15:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	UNKNOWN	missing	missing	0
7	17301855	29732723	2140-06-06T14:23:00	2140-06-08T14:25:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
8	17991012	24298836	2181-07-10T20:28:00	2181-07-12T15:49:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
9	16865435	23216961	2185-07-19T02:12:00	2185-07-21T11:50:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
10	13693648	21640725	2111-01-30T23:43:00	2111-02-02T13:03:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
11	10803182	22438070	2168-01-24T21:14:00	2168-01-27T11:36:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
12	10733959	25108561	2175-08-08T08:56:00	2175-08-10T12:30:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	BLACK/AFRICAN AMERICAN	missing	missing	0
13	13246095	25626292	2186-01-25T10:52:00	2186-01-28T12:41:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
14	18802685	26035883	2143-02-01T16:02:00	2143-02-03T15:29:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	SINGLE	WHITE	missing	missing	0
15	16942914	23562639	2152-07-12T14:46:00	2152-07-16T14:25:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
16	15266824	28353929	2153-02-02T23:26:00	2153-02-05T11:48:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
17	11586661	26986717	2182-06-05T22:51:00	2182-06-08T13:30:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
18	19197569	29780270	2185-04-01T01:09:00	2185-04-05T11:20:00	missing	ELECTIVE	missing	HOME	Medicaid	ENGLISH	missing	WHITE	missing	missing	0
19	16865105	29806879	2189-01-01T13:05:00	2189-01-04T11:00:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
20	19011229	29236557	2134-04-27T22:37:00	2134-05-04T13:10:00	missing	ELECTIVE	missing	ACUTE HOSPITAL	Medicaid	?	missing	UNKNOWN	missing	missing	0
21	17433076	24048827	2124-06-16T18:37:00	2124-06-19T16:29:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
22	16211263	21494617	2115-03-09T15:43:00	2115-03-11T12:58:00	missing	ELECTIVE	missing	HOME	Medicaid	ENGLISH	missing	WHITE	missing	missing	0
23	19170987	28464307	2187-01-06T18:23:00	2187-01-10T12:48:00	missing	ELECTIVE	missing	HOME	Medicaid	ENGLISH	missing	WHITE	missing	missing	0
24	10707837	28513402	2118-12-14T04:37:00	2118-12-16T11:44:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
25	12564481	22701556	2170-08-05T10:38:00	2170-08-09T10:24:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
26	18465134	22377925	2171-04-18T08:57:00	2171-04-19T16:22:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
27	10293374	26739622	2155-08-28T14:30:00	2155-08-31T15:00:00	missing	ELECTIVE	missing	HOME	Other	ENGLISH	missing	WHITE	missing	missing	0
28	11562038	29180864	2146-05-03T13:06:00	2146-05-05T12:53:00	missing	ELECTIVE	missing	HOME	Medicaid	ENGLISH	SINGLE	ASIAN	missing	missing	0
29	15987366	21292141	2138-08-24T20:06:00	2138-10-02T11:30:00	missing	ELECTIVE	missing	HOME HEALTH CARE	Other	ENGLISH	SINGLE	ASIAN	missing	missing	0
30	10318660	24052497	2119-11-14T07:26:00	2119-11-23T18:25:00	missing	ELECTIVE	missing	ACUTE HOSPITAL	Other	ENGLISH	missing	WHITE	missing	missing	0
⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮

Output: patients_tbl (first 30 rows shown)
	subject_id	gender	anchor_age	anchor_year	anchor_year_group	dod
	Int64	String1	Int64	Int64	String15	Date?
1	10000048	F	23	2126	2008 - 2010	missing
2	10002723	F	0	2128	2017 - 2019	missing
3	10003939	M	0	2184	2008 - 2010	missing
4	10004222	M	0	2161	2014 - 2016	missing
5	10005325	F	0	2154	2011 - 2013	missing
6	10007338	F	0	2153	2017 - 2019	missing
7	10008101	M	0	2142	2008 - 2010	missing
8	10009872	F	0	2168	2014 - 2016	missing
9	10011333	F	0	2132	2014 - 2016	missing
10	10011879	M	0	2158	2014 - 2016	missing
11	10012663	F	0	2171	2011 - 2013	missing
12	10012691	F	0	2165	2011 - 2013	missing
13	10013428	M	0	2142	2011 - 2013	missing
14	10014536	F	0	2113	2008 - 2010	missing
15	10017072	M	0	2180	2008 - 2010	missing
16	10018724	F	0	2124	2008 - 2010	missing
17	10018726	M	0	2182	2014 - 2016	missing
18	10019105	M	0	2152	2008 - 2010	missing
19	10020370	F	0	2170	2011 - 2013	missing
20	10020442	M	0	2170	2011 - 2013	missing
21	10020546	M	0	2112	2008 - 2010	missing
22	10018928	F	31	2125	2008 - 2010	missing
23	10022764	F	0	2143	2008 - 2010	missing
24	10022951	F	0	2137	2008 - 2010	missing
25	10021917	M	54	2147	2017 - 2019	missing
26	10025573	M	0	2123	2017 - 2019	missing
27	10025785	F	0	2155	2008 - 2010	missing
28	10029477	F	0	2111	2014 - 2016	missing
29	10035753	F	0	2127	2017 - 2019	missing
30	10033879	F	28	2173	2011 - 2013	missing
⋮	⋮	⋮	⋮	⋮	⋮	⋮

Output: chartevents_tbl (first 30 rows shown)
	subject_id	hadm_id	stay_id	charttime	itemid	valuenum
	Int64	Int64	Int64	DateTime	Int64	Float64
1	10003700	28623837	30600691	2165-04-24T05:30:00	220045	65.0
2	10003700	28623837	30600691	2165-04-24T05:38:00	223761	97.6
3	10003700	28623837	30600691	2165-04-24T06:00:00	220045	56.0
4	10003700	28623837	30600691	2165-04-24T06:09:00	220045	55.0
5	10003700	28623837	30600691	2165-04-24T07:00:00	220045	57.0
6	10003700	28623837	30600691	2165-04-24T07:00:00	223761	97.8
7	10003700	28623837	30600691	2165-04-24T08:00:00	220045	56.0
8	10004235	24181354	34100191	2196-02-24T16:39:00	220045	136.0
9	10004235	24181354	34100191	2196-02-24T17:00:00	220045	134.0
10	10004235	24181354	34100191	2196-02-24T17:16:00	220045	144.0
11	10004235	24181354	34100191	2196-02-24T17:48:00	220045	133.0
12	10004235	24181354	34100191	2196-02-24T18:00:00	220045	124.0
13	10004235	24181354	34100191	2196-02-24T19:00:00	220045	113.0
14	10004235	24181354	34100191	2196-02-24T20:00:00	220045	105.0
15	10004235	24181354	34100191	2196-02-24T21:00:00	220045	110.0
16	10004235	24181354	34100191	2196-02-24T22:00:00	220045	104.0
17	10004235	24181354	34100191	2196-02-24T23:00:00	220045	101.0
18	10004235	24181354	34100191	2196-02-25T00:00:00	220045	107.0
19	10004235	24181354	34100191	2196-02-25T01:00:00	220045	106.0
20	10004235	24181354	34100191	2196-02-25T02:02:00	220045	110.0
21	10004235	24181354	34100191	2196-02-25T03:00:00	220045	108.0
22	10004235	24181354	34100191	2196-02-25T04:00:00	220045	114.0
23	10004235	24181354	34100191	2196-02-25T05:00:00	220045	111.0
24	10004235	24181354	34100191	2196-02-25T06:00:00	220045	117.0
25	10004235	24181354	34100191	2196-02-25T07:00:00	220045	118.0
26	10004235	24181354	34100191	2196-02-25T08:00:00	220045	122.0
27	10004235	24181354	34100191	2196-02-25T09:00:00	220045	123.0
28	10004235	24181354	34100191	2196-02-25T10:00:00	220045	115.0
29	10004235	24181354	34100191	2196-02-25T12:00:00	220045	115.0
30	10004235	24181354	34100191	2196-02-25T13:00:00	220045	114.0
⋮	⋮	⋮	⋮	⋮	⋮	⋮

Output: chartevents_subset (heart rate readings for stay 30600691)
	charttime	valuenum
	DateTime	Float64
1	2165-04-24T05:30:00	65.0
2	2165-04-24T06:00:00	56.0
3	2165-04-24T06:09:00	55.0
4	2165-04-24T07:00:00	57.0
5	2165-04-24T08:00:00	56.0

Output: chartevents_tbl_1ststay (first 30 rows shown)
	subject_id	hadm_id	stay_id	heart_rate	temp_f
	Int64?	Int64?	Int64	Float64?	Float64?
1	12466550	23998182	30000153	104.0	99.1
2	12207593	22795209	30000646	100.0	98.8
3	12980335	23552849	30001148	80.0	95.6
4	12168737	29283664	30001336	65.0	98.5
5	17371178	24502166	30001396	86.0	98.8
6	16513856	24463832	30001446	82.0	98.1
7	19609454	24188515	30001656	99.0	98.9
8	15904173	23836605	30001947	105.0	97.9
9	17921898	28841024	30002415	80.0	97.6
10	17938576	20818145	30002498	81.0	97.8
11	14311522	24622512	30002548	80.0	98.5
12	10208468	25796414	30002925	70.0	98.6
13	10682002	20035892	30003087	74.0	98.4
14	16165135	24791729	30003125	95.0	99.2
15	11423795	20012928	30003226	89.0	99.0
16	14895375	24753602	30003275	88.0	98.0
17	11206784	22308094	30003372	106.0	98.4
18	15332791	20683754	30003598	75.0	96.9
19	11307058	25946296	30003729	70.0	98.4
20	18300445	26541280	30003746	103.0	98.9
21	12227720	29396704	30003749	67.0	missing
22	10369174	24697158	30004144	65.0	97.5
23	17220323	25700666	30004242	60.0	98.1
24	17580058	25858979	30004306	115.0	98.5
25	14335301	27088506	30004462	100.0	95.2
26	12509799	25897223	30004530	68.0	97.6
27	11553072	24760680	30004568	64.0	97.3
28	19272232	28173870	30004576	91.0	97.9
29	12844527	27959182	30004627	81.0	97.9
30	12098571	20553314	30004798	82.0	98.5
⋮	⋮	⋮	⋮	⋮	⋮

Output: describe summary of the selected mimic_icu_cohort columns
	variable	mean	min	median	max	nmissing	eltype
	Symbol	Union…	Any	Union…	Any	Int64	Type
1	first_careunit		Cardiac Vascular Intensive Care Unit (CVICU)		Trauma SICU (TSICU)	0	String
2	gender		F		M	0	Union{Missing, String1}
3	ethnicity		AMERICAN INDIAN/ALASKA NATIVE		WHITE	0	Union{Missing, String31}
4	age_intime	64.4705	18	66.0	102	0	Int64
5	heart_rate	87.4667	0.0	85.0	941.0	15	Union{Missing, Float64}
6	temp_f	98.0343	0.0	98.1	106.0	954	Union{Missing, Float64}
7	thirty_day_mort	0.0996802	0	0.0	1	0	Union{Missing, Bool}