Spark dayname function implementation #20825
kazantsev-maksim wants to merge 22 commits into apache:main from
Conversation
query T
SELECT dayname('2008-02-20'::DATE);
----
Should we also verify that the output data type matches the Spark data types (Utf8 / LargeUtf8 / Utf8View)?
fn spark_day_name(days: i32) -> String {
    let weekday = Date32Type::to_naive_date_opt(days).unwrap().weekday();
    let display_name = get_display_name(weekday.num_days_from_monday());
    display_name.unwrap()
Potential panic in the `unwrap` here?
I tried to improve it
}

fn spark_day_name(days: i32) -> String {
    let weekday = Date32Type::to_naive_date_opt(days).unwrap().weekday();
Here as well? Can we guarantee that this `unwrap` will never panic?
I tried to improve it
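One way to address both `unwrap` comments above is to make the whole lookup infallible by construction. A minimal std-only sketch (the helper name and the arithmetic are illustrative, not the PR's code; it relies on Date32 day 0 being 1970-01-01, which was a Thursday):

```rust
/// Panic-free weekday lookup for a Date32 value (days since 1970-01-01).
/// Returns None instead of panicking for any input.
fn spark_day_name_safe(days: i32) -> Option<&'static str> {
    const NAMES: [&str; 7] = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"];
    // 1970-01-01 was a Thursday, i.e. index 3 counting from Monday.
    // rem_euclid keeps the index in 0..7 even for negative (pre-epoch) days.
    let idx = (days as i64 + 3).rem_euclid(7) as usize;
    NAMES.get(idx).copied()
}
```

Since the computed index is always in 0..7, the `get` can never actually fail, so the function could equally return `&'static str` directly; keeping `Option` just makes the no-panic contract explicit at the call site.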
Test failures:
Thanks for the review @coderfender. Could you please take another look when you have time?
coderfender left a comment
I think the changes are looking good @kazantsev-maksim. Left a couple of questions for further review. Thank you very much for your patience :)
query T
SELECT dayname('2010-04-24'::TIMESTAMP);
----
Could we also add tests with actual timestamp values instead of dates parsed as timestamps here?
DataType::Date32 | DataType::Timestamp(_, _) => spark_day_name_inner(array),
DataType::Utf8 | DataType::Utf8View | DataType::LargeUtf8 => {
    let date_array =
        cast_with_options(array, &DataType::Date32, &CastOptions::default())?;
What would happen for an invalid string here? We could probably handle it and return an error for malformed input?
Invalid strings will be replaced with null values.
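For illustration, the "invalid input becomes NULL" semantics of a safe cast can be sketched without arrow (a std-only stand-in; the real code path is arrow's Utf8 -> Date32 cast kernel, and the parser below is deliberately simplified, with no leap-day validation):

```rust
/// Std-only stand-in for a "safe" string-to-Date32 cast: malformed input
/// becomes None (NULL) instead of an error.
fn safe_cast_to_date32(s: &str) -> Option<i32> {
    // Minimal "YYYY-MM-DD" parse; any failure short-circuits to None.
    let mut parts = s.split('-');
    let y: i64 = parts.next()?.parse().ok()?;
    let m: i64 = parts.next()?.parse().ok()?;
    let d: i64 = parts.next()?.parse().ok()?;
    if parts.next().is_some() || !(1..=12).contains(&m) || !(1..=31).contains(&d) {
        return None;
    }
    // Days since 1970-01-01 via the standard civil-from-days algorithm.
    let yy = if m <= 2 { y - 1 } else { y };
    let era = (if yy >= 0 { yy } else { yy - 399 }) / 400;
    let yoe = yy - era * 400;
    let doy = (153 * (if m > 2 { m - 3 } else { m + 9 }) + 2) / 5 + d - 1;
    let doe = yoe * 365 + yoe / 4 - yoe / 100 + doy;
    Some((era * 146097 + doe - 719468) as i32)
}
```

With arrow's real cast, the equivalent knob is `CastOptions { safe: true, .. }`, which substitutes NULL for values that fail to parse rather than propagating an error.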
        6 => Some(String::from("Sun")),
        _ => None,
    }
}
I don't think we are handling timezone support here? Spark handles the session timezone in its current implementation, and ignoring it could produce wrong results.
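To illustrate the concern (the function name and the fixed-offset model are illustrative only; real timezones need a tz database): the same instant can fall on different calendar days, and hence different weekdays, once an offset is applied, so a Timestamp input must be shifted to the session timezone before being truncated to a date.

```rust
/// Weekday of an epoch timestamp as seen from a fixed UTC offset.
fn local_weekday(epoch_secs: i64, tz_offset_secs: i64) -> &'static str {
    const NAMES: [&str; 7] = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"];
    // Shift to local time, then truncate to whole days since 1970-01-01.
    let local_days = (epoch_secs + tz_offset_secs).div_euclid(86_400);
    // 1970-01-01 was a Thursday (index 3 counting from Monday).
    NAMES[(local_days + 3).rem_euclid(7) as usize]
}
```

For example, 2010-04-24 23:00 UTC is still Saturday in UTC, but already Sunday 01:00 at UTC+02:00.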
    TypeSignatureClass::Timestamp,
)]),
TypeSignature::Coercible(vec![Coercion::new_exact(
    TypeSignatureClass::Native(logical_date()),
Spark's `java.util.Date` maps to Date32 on the Rust side. Given that we are not handling Date64 in the match arm, do you still think we should have logical_date() in the signature, which accepts both Date32 and Date64? Perhaps we could restrict it to Date32?
query T
SELECT dayname('2010-04-24'::TIMESTAMP);
----
Sat
We might need tests to cover various timezones (at least one apart from UTC) :)
----
NULL

query T
I tested this in Spark 4.1.1 and found that the empty string behavior depends on ANSI mode:
-- ANSI on (Spark 4 default):
spark-sql> SELECT dayname('');
[CAST_INVALID_INPUT] The value '' of the type "STRING" cannot be cast to "DATE" ...
-- ANSI off:
spark-sql> SET spark.sql.ansi.enabled=false;
spark-sql> SELECT dayname('');
NULL
The current implementation returns NULL, which matches the non-ANSI behavior. Since Spark 4 defaults to ANSI mode, it might be worth supporting both behaviors here — maybe through an enable_ansi_mode flag similar to how mod/pmod handle it in the same crate. That way the caller (e.g. Comet or another Spark-compatible engine) can choose whether invalid string-to-date casts should error or return NULL, matching whichever ANSI mode the user has configured.
How does this work in terms of nullability? The nullability of dayname seems accurate to Spark (depends on input) but if invalid strings are returned as null, that could violate the contract here. Is it because in Spark that config applies at a previous cast layer instead of during the function execution as in here?
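The flag idea could be sketched like this (everything here is hypothetical: the `EvalMode` enum, the function name, and the hard-coded stand-in for the cast are illustrative, not code from this PR or from mod/pmod):

```rust
#[derive(Clone, Copy)]
enum EvalMode {
    Ansi,   // invalid cast input raises an error (Spark 4 default)
    Legacy, // invalid cast input becomes NULL
}

fn dayname_from_str(s: &str, mode: EvalMode) -> Result<Option<&'static str>, String> {
    const NAMES: [&str; 7] = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"];
    // Stand-in for the real Utf8 -> Date32 cast: only one literal "parses".
    let parsed: Option<i64> = if s == "2010-04-24" { Some(14723) } else { None };
    match (parsed, mode) {
        (Some(days), _) => Ok(Some(NAMES[(days + 3).rem_euclid(7) as usize])),
        (None, EvalMode::Ansi) => Err(format!(
            "[CAST_INVALID_INPUT] The value '{s}' of the type \"STRING\" cannot be cast to \"DATE\""
        )),
        (None, EvalMode::Legacy) => Ok(None), // NULL, matching the current behavior
    }
}
```

Under this shape the nullability question stays coherent: in `Legacy` mode the function is nullable for string inputs (the NULL is produced by the cast step, not by the weekday lookup), while in `Ansi` mode a malformed string never reaches the output as NULL because it errors first.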
Which issue does this PR close?
N/A
Rationale for this change
Add new spark function: https://spark.apache.org/docs/latest/api/sql/index.html#dayname
What changes are included in this PR?
Are these changes tested?
Yes, tests added as part of this PR.
Are there any user-facing changes?
No, this is a new function.