git.net

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[jira] [Commented] (DAFFODIL-258) DFDL Entities: Implement Raw Bytes Entities


    [ https://issues.apache.org/jira/browse/DAFFODIL-258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16438222#comment-16438222 ] 

Michael Beckerle commented on DAFFODIL-258:
-------------------------------------------


This notion of lowering a delimter to iso-8859-1 characters aka bytes is interesting.

Suppose a terminator is in utf-8. Consider:
{code}
val terminatorIn8859 = new String(terminator.getBytes("utf-8"), StandardCharsets.ISO_8859_1)
{code}
So the multi-byte characters in this utf-8 are "lowered" to multiple characters (aka bytes) in ISO-8859-1. These lowered characters can then be inter-mixed with raw bytes - which are not "raw" but valid character codes in iso-8859-1 where every byte is valid.

> DFDL Entities: Implement Raw Bytes Entities
> -------------------------------------------
>
>                 Key: DAFFODIL-258
>                 URL: https://issues.apache.org/jira/browse/DAFFODIL-258
>             Project: Daffodil
>          Issue Type: New Feature
>          Components: DFDL Language, Front End
>            Reporter: Taylor Wise
>            Priority: Major
>             Fix For: deferred
>
>
> Implement raw byte entities.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)