OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

cti message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: RE: [cti] Idea for Internationalization (was: Draft tranche plan for achieving our July target date for draft specs (STIX 2.0, TAXII 2.0, CybOX 3.0))


 

Five minutes before this email, I was CCed on a Japanese translation of an advisory we published earlier today.

 

Background:  We have FS-ISAC members in Japan, as well as a partner org in Japan that we work with.

 

We have a Japanese translator on staff, who identifies the important advisories, and translates enough of them into Japanese so the recipients can evaluate the applicability and importance.  The original, English verison is also included, so if the recipient deems it applicable, he/she can translate the rest.

 

This is all for human-readable reports, but the use case seems similar.

 

Proposal:

 

1.        Add an optional language tag in all top level constructs. 

2.       Add an optional Alternate Language tag to Relationship objects.

3.       Producers can create multiple language-specific versions of whatever top-level objects they wish. 

4.       Producers can create Alternate Language relationships between these alternate language objects.

5.       Consumers can choose to maintain alternate language versions of the objects, or can choose to maintain some or all of the alternate language versions.

6.       If the consumer chooses to maintain Alternate Languages, the Alternate Language relationship objects would support the relationship between the alternate language versions of the same object.

 

Use Cases:

 

1.        I produce content in English, but have Japanese constituents.  I publish everything in English, and a subset of the info in Japanese.  For those objects that I also publish in Japanese, I link the English and Japanese versions together with an Alternate Language relationship.

2.       I consume the content in #1, and English is my primary language.  Upon receipt, I discard the Japanese versions and the alternate language relationships.

3.       I consume the content in #1, but Japanese is my primary language.  Upon receipt, I consume both the English and Japanese versions.  When both exist for a given item, I display the Japanese version first, and provide a link to the English version.  When only an English version exists, I display the English version.

 

For consideration,

 

Chris Ricard

FS-ISAC

 

 

 


From: Jordan, Bret
Sent: Monday, February 1, 2016 9:02 PM
To: Masuoka, Ryusuke
Cc: cti@lists.oasis-open.org
Subject: Re: [cti] Draft tranche plan for achieving our July target date for draft specs (STIX 2.0, TAXII 2.0, CybOX 3.0)

 

Thanks for the feedback.  The tear line I am trying to figure out is where is this a specification issue and where is it an implementation issue.  

 

One idea that we have tossed around on Slack is the idea that each top level object (TLO) would have a field called "lang".  This would be the language that the object is written in.  This would enable tools to select and filter by a language.  

 

Then given the fact that only a handful of fields for a given TLO have the ability to be translated, you are not going to translate an IP address for example, we have tossed around the idea of creating a "translation" object that could be sent either with the original TLO or separately. 

 

Going down the path this way, though I am not yet advocating that is the best way, would allow organizations to do very interesting things with CTI data:

 

1) A threat intel provider could issue TLOs in language specific versions if they wanted.

 

2) A threat intel provider could produce language translations and attach them to the TLO.  

 

3) End users could augment or add their own translations without needing to re-release the entire TLO and thus avoid versioning issues.

 

There was some initial concern about this model as some believe it might have issues with versioning.  But I do not think so, as you would not want translated objects to auto point to a new version.  They would be tied at the hip, to the version that were created for.

 

The reason for looking at doing something like this is to avoid need for turning every String field in the serialization in to an array of objects.  

 

What would you think of something like this?

 

Thanks,

 

Bret

 

 

 

Bret Jordan CISSP

Director of Security Architecture and Standards | Office of the CTO

Blue Coat Systems

PGP Fingerprint: 63B4 FC53 680A 6B7D 1447  F2C0 74F8 ACAE 7415 0050

"Without cryptography vihv vivc ce xhrnrw, however, the only thing that can not be unscrambled is an egg." 

 

On Feb 1, 2016, at 18:29, Masuoka, Ryusuke <masuoka.ryusuke@jp.fujitsu.com> wrote:

 

Hi, Bret,

 

I guess it depends. But what I see is scenarios like the following:

 

- A Japanese entity receives CTI information pieces in English.

  The entity determines some of them are important/critical

  and worth translating them into Japanese, add descriptions in Japanese

and redistribute them to other Japanese entities (if redistribution is allowed).

  The CTIM (CTI Management System) of a receiving party displays

  the Japanese description whenever possible, while allowing access to

  the original English descriptions.

 

- Japanese entities produce CTI in Japanese (not in English, surprise!).

  An entity decides some of them are important/critical and worth

translating them into English, add descriptions in English,

and redistribute them to other countries (if redistribution is allowed).

The CTIM of a receiving party displays  the English description if so set,

while allowing access to the original Japanese (likely more accurate)

descriptions.

 

Regards,

 

Ryu

 

From: Jordan, Bret [mailto:bret.jordan@bluecoat.com] 
Sent: Tuesday, February 02, 2016 10:17 AM
To: Masuoka, Ryusuke/
益岡 竜介; cti@lists.oasis-open.org
Subject: Re: [cti] Draft tranche plan for achieving our July target date for draft specs (STIX 2.0, TAXII 2.0, CybOX 3.0)

 

Some questions:

  1. Will organizations producing threat intelligence produce one incident for each language?  
    1. Or will they produce one big incident that contains all of the languages?
  1. For an indicator with a localized title / description, would a TAXII server just send you the jp version vs the en_us version?  
    1. Or would you expect the TAXII server to send you both?
  1. What would be the expected behavior if you got a version in a language that you did not speak, say Hungarian?

 

 

Thanks,

 

Bret

 

 

 

Bret Jordan CISSP

Director of Security Architecture and Standards | Office of the CTO

Blue Coat Systems

PGP Fingerprint: 63B4 FC53 680A 6B7D 1447  F2C0 74F8 ACAE 7415 0050

"Without cryptography vihv vivc ce xhrnrw, however, the only thing that can not be unscrambled is an egg." 

 

On Feb 1, 2016, at 18:07, Masuoka, Ryusuke <masuoka.ryusuke@jp.fujitsu.com> wrote:

 

Hi,

 

Not UTF-8 thing (I understand most of modern programming languages

and other standards deal with it correctly).

 

It is about having text fields in multiple languages.

For example, descriptions of a package in English and Japanese.

The system will pick which language to display based on

the language code (en or jp) in the field.

 

Is it something already discussed in Slack?

(Sorry if so.)

 

Regards,

 

Ryu

 

From: cti@lists.oasis-open.org [mailto:cti@lists.oasis-open.org] On Behalf Of Jordan, Bret
Sent: Tuesday, February 02, 2016 9:59 AM
To: Masuoka, Ryusuke/
益岡 竜介
Cc: Barnum, Sean D.; cti@lists.oasis-open.org
Subject: Re: [cti] Draft tranche plan for achieving our July target date for draft specs (STIX 2.0, TAXII 2.0, CybOX 3.0)

 

I would really like to understand this... . Do you mean to make sure the text fields are not ASCII so that you can put in other character sets?  JSON gives us UTF-8 by default.  So this alone should make things easier for our international friends..

 

If this is not what you mean.  Please explain and give us some context.  We have had some passionate debates on Slack about this recently, but I feel now, that we do not really understand the problem that we were trying to solve.  Can you help us understand the problem?  What works, what does not work, what you need it to do and why?

 

I really want to make sure our baby works for everyone.  But as I said on Slack, "I do not want to engineer a space ship when all we need is a bike to run to the corner store and get a coke".

 

Thanks,

 

Bret

 

 

 

Bret Jordan CISSP

Director of Security Architecture and Standards | Office of the CTO

Blue Coat Systems

PGP Fingerprint: 63B4 FC53 680A 6B7D 1447  F2C0 74F8 ACAE 7415 0050

"Without cryptography vihv vivc ce xhrnrw, however, the only thing that can not be unscrambled is an egg." 

 

On Feb 1, 2016, at 17:46, Masuoka, Ryusuke <masuoka.ryusuke@jp.fujitsu.com> wrote:

 

Hi,

 

Is there a place for Internationalization of text fields?

I would like very much to see it in STIX 2.0 (or CTI Common?)

and I am willing to contribute.

 

Regards,

 

Ryu

 

From: cti@lists.oasis-open.org [mailto:cti@lists.oasis-open.org] On Behalf Of Barnum, Sean D.
Sent: Tuesday, February 02, 2016 3:49 AM
To: cti@lists.oasis-open.org
Subject: [cti] Draft tranche plan for achieving our July target date for draft specs (STIX 2.0, TAXII 2.0, CybOX 3.0)

 

All,

 

As discussed at the face to face meeting and briefly on the TC monthly call and the list we plan to work toward our aggressive July target date for draft STIX 2.0, TAXII 2.0 and CybOX 3.0 specs utilizing a more product management approach with roughly monthly tranches focused on resolving all in-scope identified issues relevant to a particular capability area.

 

  1. February 29th - Run Indicators to the ground.  Get these fundamentals worked through to enable us to talk to vendor on the RSA show floor about it.  And have something to show them. 
  2. March 31st – Run remaining cross-cutting issues to ground. Run Identity-based Victim, Source and Actor top level abstractions to ground.
  3. April 30th – Run Incidents (investigations) to ground. Run Asset top level abstraction to ground. Run Campaign to ground.
  4. May 31st – Run controlled vocabularies to ground. Run automated COA default extension to ground. Run analytic support (opinions, assertions, hypotheses, etc.) to ground.
  5. June 30th – All other remaining top level elements. Review pass for consistency (field name choices, naming conventions, structure patterns, etc) and quality.

This should cover the existing in-scope issues in a coherent and dependency-aware iterative fashion. For CybOX, the Indicator tranche is likely to cover patterning and key object support decisions with remaining tranches focused on key object refactoring based on decisions from the Indicator tranche.

Please let us  know if you see any issues with this tranche plan.

 

 

The first tranche (Indicators) is the most relevant for now as it begins today.

Below is a draft plan for the Indicator tranche. This draft Indicator Tranche plan is also in the wiki.

This is a very aggressive plan considering the amount of issues to discuss and decide and the limited time to do it.

We will strive to achieve this plan and encourage active collaboration from everyone to help us accomplish it.

If you have comments, feedback or issues with this draft plan please let us know so that we may adapt as appropriate.

 

 

 

Indicator tranche plan

Objective:

To discuss and reach consensus on all in-scope tracker issues for STIX 2.0 that are required to support common indicator use cases.

Target completion date:

February 29, 2016

Proposed workflow:

  • Raise and describe the issue with a brief wiki writeup
  • Discuss issue on list and/or slack (with summaries made on list). Anyone with proposed solution may add details of their proposal (proposed normative text, examples, diagrams, schema,etc clearly marked as a proposal) to the wiki writeup and announce it to the list.
  • Discuss, debate, review proposals, comment as appropriate within defined time window to work towards consensus. 
  • Discuss key issues on weekly working call.
  • If consensus (unanimous or at least no strong objections) reached:
    • Capture normative language in pre-draft spec document
    • Capture consensus changes in JSON Schema implementation
    • Capture consensus changes in UML model
    • Capture statement of consensus in issue tracker
    • Mark issue tracker as “Consensus Achieved”
    • Clearly mark relevant issue wiki pages as “Consensus Achieved” or potentially move them to a separate Consensus repo to avoid confusion
  • If consensus not achieved (strong objection exists) within allowed time window:
    • Discuss and decide whether issue is absolutely necessary for MVP and if not decide to postpone
    • OR

o   Capture current consensus status in issue tracker, mark as “Consensus Stalled”, move on to other issues and revisit the issue during last week of tranche

 



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]