OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

cti-stix message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: Re: [cti-stix] Proposal to establish Sightings (#306) and Relationships (#291) as our official issue topics under active consideration for STIX v2.0


Intelligence is not just information

On Tuesday, 3 November 2015, Jason Keirstead <Jason.Keirstead@ca.ibm.com> wrote:

I understand the theoretical usefulness, but I still stand by the fact that once you get into large scale, it's usefulness as raw data becomes inconsequential... In terms of the graph - I believe sightings is a metric on the edge between an observer and an indicator, and that edge has attributes such as "count" and "last seen". It is not a vertex in and of itself, that would not scale in real world scenarios.You also don't need to store the raw instances of sightings to do the most useful analysis of those metrics (including temporal). I can have a time series database tied to the edge that is storing sighting counts over time, without storing the actual raw sighting instances.

> Think of sightings like case reporting from doctors to the CDC.

The problem is we have to deal with much larger scale than this, and always keep the demands of that scale in mind.

If we only had to deal with ~ 10 billion possible sighting records for an indicator then I would be a happy camper, but that is far from the case.

-
Jason Keirstead
Product Architect, Security Intelligence, IBM Security Systems
www.ibm.com/security | www.securityintelligence.com

Without data, all you are is just another person with an opinion - Unknown


Inactive hide details for "Barnum, Sean D." ---2015/11/03 11:38:59 AM--->Why would I want unique records of all of those sighti"Barnum, Sean D." ---2015/11/03 11:38:59 AM--->Why would I want unique records of all of those sightings, to what purpose is it serving? What peop

From: "Barnum, Sean D." <sbarnum@mitre.org>
To: Jason Keirstead/CanEast/IBM@IBMCA, Terry MacDonald <terry@soltra.com>
Cc: Jerome Athias <athiasjerome@gmail.com>, "Jordan, Bret" <bret.jordan@bluecoat.com>, "cti-stix@lists.oasis-open.org" <cti-stix@lists.oasis-open.org>, "Wunder, John A." <jwunder@mitre.org>, "Taylor, Marlon" <Marlon.Taylor@hq.dhs.gov>, "Davidson II, Mark S" <mdavidson@mitre.org>
Date: 2015/11/03 11:38 AM
Subject: Re: [cti-stix] Proposal to establish Sightings (#306) and Relationships (#291) as our official issue topics under active consideration for STIX v2.0





>Why would I want unique records of all of those sightings, to what purpose is it serving? What people care about in a sighting is a count of indicators, so that they can give increased >significance to those that are currently "live”.

The count is like a heartbeat. It tells you if that TTP is still “alive” but that is really all it does.
It is the actual sighting details that give you deeper insight (“intelligence”) into what is happening and how you might prevent or mitigate it.
There are many forms of analysis that can be done on the sighting information but the most obvious and prevalent have to do with “when” and “who”.
Temporal analysis across the actual sightings can yield all sorts of insight beyond just “alive” or “dead”.
Similarly, analysis of who is sighting the indicator and when can give very valuable insight into victim targeting, who is being affected that might not know it yet and who will likely be affected next.
If the sightings include details of what was actually observed rather than just a “matched pattern” count this information can also be very valuable in understanding the nature of the TTP and how variations of it may be being applied to different subsets of the victim targeting pool.

Think of sightings like case reporting from doctors to the CDC. If you want to know if a potential contagion is something to worry about then counts give you the first measure but if you want to actually study the epidemiology, know how fast and how far it is spreading, know where it will likely spread next, know what sort of victims are most susceptible, know which methods are successful in slowing or stopping it and want to get ahead of it, you will need the actual “sightings”.

sean


From: Jason Keirstead <Jason.Keirstead@ca.ibm.com>
Date:
Tuesday, November 3, 2015 at 8:48 AM
To:
Terry MacDonald <terry@soltra.com>
Cc:
Jerome Athias <athiasjerome@gmail.com>, "Jordan, Bret" <bret.jordan@bluecoat.com>, "cti-stix@lists.oasis-open.org" <cti-stix@lists.oasis-open.org>, John Wunder <jwunder@mitre.org>, "Taylor, Marlon" <Marlon.Taylor@hq.dhs.gov>, Mark Davidson <mdavidson@mitre.org>, "Barnum, Sean D." <sbarnum@mitre.org>
Subject:
RE: [cti-stix] Proposal to establish Sightings (#306) and Relationships (#291) as our official issue topics under active consideration for STIX v2.0

> Um, why do we want the same ID? If the attacker has sent our Org the same pdf 2000 times then don’t we want to record that fact, and link the Sighting objects (with Observable Instances) to the Incident object with 2000 relationship objects? Then > don’t we want to also send that group of Incident, Sightings, Observables and Relationships to others in our Threat Sharing group so that they are aware of them? That is accurate.

I am going to humbly suggest that there is no way any large organization has the resources to do what you are suggesting (record unique sighting objects in the graph for every occurrence of an indicator). I could easily have tens of thousands or more sightings of a single indicator in 1 day alone in real-world situations... extrapolate that out to millions of indicators monitored and months of data and you will see how this would impact your graph. Why would I want unique records of all of those sightings, to what purpose is it serving? What people care about in a sighting is a count of indicators, so that they can give increased significance to those that are currently "live". IE - the way I see things happening in most all implementations is the sighting will be ingested, it will be used to increment some counts, and it will then be discarded. I can't possibly see any implementation storing raw sightings, at least not at an enterprise scale, unless it has some arbitrary cap like "store raw sightings for 24 hours and then discard"


-
Jason Keirstead
Product Architect, Security Intelligence, IBM Security Systems
www.ibm.com/security | www.securityintelligence.com

Without data, all you are is just another person with an opinion - Unknown


Inactive hide details for Terry MacDonald ---2015/10/30 07:32:02 PM---Um, why do we want the same ID? If the attacker has sent Terry MacDonald ---2015/10/30 07:32:02 PM---Um, why do we want the same ID? If the attacker has sent our Org the same pdf 2000 times then don’t

From:
Terry MacDonald <terry@soltra.com>
To:
"Jordan, Bret" <bret.jordan@bluecoat.com>, Jason Keirstead/CanEast/IBM@IBMCA
Cc:
"Wunder, John A." <jwunder@mitre.org>, Mark Davidson <mdavidson@mitre.org>, "Sean D. Barnum" <sbarnum@mitre.org>, Jerome Athias <athiasjerome@gmail.com>, "Taylor, Marlon" <Marlon.Taylor@hq.dhs.gov>, "cti-stix@lists.oasis-open.org" <cti-stix@lists.oasis-open.org>
Date:
2015/10/30 07:32 PM
Subject:
RE: [cti-stix] Proposal to establish Sightings (#306) and Relationships (#291) as our official issue topics under active consideration for STIX v2.0





Um, why do we want the same ID? If the attacker has sent our Org the same pdf 2000 times then don’t we want to record that fact, and link the Sighting objects (with Observable Instances) to the Incident object with 2000 relationship objects? Then don’t we want to also send that group of Incident, Sightings, Observables and Relationships to others in our Threat Sharing group so that they are aware of them? That is accurate.


If we are worried about the size of storing the PDF multiple times, then it is up to the implementation to recognize that the MD5 of the attachment item is the same and then actually only store it once (just like MS Exchange servers have been doing since mid-2000’s).


How do we identify to others that the above data came from us?


If the ID of the object just generated from the <HashofContent> then there is no easy way to do this. If the ID of the object generated from the namespace.<HashofContent> then we have more chance.


But what happens if we decide to update the Incident? The ID is now namespace.<NewHashofContent>. Now how do we de-duplicate? Do we now have to puclish a relationship object explicitly stating the is a replacement object for the namespace.<HashofContent> object?


And what about relationship objects? Part of the power of separate top-level objects is that we can now just tell people about the relationship, but we can keep the actual data node it refers to a secret. Therefore in some implementations the only link to tie relationships together is the fact both relationships share an ID:


e.g
RelationshipA (src: CampaignA -> Threat ActorA)
RelationshipB (src: IndicatorA -> CampaignA)
RelationshipC (src: IndicatorB -> CampaignA)


The recipient may not have the CampaignA data or ThreatActorA data, but they will still know that the IndicatorA and IndicatorB are related to the same campaign thanks to the relationship contains in the same IDs.
This completely breaks if the ID’s change over time.

We need an ID solution that:

To go over the FW use case again

At this point, the main taxi repo knows that the File objects are all related.


Cheers


Terry MacDonald

Senior STIX Subject Matter Expert

SOLTRA
| An FS-ISAC and DTCC Company
+61 (407) 203 206 |
terry@soltra.com


From:
Jordan, Bret [mailto:bret.jordan@bluecoat.com]
Sent:
Saturday, 31 October 2015 4:54 AM
To:
Jason Keirstead <Jason.Keirstead@ca.ibm.com>
Cc:
Wunder, John A. <jwunder@mitre.org>; Terry MacDonald <terry@soltra.com>; Mark Davidson <mdavidson@mitre.org>; Sean D. Barnum <sbarnum@mitre.org>; Jerome Athias <athiasjerome@gmail.com>; Taylor, Marlon <Marlon.Taylor@hq.dhs.gov>; cti-stix@lists.oasis-open.org
Subject:
Re: [cti-stix] Proposal to establish Sightings (#306) and Relationships (#291) as our official issue topics under active consideration for STIX v2.0

Lets run the FW use case to the ground, since most everyone should understand it...



FW 1 see a series of weaponized PDFs come down. Say it sees the same Weaponized PDF 2,000 times over a period of 3 days. A large phishing attack with a lot of click happy users.


1) Now it is highly unlikely with the current model that the FW will remember and use the same ID value (UUID) for each Indicator+Observable+MAEC data blob it issues for this Weaponized PDF. In fact, it will probably have 2,000 different UUID IDs for the same Indicator.


2) Now when you compound this by 60,000 client in the network issuing Sightings, this becomes to blow up quickly.


Maybe... Just maybe.... The FW could take the JSON Indicator that it is going to issue and hash the data blob and use that hash as the ID. Then at least each FW that is running the same code and is seeing basically the same thing with the same amount of data-enrichment, will issue the same ID value.


We will have a totally different problem in TAXII Land in the Query REST API. Because you will probably want to do something like:


/t2/query/indicator/file_name/FreeFood.pdf
or
/t2/query/indicator/file_hash/<some file hash of the PDF>



Thanks,


Bret




Bret Jordan CISSP

Director of Security Architecture and Standards | Office of the CTO
Blue Coat Systems

PGP Fingerprint: 63B4 FC53 680A 6B7D 1447 F2C0 74F8 ACAE 7415 0050
"Without cryptography vihv vivc ce xhrnrw, however, the only thing that can not be unscrambled is an egg."



[attachment "graycol.gif" deleted by Jason Keirstead/CanEast/IBM]



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]