Restore threadFlowLocation.kind #202

ghost · 2018-07-17T22:02:34Z

We had previously removed annotatedCodeLocation.kind, along with the four "kind-dependent" properties target, targetLocation, values, and state. We propose to partially reverse that decision:

- Restore threadFlowLocation.kind, giving it (to start with) the same set of values as the old annotatedCodeLocation.kind.
- Add additional values to describe exceptions (throw, catch, finally, others? e.g. rethrow?)
- Clarify that other values are allowed.
- Do not restore the kind-dependent properties.

@michaelcfanning FYI


fishoak · 2018-10-03T21:17:28Z

I have some general observations about threadFlowLocation.kind and some specific things. I'll start with the general stuff.

My impression is that this is threatening to get very unwieldy. I postulate that it will be impossible to specify a set of kinds in this list that is going to satisfy all parties. I would urge that we find the bare minimum set of kinds and stick with those.

Right now the kinds are a mix of three things: the kind of node in the control flow, the form of data access that occurs, and "declaration", which is neither of the previous two. The only one I have a strong reaction to is that last one. It's OK to have a declaration in the flow if it is also associated with something that is implicitly executed, but then the right thing would be for it to be counted as whatever that executable thing is (e.g., an assignment or a constructor). In C, "int x;" has no executable content and so IMHO should not appear in the flow. The proposed "endScope" kind feels a bit like this too.

The data access kinds "assignment", "usage", and the proposed "alias", "passthrough", and "sanitizer" are starting to feel like overkill.

We now have several ways in which something can be called/entered. There is "functionEnter", but "applicationEntryPoint" is also being proposed, and "catch" is kind of similar in that it is an entry to the body of an exception handler. There are others that one might want, such as a signal handler. In that vein, should the entry point to a thread count as functionEnter? Also, a colleague pointed out that it is often useful to distinguish between the kinds of method being called (constructor, destructor, copy-constructor, method, static method, etc.).

To avoid this, it might be worth considering whether we want to simply have a generic "enter" and "exit", and to rely on the fact that one can have additional kinds in the list, e.g.: ["enter", "function"] or ["exit", "handler"]. This would work for scopes too: ["exit", "scope"] would replace the proposed "endScope".
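The composition idea above can be sketched roughly as follows. This is only an illustration in Python: it assumes a location is a plain dict carrying a list-valued property (called `kinds` here, which is an assumption at this point in the thread), and the helpers `make_location` and `has_kinds`, the file name, and the line numbers are all hypothetical.

```python
def make_location(uri, line, *kinds):
    """Build a dict loosely shaped like a threadFlowLocation (hypothetical helper)."""
    return {"uri": uri, "line": line, "kinds": list(kinds)}


def has_kinds(location, *wanted):
    """True if the location carries every one of the wanted kind values."""
    return set(wanted) <= set(location["kinds"])


# A made-up flow: a generic "enter"/"exit" plus a qualifier, instead of
# dedicated values such as "functionEnter" or "endScope".
flow = [
    make_location("driver.c", 10, "enter", "function"),
    make_location("driver.c", 42, "exit", "scope"),
    make_location("driver.c", 57, "exit", "handler"),
]

# A viewer that only knows the generic value can still group all exits...
exits = [loc["line"] for loc in flow if has_kinds(loc, "exit")]

# ...while one that understands the qualifier can be more specific.
scope_exits = [loc["line"] for loc in flow if has_kinds(loc, "exit", "scope")]
```

The point of the design is that a consumer can fall back to the generic value when it does not recognize the qualifier.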

"continuation" might be problematic. First, that's an overloaded term in PL so it will invite confusion. Second, if you're going to have it, then you will also want a "jump". Plus, it's just the next place in the flow, and I don't see a super good reason for distinguishing either jumps or this continuation from other nodes in the flow.

In the "callReturn" note, I just want to confirm that "terminates" does not mean that the execution terminates but that the particular flow that is encoded has come to an end, possibly because the analyzer has concluded it isn't worth showing it.

ghost · 2018-10-04T00:42:45Z

I agree that this could get unwieldy. I've tried to incorporate everyone's favorite kind values. Let's just stop here, shall we? :-)

I don't feel strongly about declaration -- anybody else have an opinion on that?

I don't really agree with the idea that assignment, usage, alias, passthrough, and sanitizer are overkill if your goal is to track tainted data. "a" is tainted, but we assigned it to "b"; now "b" is tainted. "c" is now an alias for "b", so "c" is tainted. But "c" was sanitized, so we're good.
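To make that chain concrete, here is a small sketch in Python of how a consumer might follow taint through such steps. It is an assumption-laden toy, not how any particular analyzer works: the step tuples and the function `tainted_after` are hypothetical, it tracks variable names only, and it ignores the question of whether sanitizing an alias also cleans the aliased variable.

```python
def tainted_after(steps, initially_tainted):
    """Return the set of tainted names after applying (kind, target, source) steps."""
    tainted = set(initially_tainted)
    for kind, target, source in steps:
        if kind in ("assignment", "alias"):
            # The target takes on the taint status of its source.
            if source in tainted:
                tainted.add(target)
            else:
                tainted.discard(target)
        elif kind == "sanitizer":
            # Sanitizing clears taint on the named variable only.
            tainted.discard(target)
        # Other kinds (e.g. "usage") do not change taint in this toy model.
    return tainted


steps = [
    ("assignment", "b", "a"),   # "a" is tainted, so "b" becomes tainted
    ("alias", "c", "b"),        # "c" aliases "b", so "c" is tainted
    ("sanitizer", "c", None),   # "c" is sanitized
]

result = tainted_after(steps, {"a"})
```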

usage goes as far back as SARIF v1, and I don't remember the justification -- Michael?

Yekaterina explicitly asked for passthrough and endScope, arguing as follows:

> Yes, “passthrough” and “endScope” would be useful to us. The former because we differentiate between just usage (e.g. variable was assigned null and then used, that is, dereferenced) and propagation of taint. The latter is useful for explaining why we report memory and resource leaks.

continuation is a request from the Microsoft Static Driver Verifier team; they use it in their native code flows. What is "PL"? In any case, we do try to avoid overloading terms, but there are only so many words in the language. :-) Do you have an alternative?

I suppose we ought to add jump (even though we already have branch) in case someone uses a goto.

The idea of factoring "enter"/"exit" out from "application", "function", "handler", etc. is intriguing. Michael, what do you think?

Yes, "terminate" would refer to the end of the code flow, not the end of program execution. HOWEVER, please look at the latest change draft. That word no longer appears; I've rewritten the notes in this section.

michaelcfanning · 2018-10-04T00:52:09Z

I think 'usage' is likely something that doesn't have utility for the specific scenario (icon/other kind-specific visualizations) we have in mind. If 'endScope' is interesting then some notion of resource/memory allocation/acquisition may be interesting.

Yes, I agree that simplifying enter/exit may be useful.

ghost · 2018-10-04T16:23:51Z

Actually I take it back about needing jump. A jump is just a branch without a condition, if you take "branch" to mean "execute an instruction other than the next one in order". I recall assembly languages where you had BRZ (branch if zero), BRN (branch if non-zero), and just plain BR (unconditional branch). Or JZ, JNZ, and just plain J if the assembler authors liked "jump" better than "branch".

In any case, I don't think we need both.

fishoak · 2018-10-10T15:55:55Z

> I don't really agree with the idea that assignment, usage, alias, passthrough, and sanitizer are overkill if your goal is to track tainted data. "a" is tainted, but we assigned it to "b"; now "b" is tainted. "c" is now an alias for "b", so "c" is tainted. But "c" was sanitized, so we're good.

These terms have a particular meaning for a form of taint analysis, and they do not necessarily make sense for all kinds of taint analysis. It's fine to have these in the standard, but we need to make it clear that they are very loosely defined and likely incomplete.

> continuation is a request from the Microsoft Static Driver Verifier team; they use it in their native code flows. What is "PL"? In any case, we do try to avoid overloading terms, but there are only so many words in the language. :-) Do you have an alternative?

By "PL" I mean Programming Languages. See https://en.wikipedia.org/wiki/Continuation. I can't find a good alternative wording, and I don't feel strongly about this, so maybe it's fine.

From a graph-theoretic perspective, there's no difference between control passing to the target of an unconditional jump, and control just passing to the next statement in the normal flow. Similarly, the target that immediately follows a loop control predicate evaluating to false is not all that different from the false target of an if statement. As long as the "continuation" kind is not required to be present that's fine. Perhaps the text should say something like this: "These kinds are primarily intended to help a human reader understand the flow of control through the thread, and should not be interpreted as a formal description of the semantics of the path."

> Yes, "terminate" would refer to the end of the code flow, not the end of program execution. HOWEVER, please look at the latest change draft. That word no longer appears; I've rewritten the notes in this section.

OK, that change looks great.

michaelcfanning · 2019-01-25T19:37:57Z

E-BALLOT #3 PROPOSAL

Restore threadFlowLocation.kind, but make it an array "kinds". Make it an open-ended string, but recommend the set of values approved by the TC at the F2F meeting.

SCHEMA CHANGES

In the threadFlowLocation object:
- Remove the kind property (in case it was still around).
- Add a property kinds of type string[], optional, minItems: 0, default: []

The recommended values are:

acquire
release
enter
exit
call
return
branch

implicit
false
true
caution
danger
unknown
unreachable

taint
function
handler
lock
memory
resource
scope
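A consumer might handle the proposed property along these lines. This is a minimal sketch in Python, assuming a threadFlowLocation has been parsed into a dict; the helper names `read_kinds` and `non_recommended` are hypothetical. It treats an absent `kinds` as the default empty array and, because the property is open-ended, reports values outside the recommended set rather than rejecting them.

```python
# The recommended values listed in the proposal above.
RECOMMENDED_KINDS = frozenset({
    "acquire", "release", "enter", "exit", "call", "return", "branch",
    "implicit", "false", "true", "caution", "danger", "unknown", "unreachable",
    "taint", "function", "handler", "lock", "memory", "resource", "scope",
})


def read_kinds(thread_flow_location):
    """Return the kinds array, applying the default [] when the property is absent."""
    kinds = thread_flow_location.get("kinds", [])
    if not isinstance(kinds, list) or not all(isinstance(k, str) for k in kinds):
        raise ValueError("kinds must be an array of strings")
    return kinds


def non_recommended(kinds):
    """List values outside the recommended set; they are allowed, just not recommended."""
    return [k for k in kinds if k not in RECOMMENDED_KINDS]


default_kinds = read_kinds({})
extra = non_recommended(read_kinds({"kinds": ["exit", "signalHandler"]}))
```

Here `signalHandler` is a made-up value standing in for any tool-specific kind.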

ghost · 2019-04-06T17:46:55Z

Approved in e-ballot-3.

ghost added 2.1.0-CSD.1 Will be fixed in SARIF v2.1.0 CSD.1. design-improvement labels Jul 17, 2018

ghost pushed a commit that referenced this issue Aug 15, 2018

Change draft for #202: Restore kind.

185e394

ghost pushed a commit that referenced this issue Aug 15, 2018

Merge pull request #218 from oasis-tcs/users/lgolding/202-restore-kind

a8838ab

Change draft for #202: Restore kind.

ghost mentioned this issue Aug 27, 2018

Did we break code flows in v2? #194

Closed

ghost self-assigned this Sep 24, 2018

ghost added the impact-non-breaking-change label Sep 24, 2018

ghost pushed a commit that referenced this issue Sep 24, 2018

Change draft for #194 and #202: threadFlowLocation changes.

b729af3

ghost added the change-draft-available label Sep 24, 2018

ghost pushed a commit that referenced this issue Oct 3, 2018

#202: Add more threadFlowLocation.kind values.

1c7dc72

ghost pushed a commit that referenced this issue Oct 4, 2018

#202: Add a note number.

af6de27

ghost added the discussion-ongoing label Oct 8, 2018

michaelcfanning removed the change-draft-available label Nov 27, 2018

michaelcfanning added the p1 Priority 1 issue to close label Jan 24, 2019

michaelcfanning added design-approved The TC approved the design and I can write the change draft and removed discussion-ongoing labels Jan 25, 2019

michaelcfanning mentioned this issue Feb 20, 2019

Create set of general icon types for review by TC #260

Closed

ghost added the e-ballot-3 label Mar 18, 2019

ghost pushed a commit that referenced this issue Mar 26, 2019

Change draft + merge for #202: threadFlowLocation.kinds

64f7dfa

ghost added code-flows change-draft-available merged Changes merged into provisional draft. tc-34 labels Mar 26, 2019

ghost pushed a commit that referenced this issue Mar 28, 2019

Reject obsolete change draft for #194/#202.

520312d

ghost added resolved-fixed and removed change-draft-available labels Apr 6, 2019

ghost closed this as completed Apr 6, 2019

ghost mentioned this issue Apr 16, 2019

Feedback from Harlene (MS) #379

Closed

This issue was closed.
