Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Define mismatches between DOI levels and log levels #40

Open
khatchad opened this issue Jun 20, 2018 · 14 comments
Open

Define mismatches between DOI levels and log levels #40

khatchad opened this issue Jun 20, 2018 · 14 comments
Assignees

Comments

@khatchad
Copy link
Member

khatchad commented Jun 20, 2018

How do we relate these two concepts of "interest?" The hypothesis we have is that there should be a correlation between the concept of DOI in general to the interestness of the log statement, i.e., it's level.

┆Issue is synchronized with this Asana task

@khatchad
Copy link
Member Author

Can you make some kind of table here?

DOI range Log Level
Less than 10 CRITICAL
5 < x < 10 WARNING

@khatchad
Copy link
Member Author

Another approach would be to use a machine learning technique where you learn how the developer changes the log levels manually. When they change it manually, you examine what the DOI value is at that time.

@khatchad khatchad added this to the Transformation milestone Jun 20, 2018
@yiming-tang-cs
Copy link
Contributor

Can you make some kind of table here?

I've thought about this problem. Because different developers have different programming styles/ways /levels, the same DOI value could mean the different interests for different developers. The transformation depends on how the developers programming and how they activate the tasks. In this case, I think I should have a general solution. My idea is to get the highest and lowest DOI value, then divide the DOI range into 7 partitions (because of 7 logging levels). How do you think?

@khatchad khatchad mentioned this issue Jun 25, 2018
@khatchad khatchad assigned khatchad and unassigned yiming-tang-cs Jun 25, 2018
@khatchad
Copy link
Member Author

➤ Raffi Khatchadourian commented:

Let me think about this. In the meantime, could you have a boolean returning method that returns true if there is a mismatch and false otherwise? Right now, it can just return false. Based on the return value of this method, which we will fill in later, you should be able to action.

@khatchad khatchad assigned yiming-tang-cs and unassigned khatchad Jun 25, 2018
@khatchad
Copy link
Member Author

➤ Raffi Khatchadourian commented:

I think it is a great idea. Please proceed.

@khatchad
Copy link
Member Author

Please remember to document this decision in the paper draft if you decide to stick with it.

@yiming-tang-cs yiming-tang-cs mentioned this issue Jun 27, 2018
@yiming-tang-cs
Copy link
Contributor

If all DOI values are same, what should I do? Can I tell the developers that I cannot get enough data to analyze?

@khatchad
Copy link
Member Author

Hm. That is making me think twice about this strategy. Please make it very modular so that the logic is in one place and we can change it very easily. The comparison should be to the logging level. I don't think it really is relative to all logging statements. Do you agree?

@khatchad
Copy link
Member Author

I think you may need to go with the machine learning approach I mentioned earlier, or need to examine some history of DOI values. Using the ML approach, you probably won't have DOI information in the training set, which would mean that you would need some other way of determining DOI (like git history), which we don't have yet. So, I'd say go with the original plan of hard coding until we can figure it out.

@yiming-tang-cs
Copy link
Contributor

Hm. That is making me think twice about this strategy. Please make it very modular so that the logic is in one place and we can change it very easily. The comparison should be to the logging level. I don't think it really is relative to all logging statements. Do you agree?

What do you mean " I don't think it really is relative to all logging statements"? So do you think I do not need to get MAX and MIN DOI from all logging statements?

@yiming-tang-cs
Copy link
Contributor

I think you may need to go with the machine learning approach I mentioned earlier, or need to examine some history of DOI values. Using the ML approach, you probably won't have DOI information in the training set, which would mean that you would need some other way of determining DOI (like git history), which we don't have yet. So, I'd say go with the original plan of hard coding until we can figure it out.

I agree. I've also thought about git history. I think I cannot get DOI value from git history because one commit in the git history is not equal to one event in the DOI model. One commit could be generated by several events. We may need other ways to get training set.

@khatchad
Copy link
Member Author

khatchad commented Jun 29, 2018 via email

@khatchad
Copy link
Member Author

khatchad commented Jul 2, 2018 via email

@khatchad
Copy link
Member Author

Looks like we have an issue here.

@khatchad khatchad reopened this Jan 12, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants